Introduction to Text Mining for Social Scientists

Copy the text below to embed this resource

2020-02-14 (Creation date: 2020-02-14)
Main contributor
Helge-Johannes Marahrens
Textual data are central to the social sciences. However, they often require several pre-processing steps before they can be utilized for statistical analyses. This workshop introduces a range of Python tools to clean, organize, and analyze textual data. It is intended for researchers who are new to working with textual data, but are familiar with Python or have completed the Introduction to Python workshop. Computers with Python pre-loaded are available in the SSRC on a first-come, first-served basis.
Indiana University Workshop in Methods
Workshop in Methods
Social Science Research Commons
Related Item
Accompanying materials on IUScholarWorks 


Helge-Johannes Marahrens is a fourth year doctoral student in the department of Sociology at Indiana University. He recently earned an MS in Applied Statistics and is currently working toward a PhD in Sociology. His research interests include cultural consumption, stratification, and computational social science with a particular focus on Natural Language Processing (NLP).

Access Restrictions

This item is accessible by: the public.