Event date
Thursday, May 11, 2017
Event time
10:00am to 12:00pm
Barrows 371: D-Lab Breakout Room

In this workshop we will cover two main supervised text analysis methods, the dictionary method, and supervised classification. We will use list comprehension to implement the dictionary method, using sentiment analysis as our example. Using the Python library scikit-learn, we will also implement a few supervised classification techniques, including Naive Bayes and Support Vector Machines. Specific skills covered include a) measuring themes in text using dictionaries, b) feature selection, c) Support Vector Machines, d) Naive Bayes, e) cross-validation, and f) feature importance.
Prior knowledge: Basic familiarity with Python is required if you wish to follow along with the tutorial. Completion of D-Lab's Python FUN!damentals workshop series will be sufficient.
This workshop is one of a four-part series that will prepare participants to move forward with text analysis research, with a special focus on humanities and social science applications. Please register for each workshop separately. The other workshops in the series are listed below:
Text Analysis Fundamentals: Methods and Approaches
Text Analysis Fundamentals: Basic Tools and Techniques
Text Analysis Fundamentals: Unsupervised Approaches