Event box

Machine Learning for Document Classification and Sentiment Analysis Online

Note: Please register for this workshop with your UC email address. 


This workshop will review the process of preparing, cleaning, and formatting text for natural language processing projects. Topics will include stop words, n-grams, stemming, lemmatization, and other techniques for pre-processing text. Although we will populate and evaluate a machine learning text classification model, empahsis will be on the programming work involved in preparing text to build and populate the model rather than algorithms or analysis.

This workshop will take place online over UCSF Zoom. Registered participants will recieve an email through LibCal with connection information the day before the workshop.


Familiarity with Python.


Geoffrey Boushey with the UCSF Library Data Science Initiative

Digital Humanities Sequence

This sequence is designed for people with no prior programming experience. The workshops start with core python and sql before moving into more advanced or specialized topics in digital humanities, with an emphasis on text analysis and machine learning. 

You are welcome to take any of these workshops without attending the entire series. No formal pre-reqs are enforced, but if you are new to Python or SQL, you’ll get the core programming skills necessary for the later workshops by attending Intro to Python Part 1 and Part 2 along with Intro to SQL. 

Registration Links: 

Friday, Apr 26 2024
9:00am - 11:00am
Time Zone:
Pacific Time - US & Canada (change)
This is an online event. Event URL will be sent via registration email.
  Data Science > Programming  

Registration is required. There are 30 seats available.

Event Organizer

Profile photo of Geoffrey Boushey
Geoffrey Boushey