Data Analysis with Python, Pandas, and SQL In-Person / Online
This workshop will be offered on-line over Zoom. All registered participants will recieve a Zoom link after registrations close, typically 24 hours prior to the start of the workshop.
NOTE: Please register for this workshop using your UC or other university email address, if you have one.
Data obtained from the web or other outside sources is often messy and difficult to manage. Pandas, a popular tabular data frame library in Python, and SQL, a structured query language for databases, provide two different but effective tools for getting a better handle on your data. This workshop is designed to introduce you to both tools in the context of managing research data, demonstrate the strengths of each approach, and review how the two approaches can be particularly effective when used in combination.
For more information, please see the course web page at:
This workshop will involve lots of hands on coding wtih SQL and Python. To follow along with the coding exercises, you should be familiar with core Python, at the level covered in Intro to Python Part 1 and Intro to Python Part 2, and basic SQL commands at the level covered in Intro to SQL. If you haven't enrolled in these introductory workshops, you can access the coursework and self-study options on the main DSI Programming web page at:
Digital Humanities Sequence
This sequence is designed for people with no prior programming experience. The workshops start with core python and sql before moving into more advanced or specialized topics in digital humanities, with an emphasis on text analysis and machine learning.
You are welcome to take any of these workshops without attending the entire series. No formal pre-reqs are enforced, but if you are new to Python or SQL, you’ll get the core programming skills necessary for the later workshops by attending Intro to Python Part 1 and Part 2 along with Intro to SQL.
- Intro to Python Part 1:
- Intro to Python Part 2:
- Intro to SQL:
- Data Analysis with Python, Pandas, and SQL:
- Intro to Unix:
- Reading Data from APIs with Python, Pandas, and SQL:
- Machine Learning with Python and Scikit-Learn:
- Python Background for Text Analysis and Natural Language Processing:
- Machine Learning for Document Classification and Sentiment Analysis:
- Google Cloud Tools, BiqQuery, and VertexAI:
- Friday, Mar 15 2024
- 9:00am - 11:00am
- Time Zone:
- Pacific Time - US & Canada (change)