Event box

Machine Learning with Python: Feature Importance and Random Forests

Machine Learning with Python: Feature Importance and Random Forests In-Person / Online

In this workshop, you'll learn how to apply machine learning techniques using Python and scikit-learn, focusing on feature importance and Random Forests. Using the COVID testing dataset, you'll explore how to identify key features that influence outcomes, and how to build and evaluate a Random Forest model for analysis. Throughout the session, we’ll integrate AI-generated code to assist with the modeling process. By the end of the workshop, you’ll have practical experience in using scikit-learn to build predictive models and analyze the significance of different features in the dataset.

Note: This workshop will be offered in person at the UCSF FAMRI Library at Mission Bay and online through UCSF Zoom. You will receive an email from LibCal with room information and Zoom link prior to the start of the workshop. 

 

Workshop Series: Data and Document Analysis with Python, SQL, and AI

This series is designed for researchers, staff, and students who want to learn Python from the ground up, focusing on data analysis, document analysis, and AI-based research. Throughout the workshops, we’ll dive into various data types—including numerical data, health data, text, and images/videos—and explore analysis techniques like regression, classification, and sentiment analysis. We’ll put a particular focus on text analysis, especially analyzing documents from our industry document library.

As programming evolves, so does the way we learn it. While we’ll introduce AI-assisted programming earlier in the series than in past workshop series, we’ll still start with the basics. The early workshops (Intro to Python Parts 1 & 2 and Intro to SQL) will use AI sparingly. As the series moves forward, we’ll dive into topics like web APIs, text analysis, natural language processing, machine learning, regression, document analysis, and AI system interaction, with more AI-driven techniques along the way.
 

 

Date:
Friday, May 16 2025
Time:
9:00am - 11:00am
Time Zone:
Pacific Time - US & Canada (change)
Location:
Virtual
Campus:
Online
Categories:
  Data Science     Data Science > Programming  

Registration is required. There are 19 in-person seats available. There are 39 online seats available.

Event Organizer

Profile photo of Geoffrey Boushey
Geoffrey Boushey