Unlocking image, audio, and video data in the Industry Documents Library: a Python based, open source stack

Speaker: Geoffrey Boushey (UC San Francisco)

TA: Marlene Lin (UC San Francisco)

The Industry Documents Library (IDL) is a digital archive of documents created by industries that influence public health, hosted by the University of California, San Francisco Library. The archive contains millions of video, audio, and image files from the tobacco, opioid, fossil fuel, drug, and food industries, including advertisements, legal depositions, internal marketing documents, public health campaigns, and other historical records. This session will start with a presentation and overview of the contents of the IDL and its search interface. Next, we will introduce a Python-based, open-source stack that researchers can use to analyze, transcribe, and categorize data in IDL video, audio, and image files. Participants will have an opportunity to try out these technologies during the workshop, but the primary focus will be an overview of the available tools and data, and participation in the programming sections is optional.

To participate in the optional coding section of this workshop, you'll need to be able to run Python code in an interactive notebook. A Jupyter notebook on your laptop will work. Alternatively, you can use Google Colab (available through Google Docs).

NOTE: Please register with your UC email if you have one; this will help us evaluate participation from different UC campuses.

This event is hosted as part of UC Love Data Week 2025 (UCLDW 2025), a cross-UC grassroots initiative dedicated to sharing and learning about all things data. For workshop-specific questions, please reach out to the workshop instructors. Can’t attend this event? Recordings from UCLDW 2025 will be available at a later date on the UCLDW Zenodo.

Date: Friday, Feb 14, 2025
Time: 3:00pm - 4:15pm
Time Zone: Pacific Time - US & Canada
Location: Virtual
Campus: Online
Online: This is an online event. Event URL will be sent via registration email.
Categories: Data Science, Data Science > Programming

Registration is required. There are 50 seats available.

Event Organizer

Geoffrey Boushey