Event box

Data Cleaning with Open Refine

Got messy data? Open Refine is a powerful, free open-source software tool for cleaning and transforming data in a way that is easy to reproduce. If you have ever struggled to remember exactly how you modified your data in Excel, give Open Refine a try!

Materials and set up instructions available in the CLE: Data Cleaning with Open Refine

 In this class we will cover the basics of:

  • importing data
  • faceting data to discover patterns
  • clustering data (as in cases where NYC and New York City should be the same name)
  • splitting data into multiple columns
  • exporting clean data

Please bring your own laptop!

When you register, the Library reserves class space for you. Since many of our classes have a waitlist, please cancel your registration if you can no longer attend.

Related LibGuide: Data Sharing & Data Management by Ariel Deardorff

Date:
Thursday, February 15, 2018
Time:
1:00pm - 2:30pm
Location:
CL214
Campus:
Parnassus
Categories:
  Data Science Initiative     Data Sharing and Data Management  
Registration has closed.

Event Organizer

Profile photo of Ariel Deardorff
Ariel Deardorff