Event box

Data Cleaning with Open Refine - Online

Data Cleaning with Open Refine - Online


Got messy data? Open Refine is a powerful, free open-source software tool for cleaning and transforming data in a way that is easy to reproduce. If you have ever struggled to remember exactly how you modified your data in Excel, give Open Refine a try!

Learning Objectives

By the end of the class you should be able to:

  • Understand where OpenRefine lives on your computer
  • Use OpenRefine to:
    • Facet data
    • Cluster data
    • Split data into multiple columns
    • Undo changes
  • Export your cleaned data
  • Save your cleaning scripts so they can be re-used 

Prerequisites / Preparation

Please complete the following tasks before coming to class:

  1. Download OpenRefine here: http://openrefine.org/download.html If you are having trouble with the download, you can refer to setup instructions here: https://datacarpentry.org/OpenRefine-ecology-lesson/setup.html or email me at ariel.deardorff@ucsf.edu

  2. Download the 2 class data files from the course website


Ariel Deardorff, Data Services Librarian, UCSF Library

This will be an online class via Zoom conferencing. The zoom link will be sent out a week in advance.

When you register, the Library reserves class space for you. Since many of our classes have a waitlist, please cancel your registration if you can no longer attend.

Related LibGuide: Reproducible Data Management by Ariel Deardorff

Tuesday, July 16, 2019
9:30am - 11:00am
  Data Science Initiative     Data Science Initiative > Data Management  
Registration has closed.

Event Organizer

Profile photo of Ariel Deardorff
Ariel Deardorff

Data Services Librarian