Data Cleaning with OpenRefine - Online In-Person
Overview
Got messy data? OpenRefine is a powerful, free, open-source tool for cleaning and transforming data in a way that is easy to reproduce. If you have ever struggled to remember exactly how you modified your data in Excel, give OpenRefine a try!
Learning Objectives
By the end of the class you should be able to:
- Understand where OpenRefine lives on your computer
- Use OpenRefine to:
  - Facet data
  - Cluster data
  - Split data into multiple columns
  - Undo changes
  - Export your cleaned data
  - Save your cleaning scripts so they can be re-used
Prerequisites / Preparation
Please complete the following tasks before coming to class:
- Download OpenRefine here: http://openrefine.org/download.html (if you have trouble with the download, see the setup instructions at https://datacarpentry.org/OpenRefine-ecology-lesson/setup.html or email me at ariel.deardorff@ucsf.edu)
- Download the 2 class data files from the course website
Instructors
Ariel Deardorff, Data Services Librarian, UCSF Library
This class will be held online via Zoom. The Zoom link will be sent out a week in advance.
When you register, the Library reserves class space for you. Since many of our classes have a waitlist, please cancel your registration if you can no longer attend.
Related LibGuide: Reproducible Data Management by Ariel Deardorff
- Date:
- Tuesday, Jul 16 2019
- Time:
- 9:30am - 11:00am
- Time Zone:
- Pacific Time - US & Canada
- Categories:
- Data Science, Data Science > Data Management