Data Cleaning with Open Refine - Online
Got messy data? Open Refine is a powerful, free open-source software tool for cleaning and transforming data in a way that is easy to reproduce. If you have ever struggled to remember exactly how you modified your data in Excel, give Open Refine a try!
By the end of the class you should be able to:
- Understand where OpenRefine lives on your computer
- Use OpenRefine to:
- Facet data
- Cluster data
- Split data into multiple columns
- Undo changes
- Export your cleaned data
- Save your cleaning scripts so they can be re-used
Prerequisites / Preparation
Please complete the following tasks before coming to class:
Download OpenRefine here: http://openrefine.org/download.html If you are having trouble with the download, you can refer to setup instructions here: https://datacarpentry.org/OpenRefine-ecology-lesson/setup.html or email me at email@example.com
Download the 2 class data files from the course website
Ariel Deardorff, Data Services Librarian, UCSF Library
This will be an online class via Zoom conferencing. The zoom link will be sent out a week in advance.
When you register, the Library reserves class space for you. Since many of our classes have a waitlist, please cancel your registration if you can no longer attend.
Related LibGuide: Reproducible Data Management by Ariel Deardorff
- Tuesday, July 16, 2019
- 9:30am - 11:00am