120 likes | 345 Views
Open Refine: Clean your messy data. Valrie Minson Outreach Librarian for Agricultural Sciences. OpenRefine. OpenRefine.org ( Google Refine) Open Source Runs locally on computer (privacy) Looks like Excel or Google Spreadsheets Data: clean it, transform it, extend it !.
E N D
Open Refine:Clean your messy data Valrie Minson Outreach Librarian for Agricultural Sciences
OpenRefine • OpenRefine.org(Google Refine) • Open Source • Runs locally on computer (privacy) • Looks like Excel or Google Spreadsheets • Data: clean it, transform it, extend it!
Data issues • Human/free-text errors • Inconsistent journal titles • Redundant citations • Data (volume/issue) in wrong fields • ARTICLES IN CAPS LOCK
Filtering/faceting • Use filters or facets to select subsets of data • Journal of Agriculture • Journ of Agriculture • Journla of Agriculture • Agriculture, Journal of • Not just for Messy data
OpenRefine Expression Language (GREL) • Transform list into table (create columns) • Merge datasets • Export into Excel, CSV, OpenOffice, Google Spreadsheets, JSON, RDF, etc. • Use with other systems (Excel, SPSS, etc.) • Great videos
OpenRefine.org Valrie Minson vdavis@ufl.edu