Employing Google Refine to Publish Linked Data Fadi Maali and Richard Cyganiak
Road map • Self-Service Linked Government Data • Google Refine • RDF Export Extension • RDF Reconciliation Extension
We need government data as Linked Data, not just raw data ... aha, and of good quality!
We want governments to provide Linked Data, not just raw data, and of good quality. That takes: TIME • MONEY • SKILLS
DIY Recipe • Publishers provide an RDF representation of their catalogues • Tool support to select datasets of interest and put them into RDF • User shares the RDF data
DIY Recipe • Publishers provide an RDF representation of their catalogues (dcat) • Tool support to select datasets of interest and put them into RDF • User shares the RDF data
DIY Recipe • Publishers provide an RDF representation of their catalogues (dcat) • Tool support to select datasets of interest and put them into RDF (Google Refine + RDF export extension + RDF reconciliation extension) • User shares the RDF data
DIY Recipe • Publishers provide an RDF representation of their catalogues (dcat) • Tool support to select datasets of interest and put them into RDF (Google Refine + RDF export extension + RDF reconciliation extension) • User shares the RDF data along with the conversion process description (Provenance & Reproducibility)
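The first step of the recipe, a catalogue entry described in RDF, might look roughly like the sketch below. It assumes the W3C DCAT namespace (http://www.w3.org/ns/dcat#) and Dublin Core terms; the function names and the "/distribution" URI pattern are hypothetical and only illustrate the idea, not how any particular publisher builds its catalogue.

```python
DCAT = "http://www.w3.org/ns/dcat#"          # assumed: W3C DCAT namespace
DCT = "http://purl.org/dc/terms/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def escape_literal(text):
    """Escape a string for use inside an N-Triples literal."""
    return (text.replace("\\", "\\\\").replace('"', '\\"')
                .replace("\n", "\\n").replace("\r", "\\r"))


def dataset_to_ntriples(dataset_uri, title, download_url):
    """Describe one catalogue entry as a dcat:Dataset with one distribution."""
    dist_uri = dataset_uri + "/distribution"  # hypothetical URI pattern
    triples = [
        (dataset_uri, RDF_TYPE, f"<{DCAT}Dataset>"),
        (dataset_uri, DCT + "title", '"' + escape_literal(title) + '"'),
        (dataset_uri, DCAT + "distribution", f"<{dist_uri}>"),
        (dist_uri, RDF_TYPE, f"<{DCAT}Distribution>"),
        (dist_uri, DCAT + "downloadURL", f"<{download_url}>"),
    ]
    # One N-Triples statement per line: <subject> <predicate> object .
    return "\n".join(f"<{s}> <{p}> {o} ." for s, p, o in triples)


print(dataset_to_ntriples(
    "http://example.org/dataset/1",           # placeholder URIs
    "Example dataset",
    "http://example.org/files/example.csv",
))
```

A user could then pick a dataset by following its download URL and load the file into the conversion tool.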
Road map • Self-Service Linked Government Data • Google Refine • RDF Export Extension • RDF Reconciliation Extension
Google Refine • "Google Refine is a power tool for working with messy data, cleaning it up, transforming it from one format into another, extending it with web services, and linking it to databases like Freebase"* • Desktop application that a user interacts with using a web browser • Open Source (New BSD License) • Extensible • * http://code.google.com/p/google-refine/
Demo: Top 100 UK universities for IT (Guardian data blog http://www.guardian.co.uk/news/datablog/2009/jun/02/universityguide-choosingadegree)
Demo: Top 100 UK universities for Electronic Engineering (Guardian data blog http://www.guardian.co.uk/news/datablog/2009/jun/02/universityguide-choosingadegree)
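What the export step of such a demo amounts to can be sketched in a few lines: each table row becomes a resource with a handful of triples. This is only an illustration and not the extension's own code. The column names "Rank" and "Institution", the base URI, and the rank property are assumptions, and the sample rows are placeholders rather than Guardian data.

```python
import csv
import io
from urllib.parse import quote

BASE = "http://example.org/university/"       # hypothetical base URI
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
RANK = "http://example.org/vocab#rank"        # hypothetical property
XSD_INT = "http://www.w3.org/2001/XMLSchema#integer"


def rows_to_ntriples(csv_text):
    """Turn each CSV row into two triples: a label and an integer rank."""
    lines = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        name = row["Institution"].strip()
        rank = int(row["Rank"])
        # Mint a subject URI from the institution name.
        subject = BASE + quote(name.replace(" ", "_"))
        label = name.replace("\\", "\\\\").replace('"', '\\"')
        lines.append(f'<{subject}> <{RDFS_LABEL}> "{label}" .')
        lines.append(f'<{subject}> <{RANK}> "{rank}"^^<{XSD_INT}> .')
    return lines


# Placeholder rows, not taken from the Guardian table.
sample = "Rank,Institution\n1,Example University A\n2,Example University B\n"
for line in rows_to_ntriples(sample):
    print(line)
```

Minting URIs from names is the weak point of this approach, which is where reconciliation against existing Linked Data comes in.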
RDF Reconcile Extension (architecture diagram) • Google Refine talks to the RDF Reconcile Extension • Reconciliation sources: Sindice search API • Silk Server (Silk LSL) • Crafted RDF • SPARQL: SPARQL endpoint • Hybrid SPARQL: SPARQL endpoint with fulltext extension
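Reconciling a cell value against a SPARQL endpoint can be pictured as two steps: ask the endpoint for entities whose label resembles the value, then rank the returned candidates. The sketch below is an assumption about the general shape of that process, not the extension's actual query or scoring. It builds a SPARQL 1.1 query string (no network call is made) and ranks candidates with Python's difflib; the function names and the example type URI are hypothetical.

```python
from difflib import SequenceMatcher


def build_candidate_query(label, type_uri=None, limit=10):
    """Build a SPARQL 1.1 query for entities whose rdfs:label contains `label`."""
    escaped = label.replace("\\", "\\\\").replace('"', '\\"')
    # Optionally restrict candidates to one class.
    type_clause = f"  ?entity a <{type_uri}> .\n" if type_uri else ""
    return (
        "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
        "SELECT DISTINCT ?entity ?label WHERE {\n"
        + type_clause +
        "  ?entity rdfs:label ?label .\n"
        f'  FILTER(CONTAINS(LCASE(STR(?label)), LCASE("{escaped}")))\n'
        "}\n"
        f"LIMIT {int(limit)}"
    )


def rank_candidates(cell_value, candidates):
    """Sort (uri, label) pairs by case-insensitive similarity to the cell value."""
    scored = [
        (uri, label,
         SequenceMatcher(None, cell_value.lower(), label.lower()).ratio())
        for uri, label in candidates
    ]
    return sorted(scored, key=lambda c: c[2], reverse=True)


print(build_candidate_query("Aspirin", type_uri="http://example.org/Drug"))
```

A plain substring filter can be slow on large endpoints, which is presumably why the diagram distinguishes endpoints with a fulltext extension.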
Benchmarking… Reconciling DailyMed against DBpedia (SPARQL endpoint)
Benchmarking… Reconciling DailyMed against SIDER (RDF dump)
Conclusion • Publishers provide an RDF representation of their catalogues (dcat) • Tool support to select datasets of interest and put them into RDF (Google Refine + RDF export extension + RDF reconciliation extension) • User shares the RDF data
Links • Google Refine: http://code.google.com/p/google-refine/ • RDF Export Extension: http://lab.linkeddata.deri.ie/2010/grefine-rdf-extension/ • RDF Reconciliation Extension: will be released soon…