Open Data in Agriculture. Hands-on with data infrastructures that can power your agricultural data products. 12/12/2013 Athens, Greece. Supported by EU projects. How to get datasets for agricultural sciences from the CIARD RING portal. Valeria Pesce
How to get datasets for agricultural sciences from the CIARD RING portal Valeria Pesce Global Forum on Agricultural Research (GFAR)
Intro
• The CIARD RING is a global registry of information sources and datasets in agriculture
• How can consumers of information find and retrieve datasets in the RING?
• How can IT professionals get information on the level and mode of interoperability of the datasets and retrieve them from their applications?
• How can producers and managers of information register their datasets so their information products can be found?
Objectives
This presentation aims to:
• Explain how to search datasets in the RING
• Explain how other systems and applications can filter and get datasets from the RING automatically
• Explain how data managers can register and upload their datasets in the RING
The RING
• The CIARD RING is a global registry of information sources and datasets in agriculture, such as search engines, databases, repositories, Open Archives, directories, RSS feeds, open data endpoints…
Why
• Producers and managers of information need a place where their information products can be found
• Consumers of information need to find suitable information sources
• IT professionals need information on the level and mode of interoperability of information services
What is a dataset
Wikipedia: A dataset (or data set) is a collection of data. (Narrow definition) Most commonly a dataset corresponds to the contents of a single database table, or a single statistical data matrix, where each column of the table represents a particular variable, and each row corresponds to a given member of the dataset in question. […] Nontabular datasets can take the form of marked up strings of characters, such as an XML file.
W3C Government Linked Data Working Group (DCAT vocabulary), http://www.w3.org/TR/vocab-dcat/#class--dataset: A collection of data, published or curated by a single source, and available for access or download in one or more formats.
Datasets in the RING
• Datasets can be registered as standalone data sources or as part of a “collection” (collection managers can register their data sources)
• A dataset is identified by:
  • uniform type of content
  • uniform criterion of grouping (thematic, geographic, institutional)
  • uniform data structure (dimensions / metadata set, encoding, reference value lists)
• One dataset can be made available / accessible as different “distributions” (format, protocol, URL)
  • E.g. a CSV file for download, or an XML file available as a feed
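The relationship between one dataset and its several distributions can be sketched as a small data model. This is only an illustration: the class and field names below are hypothetical and are not the RING's actual schema, and the example URLs are placeholders.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Distribution:
    """One way of accessing a dataset (hypothetical field names)."""
    access_url: str
    data_format: Optional[str] = None  # e.g. "CSV", "XML"
    protocol: Optional[str] = None     # e.g. "download", "feed"


@dataclass
class Dataset:
    """A dataset with one or more distributions (hypothetical model)."""
    title: str
    publisher: str
    distributions: List[Distribution] = field(default_factory=list)

    def by_format(self, data_format: str) -> List[Distribution]:
        """Return the distributions whose format matches, ignoring case."""
        wanted = data_format.lower()
        return [d for d in self.distributions
                if (d.data_format or "").lower() == wanted]


# Placeholder data mirroring the slide's example: a CSV download and an XML feed.
example = Dataset(
    title="Example dataset",
    publisher="Example organisation",
    distributions=[
        Distribution("http://example.org/data.csv", "CSV", "download"),
        Distribution("http://example.org/feed.xml", "XML", "feed"),
    ],
)
```

A consumer that only understands CSV could then call `example.by_format("csv")` and ignore the other distributions.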
Getting a dataset Dataset Access Download or access through indicated protocol
How can other applications get the datasets
• RSS feed: http://ring.ciard.net/rss/datasets/all
• SPARQL: http://ring.ciard.net/sparql1 (test: http://www.gvlab.net/sparql1)
• DCAT vocabulary (W3C)
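An application could read the feed URL listed on this slide with a few lines of standard-library Python. The sketch below assumes the feed is plain RSS 2.0 with `item`, `title` and `link` elements, which the slides do not confirm; the function names and the sample XML are illustrative only.

```python
import urllib.request
import xml.etree.ElementTree as ET

RING_DATASETS_FEED = "http://ring.ciard.net/rss/datasets/all"  # URL from the slide


def parse_feed_items(xml_data):
    """Return a list of {'title', 'link'} dicts from RSS 2.0 text or bytes.

    Assumes RSS 2.0 element names; adjust if the real feed differs.
    """
    root = ET.fromstring(xml_data)
    items = []
    for item in root.iter("item"):
        items.append({
            "title": (item.findtext("title") or "").strip(),
            "link": (item.findtext("link") or "").strip(),
        })
    return items


def fetch_feed(url=RING_DATASETS_FEED, timeout=30):
    """Download the raw feed bytes (needs network access; not called here)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


# Made-up sample so the parser can be tried without network access.
SAMPLE = """<?xml version="1.0"?>
<rss version="2.0"><channel><title>Sample</title>
<item><title>Dataset A</title><link>http://example.org/a</link></item>
<item><title>Dataset B</title><link>http://example.org/b</link></item>
</channel></rss>"""

sample_items = parse_feed_items(SAMPLE)
```

With network access, `parse_feed_items(fetch_feed())` would apply the same parsing to the live feed, provided the assumption about its structure holds.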
Run this query and get OAI-PMH datasets records

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX dc: <http://purl.org/dc/terms/>
PREFIX dcat: <http://www.w3.org/ns/dcat#>
PREFIX adms: <http://www.w3.org/ns/adms#>
PREFIX doap: <http://usefulinc.com/ns/doap#>
PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
DESCRIBE ?dataset ?distro ?owner ?contact ?topic ?standard ?format ?protocol
WHERE {
  ?dataset rdf:type dcat:Dataset .
  ?dataset dc:title ?title .
  ?dataset dcat:distribution ?distro .
  ?dataset dc:publisher ?owner .
  ?distro dcat:accessURL ?url .
  ?distro adms:representationTechnique <http://ring.ciard.net/taxonomy_term/108> .
  OPTIONAL { ?dataset doap:maintainer ?contact }
  OPTIONAL { ?dataset dcat:theme ?topic }
  OPTIONAL { ?distro dc:conformsTo ?standard }
  OPTIONAL { ?distro dc:format ?format }
  OPTIONAL { ?distro adms:representationTechnique ?protocol }
}

<http://ring.ciard.net/taxonomy_term/108> is the URI that identifies the OAI-PMH protocol.
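A query like this can be sent from an application over HTTP using the standard SPARQL protocol, which passes the query text in a `query` parameter. The sketch below is an assumption-laden illustration, not the RING's documented client: it swaps the slide's DESCRIBE for a shorter SELECT over the same properties, and it assumes the endpoint accepts GET requests and can return JSON results. The function names are hypothetical.

```python
import urllib.parse
import urllib.request

RING_SPARQL_ENDPOINT = "http://ring.ciard.net/sparql1"   # URL from the slide
OAI_PMH_URI = "http://ring.ciard.net/taxonomy_term/108"  # URI from the slide


def build_query(protocol_uri=OAI_PMH_URI):
    """Shortened SELECT variant of the slide's query (not the original DESCRIBE)."""
    return (
        "PREFIX dc: <http://purl.org/dc/terms/>\n"
        "PREFIX dcat: <http://www.w3.org/ns/dcat#>\n"
        "PREFIX adms: <http://www.w3.org/ns/adms#>\n"
        "SELECT ?dataset ?title ?url WHERE {\n"
        "  ?dataset a dcat:Dataset ;\n"
        "           dc:title ?title ;\n"
        "           dcat:distribution ?distro .\n"
        "  ?distro dcat:accessURL ?url ;\n"
        f"          adms:representationTechnique <{protocol_uri}> .\n"
        "}"
    )


def build_request_url(endpoint, query):
    """Encode the query as the 'query' parameter of a GET request."""
    return endpoint + "?" + urllib.parse.urlencode({"query": query})


def run_query(endpoint=RING_SPARQL_ENDPOINT, query=None, timeout=30):
    """Send the query and return the raw response bytes (needs network access).

    Assumes the endpoint supports the JSON results format; not called here.
    """
    url = build_request_url(endpoint, query or build_query())
    request = urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()


request_url = build_request_url(RING_SPARQL_ENDPOINT, build_query())
```

Passing a different `protocol_uri` to `build_query` would filter on another protocol, as long as the corresponding RING taxonomy URI is known.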
Existing systems using RING datasets
• AGRIS: http://agris.fao.org/content/agris-centers
• ILRI: http://www.ilri.org/ring
• SZTAKI Drupal module
• AgroKnow Drupal module
Processing the datasets?
• The RING is also becoming a registry of APIs to process datasets
• Users will be able to search for APIs to perform the type of processing they need
  • They will get metadata about the technical specification of the APIs
  • In some cases, they will be able to launch the APIs over some selected RING datasets directly from the RING portal
The first APIs to become available will include agINFRA APIs for:
• Harvesting datasets
• Transforming datasets from one metadata set to another
• Automatically annotating dataset records with AGROVOC terms
• Enriching dataset records with VIVO and Google Scholar data
• Other workflows implemented in agINFRA
How to register a dataset
You need to have a “dataset” that either:
• Is available at a URL (for download or through a protocol)
• Can be uploaded as a file to the RING
Log in and click on “Add a service / dataset”
Thank you!