PlanetData in a nutshell
Elena Simperl, KIT
1st year review, Luxembourg, December 2011
PlanetData’s aim and objectives
• Aim: establish an interdisciplinary, sustainable European community on large-scale data management
  • Purposeful data exposure
  • Novel and improved applications
• Objectives
  • Addressing challenges through integrated research
  • Data and technology provisioning through PlanetData Lab
  • Impact through training, dissemination, standardization and networking
  • Openness and flexibility through PlanetData Programs
Work packages and activities
• Activity 1: Research
  • WP 1 Data Streams and Dynamicity
  • WP 2 Context Representation and Quality Assessment
  • WP 3 Provenance and Access Policies
• Activity 2: Data Provisioning and Management
  • WP 4 Data Provisioning
  • WP 5 Data Management
• Activity 3: Impact
  • WP 6 Training
  • WP 7 Dissemination and Community Building
• Activity 4: Management
  • WP 8 Project Management
Expected outcomes (i)
• Research on publishing and managing new species of interlinked data sets
  • Methods and techniques to publish, access and manage stream data
• Research on improving the usefulness of existing linked data sources
  • Quality assessment for interlinked data sets (for LOD and stream data), including best practices for the representation and usage of contextual information
  • Provenance, trust, access control (for LOD and stream data)
Expected outcomes (ii)
• Catalogues of data sets and vocabularies, including best practices for publishing and managing self-descriptive data
• Catalogues of data provisioning and management tools, including best practices on how to exploit clouds and clusters for distributed and large-scale data management
• Linked services and processes as an instrument to develop applications
Expected outcomes (iii)
• Yearly summer school co-located with the ESWC
• Open training infrastructure
• Semantic Web video journal
• PlanetData Programs
Example scenario
[Diagram labels: Provenance and access control; SSN ontologies; registry; C-SPARQL/SPARQL-STR/HTTP; Quality control; relational DB; stream DB; stream DB; RDF (stream) DB; CSV; Twitter]
Simplified scenario
[Diagram labels: Provenance and access control; SPARQL; SSN; Quality control; GADM; NUTS; NeoGeo]
Publishing and managing new species of interlinked data sets
• W3C SSN ontology documenting RDF data streams
• URI definition
• Supporting technology
  • Extensions to SPARQL (SPARQL-STR)
  • Data stream management systems (e.g., MonetDB)
  • Transformation/characterisation tools (e.g., Pachube2RDF)
Improving the usefulness of linked data sources
• Quality assessment combining database techniques with requirements from the Web of Data (LOD), and model-based techniques to clean sensor data
• GeoVocab to represent geospatial information
• GADM and NUTS region data and services published with mapping to relevant datasets in the LODC
• Provenance of SPARQL queries based on relational models
• Annotation model for access control; access control mechanism taking into account RDFS entailment
Geovocab.org
Cataloguing
• Surveys
• Metadata for Semantic Sensor Networks
• Vocabularies for datasets and streams
• Geospatial data, geospatial ontology
http://vocab.cc
PlanetData Programs
• 1st Call: 37 proposals submitted with a total requested contribution of almost 3,000,000 €
  • “Consuming and Quality Assessment of Linked Data in Urban Environments through Games with a Purpose” led by CEFRIEL
  • “Consuming and Improving Norwegian Linked Open Data for Regional Development and Environmental Friendly Behavior” led by Computas AS
  • “ParkMe: Linked Open Parking Data” led by Open University
Some facts and figures
• European Network of Excellence in Call 5 of FP7
• 4 years, started in October 2010
• Budget: 3.7 million €; EC contribution: 3 million €, 0.6 million € allocated to open calls
• 9 partners from 7 European countries
Project wiki: wiki.planet-data.eu
Agenda: Day 1
• 07-Dec-2011
• 14:45 - 17:00 Improving the usefulness of existing Linked Data sets
  • 14:45 - 15:30 Data quality and repair (Pablo Mendes, FUB; and Giorgos Flouris, FORTH)
  • 15:30 - 16:00 Representing contextual aspects of data (Andreas Harth, KIT)
  • 16:00 - 16:15 Coffee break
  • 16:15 - 17:00 Data provenance and access control (Irini Fundulaki, FORTH)
• 17:00 - 18:00 Towards Linked Stream Data (Oscar Corcho, UPM)
Agenda: Day 2
• 08-Dec-2011
• 09:00 - 09:45 Data sets, vocabularies and tools (Pablo Mendes, FUB)
• 09:45 - 10:30 Dissemination and community building (Lyndon Nixon, STI International)
• 10:30 - 11:15 Training (Mitja Jermol, JSI)
• 11:15 - 11:30 Coffee break
• 11:30 - 12:00 PlanetData Programs (Elena Simperl, KIT)
• 12:00 - 12:30 PlanetData outlook (Elena Simperl, KIT)
• 12:30 - 13:30 Lunch break
• 13:30 - 14:15 Closed session (PO + reviewers)
• 14:15 - 15:00 Feedback session