1 / 27

PlanetData in a nutshell

PlanetData in a nutshell. Elena Simperl, KIT 1st year review Luxembourg, December 2011. The idea. PlanetData‘s aim and objectives. Aim: establish an interdisciplinary, sustainable European community on large-scale data management Purposeful data exposure Novel and improved applications

cayla
Download Presentation

PlanetData in a nutshell

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. PlanetData in a nutshell Elena Simperl, KIT 1st year review Luxembourg, December 2011

  2. The idea

  3. PlanetData‘s aim and objectives • Aim: establish an interdisciplinary, sustainable European community on large-scale data management • Purposeful data exposure • Novel and improved applications • Objectives • Addressing challenges through integrated research • Data and technology provisioning through PlanetData Lab • Impact through training, dissemination, standardization and networking • Openness and flexibility through PlanetData Programs

  4. Work plan and expected results

  5. Work packages and activities Activity 2 Data Provisioning and Management WP 4 Data Provisioning WP 7 Dissemination and Community Building Activity 1 Research Activity 3 Impact WP 1 Data Streams and Dynamicity WP 2 Context Representation and Quality Assessment WP 3 Provenance and Access Policies WP 6 Training WP 5 Data Management Activity 4 Management WP 8 Project Management

  6. Expected outcomes (i) • Research on publishing and managing new species of interlinked data sets • Methods and techniques to publish, access and manage stream data • Research on improving the usefulness of existing linked data sources • Quality assessment for interlinked data sets (for LOD and stream data), including best practices for the representation and usage of contextual information • Provenance, trust, access control (for LOD and stream data)

  7. Expected outcomes (ii) • Catalogues of data sets and vocabularies, including best practices for publishing and managing self-descriptive data • Catalogues of data provisioning and management tools, including best practices on how to exploit clouds and clusters for distributed and large-scale data management • Linked services and processes as an instrument to develop applications

  8. Expected results (iii) • Yearly summer school co-located with the ESWC • Open training infrastructure • Semantic Web video journal • PlanetData Programs

  9. Examplescenario Provenanceandaccesscontrol SSN ontologies registry C-SPARQL/SPARQL-STR/HTTP Quality control relational DB stream DB stream DB RDF (stream) DB CSV twitter

  10. Simplified scenario Provenance and access control SPARQL SSN Quality control GADM NUTS NeoGeo

  11. Highlights of the first year

  12. Publishing and managing new species of interlinked data sets • W3C SSN ontology documenting RDF data streams • URI definition • Supporting technology • Extensions to SPARQL (SPARQL-STR) • Data stream management systems (e.g., MonetDB) • Transformation/characterisation tools (e.g., Pachube2RDF)

  13. Improving the usefulness of linked data sources • Quality assessmentcombiningdatabasetechniqueswithrequirementsfromthe Web of Data (LOD), and model-basedtechniquesto clean sensordata • GeoVocabtorepresentgeospatialinformation • GADM and NUTS regiondataandservicespublishedwithmappingto relevant datasets in the LODC • Provenanceof SPARQL queriesbased on relational models • Annotation model foraccesscontrolaccesscontrolmechanismtakingintoaccount RDFS entailnment Geovocab.org

  14. Cataloguing Surveys Metadata for Semantic Sensor Networks Vocabularies for datasets and streams Geospatial data, geospatial Ontology http://vocab.cc

  15. Training

  16. Dissemination

  17. PlanetData Programs • 1st Call: 37d proposals submitted with a total requested contribution of almost 3.000.000 € • “Consuming and Quality Assessment of Linked Data in Urban Environments through Games with a Purpose” led by CEFRIEL • “Consuming and Improving Norwegian Linked Open Data for Regional Development and Environmental Friendly Behavior” led by Computas AS • “ParkMe: Linked Open Parking Data” led by Open University

  18. The management

  19. Somefactsandfigures • European Network of Excellence in Call 5 of FP7 • 4 years, started in October 2010 • Budget: 3.7 million €; EC contribution: 3 million €, 0.6 million € allocatedto open calls • 9 partnersfrom 7 European countries

  20. The team

  21. Management structure

  22. Core & associatepartners

  23. Project wiki : wiki.planet-data.eu

  24. The agenda

  25. Agenda: Day 1 • 07-Dec-2011 • 14:45 - 17:00 ImprovingtheusefullnessofexistingLinked Data sets • 14:45 - 15:30 Data qualityandrepair (Pablo Mendes, FUB; and Giorgos Flouris, FORTH) • 15:30 – 16:00 Representingcontextualaspectsofdata (Andreas Harth, KIT) • 16:00 – 16:15 Coffee break • 16:15 - 17:00 Data provenanceandaccesscontrol (Irini Fundulaki, FORTH) • 17:00 - 18:00 TowardsLinked Stream Data (Oscar Corcho, UPM)

  26. Agenda: Day 2 • 08-12-2011 • 09:00 - 09:45 Data sets, vocabulariesandtools (Pablo Mendes, FUB) • 09:45 - 10:30 Dissemination andcommunitybuilding (Lyndon Nixon, STI International) • 10:30 - 11:15 Training (Mitja Jermol, JSI) • 11:15 - 11:30 Coffee break • 11:30 - 12:00 PlanetData Programs (Elena Simperl, KIT) • 12:00 - 12:30 PlanetData outlook (Elena Simperl, KIT) • 12:30 – 13:30 Lunch break • 13:30 – 14:15 Closedsession (PO+reviewers) • 14:15 – 15:00 Feedback session

More Related