210 likes | 315 Views
D3.1 State of the art assessment on Linked Data and Digital Preservation René van Horik, Data Archiving & Networked Services, The Netherlands. Outline. Introduction / Context Summary of Deliverable D3.1. Objectives of the PRELIDA Project.
E N D
D3.1 State of the art assessment on Linked Data and Digital Preservation René van Horik, Data Archiving & Networked Services, The Netherlands
Outline • Introduction / Context • Summary of Deliverable D3.1
Objectives of the PRELIDA Project • collect, organize and publish use cases related to the long-term access to LD • create a comprehensive state of the art on LD and DP technologies • set up a technology observatory • bring together scientists and stakeholders for identifying relevant challenges and paths for addressing them in the near future • draw attention of standardization bodies
WP3 of Prelida project • Objective of WP3: • toprovideoverview of the state of art in Digital PreservationandLinked Data. • Information transfer betweentwocommunities. • Partners: CNR / APA / HUD / UIBK • Contributors: SotirisBatsakis, David Giaretta, Christophe Gueret, René van Horik, Maarten Hogerwerf, Antoine Isaac, Carlo Meghini, Andrea Scharnhorst. • Deliverables: • WP3.1 State of the art (month 12) • WP3.2 Consolidated state of art (month 24)
Twocommunities: Linked Data & Digital Preservation Onlysensorialexperiencecan lead toknowledge. Pointingto the earth (realism) Appliedmathematicalmethods leads toknowledge. Pointingtoheaven (mysticalnature of the universe) Plato andAristotle. Fragment of fresco “the School of Athens” by Raphael (1509-1510)
Table of contents of D3.1 • Introduction • Definitionsandterminology • Relevant dimensionsaddressedby DP projects • Initialideas on preserving Linked (Open) Data • Use cases • Cedar • Dbpedia • Europeana • Conclusions • Bibliography
Digital Preservation Long Term Preservation as definedby the OAIS reference model: • Components: • Long Term • IndependentlyUnderstandable • Designated Community • Authenticity • Information • Data • Representation Information • (see page 15)
Linked (Open) Data • Use the Web as a platform topublishand re-useidentifiersthatreferto data • Use a single data model forexpressing the data (RDF) • 3 waystopublish RDF data: • As annotationto Web documents(RDF data includedwithin the HTML code of the Web) • As Web documents(RDF documents are served next to HTML documents) • As a database (“triple stores”, query language: SPARQL) (see page 23 forexamplerelatedtoDbpedia)
What is specificforLOD for the DP community? • A number of formats (RDF / Triple Store / …) • No clearboundary • Dynamic, changes over time • Unclearownership
DP topics potentially relevant forpreservation of LD objects • Object classificationandvalidation • Persistent identifiers • Audit & Certification / Trustworthy Digital Repositories
Someinitialideas on preserving LOD • Object classification • Representation information -> Representationnetwork • (Persistent Identifiers) • (Audit & Cetification of TDR)
Classification of objects • must at least make sure we consider different types of data • rendered vs non-rendered • composite vs simple • dynamic vs static • active vspassive RDF Triple: dynamic/complex/non-rendered/passive (page 38)
OAIS Information model: Representation Information The Information Model is key Recursion ends at KNOWLEDGEBASE of the DESIGNATED COMMUNITY (this knowledge will change over time and region) Does not demand that ALL Representation Information be collected at once. A process which can be tested
Use Cases • CEDAR • Dbpedia • Europeana