310 likes | 537 Views
Some European Partnerships. Partnerships in Innovation II: From Vision to Reality and Beyond PARTNERSHIPS IN RESEARCH David Giaretta October 7- 8, 2008, College Park, Maryland. Outline. CASPAR Alliance for Permanent Access PARSE.Insight Possible European Preservation Infrastructure.
E N D
Some European Partnerships Partnerships in Innovation II: From Vision to Reality and Beyond PARTNERSHIPS IN RESEARCH David Giaretta October 7- 8, 2008, College Park, Maryland
Outline • CASPAR • Alliance for Permanent Access • PARSE.Insight • Possible European Preservation Infrastructure Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
CASPAR Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
Rep • Info /DISCIPLINE Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
Preservation Data Flows and Strategies Preliminary investigation of data holdings Stakeholder and Archive Analysis Identify Preservation Objective Identify Designated Community Create Preservation Info Flow Diagram More strategies than just “emulate or transform” Create Preservation Strategies Cost/Benefit Analysis Preservation activities Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
Creating an OAIS Archival Information Package Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
Understanding information from the bits upwards Villa Livia - UNESCO • 3D point clouds (formats: imp, dxf, dwg) • Elevation grids (agr, bt) • 3D meshes (mdl, vrml, v3d) • Textured 3D models (max, pmr, ive, osg) • Satellite data (ers, ecw) • GPS data, maps (txt, apm, shp) • Digital images (targa, jpeg, tiff, png, psd, bmp, gif, dds) Total data size approximately 500 GB Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
Performing Arts Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland Thanks to ULeeds and CNRS
/DISCIPLINE Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
interpretedUsing User Profile RImodule InfoObject DataObject m1 u1 p1 o1 m2 m4 o2 u2 p2 m3 Understandability/ Knowledge Management • Need to be able to define the OAIS “Designated Community” knowledge base • Knowledge dependencies • Important for sharing knowledge between disparate communities Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland Thanks to FORTH
Example: Identification of an Attribution Right LF1. Written_Norm Art. X of Law Y Legislation is_documented_in 100% precision CR. Activity_Type CR51. Attribution_Right generates To claim authorship Singleton CR20. Perform allows Singleton Work’s Provenance 100% recall, <100% precision has_type has_type E7. Activity Kia claiming authorship E7. Activity E39. Actor F28. Expression_Creation Kia Ng Activity of Improvisation on the Violin performed_by carried_out has_right_type created E30. Right E72. Legal Object CR.Ownership Right is_on F22. Self_contained_Expression Kia’s right to claim authorship Expression of the Improvisation on the Violin Derived Property Rights became_owner_of Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland Thanks to MetaWare FRBRoo Rights Ontology CIDOC-CRM
Click • App Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
ARCHIVE DATA • Create digital information • Create AIP • Create RepInfo • Store AIP in Repository • Could also copy RepInfo to Registry Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
USE DATA • Use application to find data in Repository • Create DIP with enough RepInfo for the user (via DC profile) • Obtain more RepInfo from Registry if necessary Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
CREATE REPINFO • Data holders etc inform Orchestration of dangers to preservation • Orchestration uses Gap Manager to derive implications • Orchestration asks experts for help e.g., • Experts search for or create additional RepInfo needed to fill “gap” & store in RegRep Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
Alliance for Permanent Access • The Alliance aims to develop a shared vision and framework for a sustainable organisational infrastructure for permanent access to scientific information Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
History • 1 November 2004 the Koninklijke Bibliotheek, National Library of the Netherlands, organized the international conference Permanent Access to the Records of Science within the framework of the Netherlands presidency of the EU. • small number of large players only • even smaller group prepared documents • Mid 2005 the Task Force produced two documents: • a proposal for a European R&D programme in the field of long term preservation and • an overall Strategic Action Programme 2006-2010. • In 2006 a small working group was formed to prepare the establishment of the formal Alliance for Permanent Access. • after several attempts prepared a document on the legal status of the Alliance and a draft programme of work. • Late 2007 – members sign up and pay membership • PARSE.Insight project from sub-group of the Alliance Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
Membership The following organisations have so far joined the Alliance for Permanent Access: • The European Science Foundation • European Space Agency • CERN • Max Planck Gesellschaft • Science and Technology Facilities Council • The British Library • Koninklijke Bibliotheek • Deutsche Nationalbibliothek • Joint Information Systems Committee • International Association of Scientific, Technical and Medical Publishers • Digital Preservation Coalition • NESTOR • Netherlands Coalition for Digital Preservation • Portico Membership will be opened up to commercial organisations Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
PARSE.Insight • Funded by EU • 1.25 MEuro, 9 partners, 2 years • Designed to help EU direct e-Sci funding in 2009/2010 • Officially started 1 March 2008 Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
Phases Draft a ROADMAP Use to structure survey questionnaire Conduct survey and case studies to find out what will be funded in any case Analyse results Refine ROADMAP GAP = difference between ROADMAP and Survey results Tool to show relative impact of funding strategies Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
Possible European Preservation Infrastructure Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
Persistent ID resolver RepInfo Registry Authenticity tools Processing Context Certification Orchestration/Brokering Knowledge Gap Manager Persistent ID resolver RepInfo Registry Authenticity tools Processing Context Certification Orchestration/Brokering Knowledge Gap Manager Discipline repositories Storage Compute Resource Local Authentication Local Authorisation WAN LAN Router Switch Cable Interconnects Gateways Management WAN LAN Router Switch Cable Translators Thesauri Cross-references Storage Compute Resource Local Authentication Local Authorisation Repositories Users Automated systems Repositories Users Automated systems Discipline repositories Translators Thesauri Cross-references Resource Registries Process ID Scheduler Shibboleth FUTURE • Users may be unable to understand or use the data e.g. the semantics, format, processes or algorithms involved • Non-maintainability of essential hardware, software or support environment may make the information inaccessible • The chain of evidence may be lost and there may be lack of certainty of provenance or authenticity • Access and use restrictions may fail in the future • Loss of ability to identify the location of data • The current custodian of the data, whether an organisation or project, may cease to exist at some point in the future • The ones we trust to look after the digital holdings may let us down Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
FITS FILE MULTIMEDIA PERFORMANCE DATA FITS DICTIONARY FITS STANDARD DICTIONARY SPECIFICATION C3D DirectX MAX/MSP FITS JAVA s/w PDF STANDARD 3D scene data files 3D motion data files motion to music mapping strategy XML SPECIFICATION PDF s/w JAVA VM UNICODE SPECIFICATION Modules and Dependencies: Examples README.txt ENGLISH LANGUAGE TEXT EDITOR WINDOWS XP Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
Modules and Dependencies: Examples (Semantic Web data) modules and dependencies ns4 ns3 ns2 ns1 RDF/S Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
T tx ty t1 Tu t3 t2 t4 t5 t6 t7 t8 Formalizing Actor/Community knowledge(in terms of modules and dependencies) • Each actor or community u can be characterized by a profile Tu that contains those modules that are assumed to be available/known to u. • Formalization: Tu T (where T is the set of all modules) Examples • u is an artificial agent • Tu may include the software/hardware modules available to it • u is a human, • Tu may include modules that correspond to implicit knowledge Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
C+(tx) = C(tx)- {tx} T tx ty C+(ty) = C(ty)-{ty} t1 Closure of Tu Tu t3 t2 t4 t5 t6 t7 t8 The notion of closure(of modules and profiles) • Closure of a module t: C(t) = all modules on which it depends • Closure of a set of modules S: C(S) = { C(t) | t S } • Required modules of t C+(t) = C(t) - {t} Intelligibility Gap: The smallest set of extra modules that u needs to have in order to understand a module t. Notation: Gap(t,u): The intelligibility gap between a user u with profile Tu and a module t Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
tx ty Reqs of tx tx ty Reqs of ty t1 Closure of Tu t1 Closure of Tu Tu t3 t2 Tu t3 t2 t4 t5 t6 t4 t5 t6 t7 t8 t7 t8 Gap(tx,u)= {t1, t2, t4, t5} Gap(ty,u)= Intelligibility and Intelligibility Gap (I) • u can understand t iff: C+(t) C(Tu) • The intelligibility gap: Gap(t,u) = C+(t)-C(Tu) • This means that: • if we want to preserve a digital object t for a community with profile Tu then we need to get and store only Gap(t,u) plus an id that denotes Tu. • if we want to deliver an object t to an actor with profile Tu, • then the only extra modules that we should deliver to him in order to • return him something intelligible, is the set Gap(t,u). Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
Exploiting DC Profiles for constructingthe “right” AIPs(intelligible and redundancy free) o1 DC profiles could be exploited so that to be able to derive different AIPs and DIPs for different DCommunities (If the dependencies are available this could be done automatically) t1 DC2 ={t3,t5} t2 DC1 ={t2} t3 t4 t5 t6 DC3 ={t7,t8} t7 t8 AIP of o1 wrt DC2 AIP of o1 wrt DC1 AIP of o1 wrt DC3 Object = o1 DCprofile = DC1 deps = {t1,t3} Object = o1 DCprofile = DC2 deps = {t1,t2,t4} Object = o1 DCprofile = DC3 deps = {t1,t2,t3,t4,t5,t6} Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
Scenario: Intelligibility-aware Packaging o1 o2 o3 P3 P1 FITS ZIP DirectX C3D MAX/MSP FITS DICTIONARY DC Profiles • P1 = {FITS} // for astronomers • P2 = {PDF, XML} // for casual users • P3 = {C3D, DirectX, MAX/MSP} Objects • o1 // a pdf document • o2 // a FITS file • o3 // a zip file containing multimedia performance data FITS STANDARD DICTIONARY SPECIFICATION P2 PDF STANDARD XML SPECIFICATION UNICODE SPECIFICATION Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland
Scenario: Intelligibility-aware Packaging o1 o2 o3 P3 P1 • Gap(o2,P1) = • Gap(o2,P2) = • {FITS, FITS_STANDARD, FITS_DICTIONARY, DICTIONARY_SPECIFICATION} • Gap(o2,P3) = • {FITS, FITS_STANDARD, FITS_DICTIONARY, DICTIONARY_SPECIFICATION, PDF_STANDARD, XML_SPECIFICATION, UNICODE_SPECIFICATION} • Gap(o3,P3) = • {ZIP} • Gap(o3, ) = • {ZIP, C3D, DirectX, MAX/MSP} FITS ZIP DirectX C3D MAX/MSP FITS DICTIONARY FITS STANDARD DICTIONARY SPECIFICATION P2 PDF STANDARD XML SPECIFICATION UNICODE SPECIFICATION Partnerships in Innovation II: From Vision to Reality and Beyond October 7- 8, 2008, College Park, Maryland