How can CERIF facilitate access to institutional archives? Matthew Mascord euroCRIS Seminar 2004, Brussels Matthew Mascord, CCLRC, UK
Outline of Talk
1. What is an institutional archive?
2. What is CERIF?
3. How can CERIF help?
4. Case study: CCLRC
5. Conclusion
1. What is an institutional archive?
• Index, store (in a variety of formats) & disseminate intellectual output
• Publications, data sets, patents, technical reports, theses, etc.
• Free access (BOAI), increased impact, framework for preservation (OAIS), resource discovery
1. CCLRC's interest
• Demonstrating quality of science
• Widening public awareness & understanding of science
• Digital preservation (DCC, National Data Archives)
1. Potential tasks
• Find a paper
• Find a supervisor for a PhD
• Exploit cutting-edge science
• Re-run experiments or analysis
• Find collaborators
• Find a suitable instrument for an experiment
• Find contact people for press interviews
• Locate funding sources
1. How do they work?
[Diagram: Documents, Metadata, Middleware, Web, OAI]
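The OAI component in this picture is usually an OAI-PMH interface through which harvesters request metadata records. A minimal sketch of how a harvester might build a `ListRecords` request follows. The base URL is a placeholder (the slides do not give the archive's OAI endpoint), and the function name is purely illustrative.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the slides do not state the archive's OAI-PMH base URL.
BASE_URL = "https://archive.example.org/oai"


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build an OAI-PMH ListRecords request URL.

    'oai_dc' is the metadata prefix for Simple Dublin Core,
    which every OAI-PMH repository is required to support.
    """
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec is not None:
        params["set"] = set_spec  # optional selective harvesting
    return f"{base_url}?{urlencode(params)}"


print(list_records_url(BASE_URL))
# prints https://archive.example.org/oai?verb=ListRecords&metadataPrefix=oai_dc
```

The response to such a request is an XML document containing one Simple Dublin Core record per archived item.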
Simple Dublin Core Record Metadata
1. Simple Dublin Core
• 15 optional & repeatable elements
• "small language for making a particular class of statements about resources"
• "metadata pidgin for digital tourists"
• Encoding guidelines: XML, XML/RDF, HTML
• Language qualification
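To make "optional & repeatable" and "language qualification" concrete, here is a rough sketch that serialises a Simple DC record as XML. The wrapping `record` element, the helper name and all field values are invented for illustration. Only the Dublin Core element namespace and `xml:lang` are standard.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"        # Dublin Core element set
XML_NS = "http://www.w3.org/XML/1998/namespace"   # namespace of xml:lang
ET.register_namespace("dc", DC_NS)


def simple_dc_record(fields):
    """Build a record from (element, value, lang) tuples.

    Any element may be omitted or repeated; lang may be None.
    """
    root = ET.Element("record")  # hypothetical wrapper element
    for element, value, lang in fields:
        child = ET.SubElement(root, f"{{{DC_NS}}}{element}")
        child.text = value
        if lang:
            child.set(f"{{{XML_NS}}}lang", lang)  # language qualification
    return root


# Illustrative values only, not taken from a real archive record.
record = simple_dc_record([
    ("title", "An example report", "en"),
    ("title", "Un rapport d'exemple", "fr"),
    ("creator", "Author, A.", None),
    ("creator", "Writer, B.", None),
])
print(ET.tostring(record, encoding="unicode"))
```

Note that every value is plain text: nothing in the record says which identification scheme a value follows, or whether a creator is a person or an organisation.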
1. Simple DC
1. Simple DC
• Which identification scheme?
• What’s the title in French?
• Which classification scheme?
• How do I contact?
• Current name?
• Role?
• Person, organisation or other entity?
1. Simple DC
• How do I contact this publisher?
• Which date?
• Which classification scheme?
• Which format is the size applicable to?
• What is CCLRC's role?
• What is the relation with ISIS?
• What is SANDALS and how is it related?
• Who is the contact for SANDALS?
1. Qualified DC
• Additional element: audience
• Element refinements
• Encoding schemes
• Language qualification
• Dumb-Down Principle
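The Dumb-Down Principle says that a client that does not understand a qualifier can ignore it and still get a valid, if less precise, Simple DC statement. A minimal sketch, assuming a small hand-picked subset of DCMI element refinements (the mapping table is not exhaustive and the sample values are invented):

```python
# The 15 elements of Simple Dublin Core.
SIMPLE_ELEMENTS = {
    "title", "creator", "subject", "description", "publisher",
    "contributor", "date", "type", "format", "identifier",
    "source", "language", "relation", "coverage", "rights",
}

# A subset of element refinements and the element each one refines.
REFINES = {
    "alternative": "title",
    "abstract": "description",
    "created": "date",
    "issued": "date",
    "hasVersion": "relation",
    "isPartOf": "relation",
    "hasFormat": "relation",
    "spatial": "coverage",
    "temporal": "coverage",
    "bibliographicCitation": "identifier",
}


def dumb_down(qualified):
    """Reduce (term, value) pairs to Simple DC (element, value) pairs."""
    simple = []
    for term, value in qualified:
        element = REFINES.get(term, term)  # refinement -> base element
        if element in SIMPLE_ELEMENTS:     # 'audience' has no Simple DC home
            simple.append((element, value))
    return simple


# Illustrative values only.
print(dumb_down([
    ("title", "An example paper"),
    ("issued", "2004"),
    ("audience", "researchers"),
    ("hasVersion", "http://example.org/preprint"),
]))
# prints [('title', 'An example paper'), ('date', '2004'), ('relation', 'http://example.org/preprint')]
```

The information is kept, but the reader of the dumbed-down record can no longer tell that the date is a date of issue or that the relation is a version.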
1. Qualified DC
1. Qualified DC
• Which identification scheme?
• What’s the title in French?
• Which classification scheme?
• How do I contact?
• Is this the author’s current name?
• What role did this person play?
• Person, organisation or other entity?
1. Qualified DC
• How do I contact this publisher?
• Other names? e.g. APS
• Date of what exactly?
• Which classification scheme?
• Which format is the size applicable to?
• What is CCLRC's role?
• What is the relation with ISIS?
• What is SANDALS and how is it related?
• Who is the contact person for SANDALS?
1. Institutional archives: summary
• Single-entity databases
• 15-element Dublin Core
• Related objects of interest referenced by human-readable text in DC fields
• Free-text search
2. What is CERIF?
• Common European Research Information Format
• Latest version: CERIF2000 (1999)
• Multi-entity data model for current research information
• Template model for new CRISs
• Model for data exchange
2. What is CERIF?
[Entity diagram: Funding_Programme, CV, Person, Event, ExpertiseOrSkill, Facility, Contact, OrgUnit, Project, Equipment, Service, Classification_Scheme, Result_Publication, Classification, Result_Product, Result_Patent]
3. How can CERIF help?
[Diagram: Result_Publication, Person, Contact, CV, ExpertiseOrSkill; link labelled role:author]
• How do I contact this creator?
• What role did this creator play?
• Other publications & expertise?
3. How can CERIF help?
[Diagram: Result_Publication, OrgUnit, Contact, Equipment, Service; link labelled role:publisher]
• What is the relation with ISIS?
• What other science is carried out at ISIS?
• What equipment is needed for an experiment at ISIS & when is it available?
• Who should I send my ISIS proposal to?
3. How can CERIF help?
[Diagram: Result_Publication, Project, Funding_Programme, Person]
• What project was the publication a result of?
• What’s related?
• Where is the data?
• Who funded this research?
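The point of these slides is that a multi-entity model stores people, contacts and publications as separate records connected by links that carry a role, so the questions become simple traversals. The sketch below is only an illustration of that idea: the class and field names are invented and do not reproduce the actual CERIF2000 tables or attributes.

```python
from dataclasses import dataclass, field


@dataclass
class Contact:
    email: str


@dataclass
class Person:
    name: str
    contact: Contact
    expertise: list = field(default_factory=list)


@dataclass
class ResultPublication:
    title: str


@dataclass
class PersonPublicationLink:
    """Link record between a person and a publication, carrying the role."""
    person: Person
    publication: ResultPublication
    role: str


def contacts_for(publication, links, role="author"):
    """How do I contact the people who played this role?"""
    return [l.person.contact.email for l in links
            if l.publication is publication and l.role == role]


def other_publications(person, publication, links):
    """What else has this person contributed to?"""
    return [l.publication.title for l in links
            if l.person is person and l.publication is not publication]


# Illustrative data only.
alice = Person("A. Author", Contact("a.author@example.org"), ["neutron scattering"])
paper = ResultPublication("An example paper")
report = ResultPublication("An example technical report")
links = [
    PersonPublicationLink(alice, paper, "author"),
    PersonPublicationLink(alice, report, "editor"),
]

print(contacts_for(paper, links))               # ['a.author@example.org']
print(other_publications(alice, paper, links))  # ['An example technical report']
```

Because the contact details live on the person record rather than inside the publication metadata, updating them once keeps every linked publication current.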
4. Case study: CCLRC
• Open Access institutional archive with ~18,000 records accessible on the Web: http://epubs.cclrc.ac.uk
• Interface to CrossRef (DOIs)
• Multi-entity data model mapped onto Simple & Qualified Dublin Core for OAI-PMH
• Working to integrate with CERIF, CCLRC’s Corporate Data Repository & the CCLRC Data Portal
• CCLRC’s record of scientific output
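Mapping a multi-entity model onto Simple DC for OAI-PMH amounts to flattening role-carrying links into plain text fields. The following is a rough sketch of one way such a mapping could work. The role names and the rule "author becomes creator, anything else becomes contributor" are assumptions made for illustration, not the mapping ePubs actually uses.

```python
def to_simple_dc(title, person_links, orgunit_links):
    """Flatten (name, role) links into a Simple DC style dict of lists."""
    record = {"title": [title], "creator": [], "contributor": [], "publisher": []}
    for name, role in person_links:
        # Assumed rule: authors are creators, every other role is a contributor.
        key = "creator" if role == "author" else "contributor"
        record[key].append(name)
    for name, role in orgunit_links:
        key = "publisher" if role == "publisher" else "contributor"
        record[key].append(name)
    return record


# Illustrative data only.
dc = to_simple_dc(
    "An example paper",
    person_links=[("Author, A.", "author"), ("Editor, E.", "editor")],
    orgunit_links=[("Example Press", "publisher")],
)
print(dc["creator"], dc["contributor"], dc["publisher"])
# prints ['Author, A.'] ['Editor, E.'] ['Example Press']
```

The flattening is one-way: the exported record keeps the names but loses the exact roles and the links back to the person and organisation records.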
4. Case study: CCLRC
• CERIF2000 has only title, reference & URI for Result_Publication
• Needed to extend Result_Publication & Result_Patent to hold more metadata
• Asserson & Jeffery (2002, 2004) proposed a CERIF extension
• Glue linking CERIF to a more formal DC
[Entity diagram: Funding_Programme, CV, Person, Event, ExpertiseOrSkill, Facility, Contact, OrgUnit, Project, Equipment, DC_Resource, Classification_Scheme, Service, DC_Resource_Type, Classification, DC_Title, DC_Format, DC_Subject, DC_Coverage_Spatial, DC_Keywords, DC_Coverage_Temporal, DC_Description, DC_Rights_Management_Security, DC_Annotation]
4. Case study: CCLRC
• Also considered FRBR, ONIX & Qualified DC
• The HEP community uses preprints heavily: need version control
• FRBR introduces a four-level model: work, expression, manifestation, item
• Also multiple manifestations (e.g. PDF, DOC, PS)
4. Case study: CCLRC
[Diagram: DC_Resource modelled with FRBR]
• Work is realised through Expression
• Expression is embodied in Manifestation
• Manifestation is exemplified by Item
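The four FRBR levels nest naturally, which is what gives version control (expressions) and multiple file formats (manifestations) a place in the model. A minimal sketch with invented class and field names, not the schema used at CCLRC:

```python
from dataclasses import dataclass, field


@dataclass
class Item:
    location: str  # one concrete copy, e.g. a file URL


@dataclass
class Manifestation:
    file_format: str  # e.g. PDF, DOC, PS
    items: list = field(default_factory=list)


@dataclass
class Expression:
    version: str  # e.g. preprint, postprint
    manifestations: list = field(default_factory=list)


@dataclass
class Work:
    title: str
    expressions: list = field(default_factory=list)


def formats_by_version(work):
    """List the available file formats for each version of a work."""
    return {e.version: [m.file_format for m in e.manifestations]
            for e in work.expressions}


# Illustrative data only.
work = Work("An example paper", [
    Expression("preprint", [
        Manifestation("PDF", [Item("http://example.org/preprint.pdf")]),
        Manifestation("PS", [Item("http://example.org/preprint.ps")]),
    ]),
    Expression("postprint", [
        Manifestation("PDF", [Item("http://example.org/postprint.pdf")]),
    ]),
])

print(formats_by_version(work))
# prints {'preprint': ['PDF', 'PS'], 'postprint': ['PDF']}
```

Descriptive metadata such as title and creators attaches to the work once, while each version and format only records what is specific to it.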
4. Case study: CCLRC
• Some other extensions to formalised DC:
• Structured values for bibliographicCitation, e.g. volume, first page, etc.
• Sequence numbers for creators/contributors
• Linking to events & event series, e.g. conferences & workshops
• Serials (journals, report series) as works
• Structured/free text where unable to resolve orgunits/projects/people
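Two of these extensions, structured citation values and creator sequence numbers, are easy to picture as data. The sketch below is hypothetical: the field names, the display format and the sample values are assumptions, chosen only to show why structure helps.

```python
from dataclasses import dataclass


@dataclass
class Citation:
    """bibliographicCitation held as separate fields rather than one string."""
    journal: str
    volume: str
    first_page: str
    year: int

    def as_text(self):
        # One possible display form; other citation styles can reuse the same fields.
        return f"{self.journal} {self.volume} ({self.year}) {self.first_page}"


def ordered_creators(links):
    """Return creator names in author order from (sequence_number, name) pairs."""
    return [name for _, name in sorted(links)]


# Illustrative data only.
citation = Citation("Example Journal", "12", "345", 2004)
print(citation.as_text())
# prints Example Journal 12 (2004) 345

print(ordered_creators([(2, "Writer, B."), (1, "Author, A.")]))
# prints ['Author, A.', 'Writer, B.']
```

With the sequence number stored on the link, author order survives even though the people themselves are separate records shared across publications.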
[Diagram: DC_Resource modelled with FRBR (Work is realised through Expression; Expression is embodied in Manifestation; Manifestation is exemplified by Item) and linked to the CERIF entities Classification, ClassificationScheme, EventSeries (e.g. conference series), Event (e.g. workshop), OrgUnit, Person and Project]
[Diagram: Person, OrgUnit, Work, Expression, Manifestation; link labels: creator, publisher, hasVersion, hasFormat, isPartOf (URI)]
Preprint (arXiv), Postprint (Science Direct)
5. Conclusion
• CERIF provides context for scientific output
• Makes scientific knowledge easier to locate
• Connects scientists through up-to-date contact information
• Provides alternative pathways to related research knowledge
Contact
Matthew Mascord, CCLRC, UK
m.mascord@rl.ac.uk
http://epubs.cclrc.ac.uk