150 likes | 161 Views
Learn how to provide maximal open access to scientific outputs by utilizing institutional repositories and appropriate metadata. Explore the benefits of archiving research data alongside published articles in searchable formats.
E N D
Archiving research data and research publications. • Dr Leslie Carr, Intelligence, Agents Multimedia, University of Southampton • Dr Simon Coles, School of Chemistry, University of Southampton • Dr Liz Lyon, UKOLN, University of Bath RCUK, Octiber 2004
Overview • In an Open Access environment • scientific outputs are openly available • described by appropriate metadata • in Institutional Repositories • harvestable by OAI protocols • Scientists can use the same infrastructure • (here eprints.org software and an existing scientific portal service) • to provide maximal open access • to all their data, as well as their published articles • raw data, intermediate calculations, final results • in a searchable, accessible form • BUT this is subject to ongoing investigation. RCUK, Octiber 2004
Current chemistry publishing protocols Ideas and interpretations Hooks into the literature Raw data! Results & derived data RCUK, Octiber 2004
Presentation services: subject, media-specific, data, commercial portals Searching , harvesting, embedding Resource discovery, linking, embedding Resource discovery, linking, embedding Data creation / capture / gathering: laboratory experiments, Grids, fieldwork, surveys, media Data analysis, transformation, mining, modelling Aggregator services: national, commercial Learning object creation, re-use Harvestingmetadata Learning & Teaching workflows Research & e-Science workflows Repositories : institutional, e-prints, subject, data, learning objects Institutional presentation services: portals, Learning Management Systems, u/g, p/g courses, modules Deposit / self-archiving Deposit / self-archiving Validation Validation Publication Resource discovery, linking, embedding Validation Linking Peer-reviewed publications: journals, conference proceedings Quality assurance bodies Data curation: databases & databanks RCUK, Octiber 2004
Data Overload! EPSRC National Crystallography Service How do we disseminate? The data deluge RCUK, Octiber 2004
CombeChem: An EPSRC pilot project Simulation Video Properties Analysis StructuresDatabase Diffractometer Propertiese-Lab X-Raye-Lab Grid Middleware RCUK, Octiber 2004
RAW DATA DERIVED DATA RESULTS DATA Crystallography workflow • Initialisation: mount new sample on diffractometer & set up data collection • Collection: collect data • Processing: process and correct images • Solution: solve structures • Refinement: refine structure • CIF: produce CIF (Crystallographic Information File format) • Report: generate Crystal Structure Report RCUK, Octiber 2004
Deposition into the archive RCUK, Octiber 2004
An Archive entry ecrystals.chem.soton.ac.uk RCUK, Octiber 2004
All the way back to the underlying data… RCUK, Octiber 2004
Dataset Data flow in eBank Searching, linking and embedding Dataset Dataset dcterms:references Harvesting OAI-PMH oai_dc Crystal structure (data holding) ePrint UK aggregator service Linking Searching, linking and embedding Harvesting OAI-PMH ebank_dc ebank_dc record (XML) Deposit PSIgate portal dc:type=“CrystalStructure” and/or “Collection” eBank UK aggregator service Institutional repository dc:identifier Crystal structure report (HTML) dcterms:isReferencedBy Harvesting OAI-PMH oai_dc Eprint “jump-off” page (HTML) dc:identifier Eprint manifestation (e.g. PDF) Eprint oai_dc record (XML) Subject service dc:type=“Eprint” and/or ”Text” Linking Searching, linking and embedding Model input Andy Powell, UKOLN. RCUK, Octiber 2004
Harvesting: OAIster RCUK, Octiber 2004
Linking and aggregating: Search & discover RCUK, Octiber 2004
Linking and aggregating: Hit browsing RCUK, Octiber 2004
And finally…eBank embedded in a science portal RCUK, Octiber 2004