170 likes | 256 Views
Persistent Identification in the Dataverse Network Bonn, February 1 st. Rob Grim Research Data Specialist/e-Science Coördinator Executive Manager Open Data Foundation (ODaF) Library and IT Services Tilburg University. Overview. Economists Online (EO)
E N D
Persistent Identification in the Dataverse NetworkBonn, February 1st Rob Grim Research Data Specialist/e-Science Coördinator Executive Manager Open Data Foundation (ODaF) Library and IT Services Tilburg University
Overview Economists Online (EO) Study and Research Data Identifiers in DVN Experiences with Dataverse Discussion
Economists Online... Showcases some of the world's leading institutions, their scholars and their academic publications and datasets. Contains over 900,000 bibliographic references, many with links to open access full text. Combines content with RePEc archives to provide a new information service to the economist Is run by the Nereus consortium and was co-funded by the European Union
Selection of DVN Easy to use for information specialists, having no experience or background working with datasets. Local branding of DVN. Set access conditions (study, group and file level). Dataset citing information. Linking dataset information to publications.
DVN Identifiers Follow data citation standard defined by Altman and King (2007) Data citation consists of human readable and machine actionable components: Gary King; Langche Zeng, 2006, "Replication Data Set for 'When Can History be Our Guide? The Pitfalls of Counterfactual Inference'" hdl:1902.1/DXRXCFAWPK UNF:3:DaYlT6QSX9r0D50ye+tXpA== Murray Research Archive [distributor] Human readable fields: author, title, distributor, year. Machine-readable: handle services (global and local), use of Universal Numerical Fingerprints (UNF).
Persistent Identifiers DVN uses the Handle System (two-level hierarchical service model) Global Handle Registry (GHR) Identifiers are registered under a given naming authority handle (prefix). Local handle services (LHS) Each study is assigned an identifier. Authority handle and study ID are used to resolve to a working url for the study.
Universal Numerical Fingerprint (UNFs) Note: UNF formally is NOT part of the PI (Handle System). Question: Is the dataset at the resolved url the same dataset? UNF changes if the dataset content changes. UNFs are not dependent on the format of the data The UNF is determined by the content of the data. UNFs for dataset subsets and data elements. UNFs convey no information about the data. Reference confidential and proprietary data.
Dataverse Experiencesin practice… A study gets a new handle when copied to another collection. However: We want to cite the same study and dataset in various collections. Co-Authors often have different affiliations. Reference a study f.i. in the catalogue of Tilburg University, Economists Online and University Libre Brussel. Software extension?
Dataverse Experiencesin practice… What about supplementary materials (research objects)? Core to replication, verification of analytical results Link to datasets? Include identifiers for supplementary materials in the UNF?
Discussion Research Data are different from publications. In DVN both the study identifier and the UNF may not change while the network type/distributor may change. Use case EO: 1. storage in DVN 2. local 3. national archive. Note: Responsibility changes for resolving the dataset. What is behind a Handle/DOI? One dataset? More datasets? UNFs may change for a dataset? What is identified by the Handle/DOI?
Discussion What is/should be the policy to publish new DOIs/Handles Extend DVN for DDI 3.*? Why?
References Altman, Micah & King, Gary. D-Lib Magazine. March/April 2007. Volume 13 Number ¾. ISSN 1082-9873. A Proposed Standard for the Scholarly Citation of Quantitative Data. Crosas, M. The Dataverse Network: An Open-Source Application for Sharing, Discovering and Preserving Data. D-Lib Magazine: January/February 2011, Volume 17, Number ½.
Thank you! Further Questions? M: rob.grim@uvt.nl T: +31 13 466 2619