160 likes | 310 Views
Biodiversity Information Standards, TDWG Sep. 26 – Oct. 1, 2010, Woods Hole, MA, USA. Data Citation Mechanism and Services for primary biodiversity data. INFORMATION FACILITY. GLOBAL BIODIVERSITY. Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org. WWW.GBIF.ORG.
E N D
Biodiversity Information Standards, TDWG Sep. 26 – Oct. 1, 2010, Woods Hole, MA, USA Data Citation Mechanism and Services for primary biodiversity data INFORMATIONFACILITY GLOBALBIODIVERSITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org WWW.GBIF.ORG Building the Biodiversity Informatics Commons
Why should I publish data? • Recognition • Opportunities • Investment What is there for me?
Data Publishing Framework • Cultural change towards ‘free and open access’ to biodiversity data • Addresses social, technical, and policy concerns • Answer ‘What is there for me?’ for ALL
Components of Data Publishing FRAMEWORK Chavan and Ingwersen (2009) , BMC Bioinformatics, 10 (Suppl. 14): S2
DPF: Core Technical Components Chavan and Ingwersen (2009) , BMC Bioinformatics, 10 (Suppl. 14): S2
Call for Data Citation • 1979: Dodd, S. A. (1979). Bibliographic references for numeric social science data files: Suggested guidelines. Journal of the American Society for Information Science, 30 (2): 77-82. • 1990: Dodd, S. A. (1990). Bibliographic references for computer files in the social sciences: A discussion paper. Chapel Hill, NC: Institute for Research in Social Science. Retrieved from http://people.virginia.edu/~pm9k/info/compRef.html • 2006/2007: Altman, M. & King, G. (2007). A proposed standard for the scholarly citation of quantitative data. D-Lib Magazine, 13 (3/4). • 2006: Schneider, J. (2006, Spring). Why we need a data citation standard: Lessons learned from compiling ICPSR’s Bibliography of Data-Related Literature. ICPSR Bulletin. Retrieved from http://www.icpsr.umich.edu/files/ICPSR/org/publications/bulletin/2006-Q1.pdf. • 2008: Kelly, M. C. (2008). NISO thought leader meeting on research data. Retrieved from http://www.niso.org/topics/tl/NISOTLDataReportDraft.pdf. • 2009: Green, T. (2009). We need publishing standards for datasets and data tables. OECD Publishing White Paper, OECD Publishing. • 2009: Brase, et al. (2009). Approach for a joint global registration agency for research data. Information Services & Use, 29 (1): 13-27. (i.e, DataCite)
Wish List for Data Citation • Best practice guide for data citation • Persistent identifiers to datasets • Credit to all players from data producers to publishers, aggregators etc. • All levels of granularity and combinations • With or without annotations • Link between traditional literature and data • Coordinated citation support for ALL • Research metrics for datasets
DataONE/DataCite Example DOI resolver and TIB registration 5. URL plus id EZID resolver and registration service 4. save full citation DataCite Member (eg, CDL) 3. citation + URL + id 6. full citation DataONE Coordinating Node metadata catalog (eg, UNM or UCSB) DataONE Member Node data archive (eg, Dryad) 2. metadata + URL + id 7. full citation get unique id string • data + • metadata Research scientist (opt) CDL-hosted EZID id minting service get unique id string
Citation model • When using data from Dryad, please cite the original article. • Sidlauskas, B. 2007. Testing for unequal rates of morphological diversification in the absence of a detailed phylogeny: a case study from characiform fishes. Evolution 61: 299–316. • Additionally, please cite the Dryad data package. The citation should include the following elements: • Author(s) • The date on which the data was deposited • The name of the data file, if applicable • The title of the data package, which in Dryad is always "Data from: [Article name]" • The name "Dryad Digital Repository" • The data identifier • For example: • Sidlauskas, B. 2007. Data from: Testing for unequal rates of morphological diversification in the absence of a detailed phylogeny: a case study from characiform fishes. Dryad Digital Repository. doi:10.5061/dryad.20
Data Citation Mechanism & Service • Deep data citation mechanism • Recognise ALL with their roles • Multilayer citation – producer, publisher, aggregator • Citations within citations • Data Citation Service • Resolve citation any time • Discover the underlined data
Data Citation: Challenges • Dealing with dynamic streaming data? • Resolving to human or machine interpretable description of object? • Need for registry of name spaces? • Can metadata standards support multiple GUIDs? • Failure to enforce data citation as mandatory step in Publishing cycle
Current Biology PhytoKeys Indian J. Mar. Sci. Data Paper: Recognising Data Discovery DoI Publication Journal System Acceptance GBRDS Revision Peer Review GBIF Metadata Repository Submission Registry auto conversion to manuscript Distributed Metadata Catalogues Persistent Identifiers Metadata Authors
Data Publishing together with Scholarly Publishing! Email: vchavan@gbif.org