140 likes | 275 Views
Dataset Announcement Notice Submissions. A CASE STUDY. Giri Palanisamy Mark Martin STIP Annual Working Meeting April 2012.
E N D
Dataset Announcement Notice Submissions A CASE STUDY GiriPalanisamy Mark Martin STIP Annual Working Meeting April 2012
“Beyond broad availability of technical reports, e-prints and multimedia, and publication in peer-reviewed journals, open access to experimental data and analysis codes is increasingly important in policy-relevant research areas.”2011 Department of Energy Strategic Plan, Assure Excellence in R&D Management Why Cite Data? Data should be cited in just the same way that other sources of information, such as articles and books, are cited. • Data citation: • enables easy reuse and verification of data • allows the impact of data to be tracked • creates a scholarly structure that recognizes and rewards data producers
A global consortium composed of local institutions focused on improving the scholarly infrastructure around datasets and other non-textual information. OSTI is the only U.S. Federal Agency member of DataCite • Science is global • Global standards • Global workflows • Cooperation of global players • Science is carried out locally • By local scientists • As part of local infrastrucures • Having local funders AND
Member Institution Member Institution Data Centre Data Centre Data Centre Data Centre Data Centre Data Centre How is DataCite Structured? International DOI Foundation Member Managing Agent(TIB) DataCite Composes AssociateStakeholder Works with …
DataSets With the Department’s leadership in data-intensive science involving simulation, modeling, and computer-driven experimentation, we are generating large quantities of data. Open access to this raw data is needed and expected so that scientists can quickly build on each other’s work without duplicating their work. Data ID Service Service Established for DOE Datasets Registration & announcement of datasets of R&D results & STI we’ve made accessible for the past 65 years...
Data ID Service Oak Ridge National Laboratory Atmospheric Radiation Measurement (ARM) datacenter • First DOI minted 8/10/2011 • Over 350 ARM datasets are now registered
Arm Archive • DOIs allow the users to more directly cite the exact ARM data that they have used in their research. • DOIs also allows the future data users and the ARM program to easily track the data used in various publications The Challenge: • Millions of data files from over 3000 data products • Most of them are continuous datastreams • Large user community (Climate change model community) • Data is also published via other portals
Arm strategy for DOIs - DOI provided by OSTI and assigned at the ARM data product level - DOIs are presented in the ARM datastream pages and field campaign readme files. - DOIs will also be sent via Archive data notification email. Demo
How TO cite arm data Several citation formats are possible using DOIs. ARM encourages users to include the following information when citing ARM data: • Author • Original publication date • Update period, if applicable (daily, monthly, quarterly, yearly, etc.) • Dataset name • Dates used* • Locations* (latitude/longitude, site name, and facility identifier) • Editor(s) or compiler(s) • Place of publication • Publisher • Date accessed* • DOI* * Needed for future replication of data requests More Information: http://www.arm.gov/data/docs/doi-guidance
Example of Scientific Impact ORNL DAAC: Data Products used in literature ORNL DAAC requests that data be cited in list of references; some authors “refer” to data in text or acknowledgements Number
Data specific DOI Documents • http://ipydis.org/data/citations.html • http://www.arm.gov/data/docs/doi-guidance • http://www.cdlib.org/services/uc3/ezid/identifiers.html