210 likes | 331 Views
The need and drive for high quality data publication. COASP, Paris, September 2014. Iain Hrynaszkiewicz Head of Data and HSS Publishing, Open Research Nature Publishing Group & Palgrave Macmillan iain.hrynaszkiewicz@nature.com @ iainh_z. Publishers and data/reproducibility.
E N D
The need and drive for high quality data publication COASP, Paris, September 2014 Iain Hrynaszkiewicz Head of Data and HSS Publishing, Open ResearchNature Publishing Group & Palgrave Macmillan iain.hrynaszkiewicz@nature.com @iainh_z
Publishers and data/reproducibility • Policies on access (to data, code, reagents etc) • Supporting funder & community needs • Format and amount of content • Methodological details, supp info, data integration and links to repositories • Licensing for reuse • Incentives to share • Data citations • Data journals and articles • Quality assurance through peer review
Data/reproducibility and NPG • Some important events • 1996: Bermuda Principles – prepublication sharing • 1998: Structural data accession codes (Nature & Science) • 2002: MIAME-compliant microarray data deposition • 2007: Removal of limitations on Methods sections online • 2009: Ioannidis et al. Nat Gen 41, 2, 149 (2009) • .
Data/reproducibility and NPG • Some important recent events • 2013: Reproducibility checklist, source data from figures • 2014: Endorsing the Joint Declaration of Data Citation Principles • 2014: Launch of Scientific Data
Role of data journals/articles • Credit • Unpublished data • Peer review focus • Value of data vs. analysis • Discoverability • Reusability • Narrative/context • “Intelligently open data”
Scientific Data 2011 Market Research • Scope of survey • How much data researchers produce, in what format and what they do with it • Perceived availability of public repositories • Perceptions of the Scientific Data concept • Level/nature of data journal peer review • Respondent characteristics • 387 respondents (329 active researchers) • Physics (24%), Earth and environmental science (21%), Biology (20%) Chemistry (19%) Others (16%)
Scientific Data 2011 Market Research • Key survey data • 60% share their data with their colleagues • 50% look at other researchers’ datasets at least once a month • 45% unaware of a repository for some of their data • 90% reacted positively to the concept of Scientific Data • 80% believed Scientific Data would increase data deposition rates
Scientific Data 2011 Market Research • Key survey data – what do researchers want from a data publication? • 96% - increased visibility and discovery • 95% - increased usability of their research data • 93% - credit mechanism for deposit of data • 80% -peer review of content/datasets
Get Credit for Sharing Your Data Publications will be indexed and citeable. Open-access Creative Commons licenses (CC-BY/CC-BY-NC) for the main Data Descriptor. Each publication supported by CCO metadata. Focused on Data Reuse All the information others need to reuse the data; no interpretative analysis, or hypothesis testing Peer-reviewed Rigorous peer-review focused on technical data quality and reuse value Promoting Community Data Repositories Not a new data repository; data stored in community data repositories
Scientific Data • ScopeAn open access, peer-reviewed publication for descriptions of scientifically valuable datasets. Our primary article-type, the Data Descriptor, is designed to make your data more discoverable, interpretable and reusable. • Editorial teamManaging Editor (Andrew Hufton)Editorial Curator(Victoria Newman)Honorary Academic Editor (Susanna Sansone, Oxford)Advisory Panel and Editorial Board • Open access article processing charge$1,000 USD / £650 GBP / €750 for each accepted article
The ‘Data Descriptor’ article Detailed descriptions of the methods and technical analyses supporting the quality of the measurements. Does not contain tests of new scientific hypotheses Sections: • Title • Abstract • Background & Summary • Methods • Technical Validation • Data Records • Usage Notes • Figures & Tables • References • Data Citations
Peer review at Scientific Data • Focuses on: • Completeness (can others reproduce?) • Consistency (were community standards followed?) • Integrity (are data in the best repository?) • Experimental rigour and technical quality(were the methods sound?) • Does not focus on: • Perceived impact/importance • Size/complexity of data
The ‘Data Descriptor’ article Experimental metadata or structured component (in-house curated, machine-readable formats) Article or narrativecomponent (PDF and HTML)
Zehr et al. Scientific Data1, Article number: 140019 doi:10.1038/sdata.2014.19
Stem Cells • Associated Nature Article • Data at figshare& NCBI GEO • Integrated figshare data viewer
Neuroscience • New Dataset • Data in OpenfMRI • Source code in GitHub • Big Data Code in GitHub
The right licence Data Descriptor article: Licensed under one of two Creative Commons licenses, by author choice: Metadata: released under the CC0 waiver to maximize reuse and aid data miners Data: depends on public repositories. Partner repositories figshare and Dryad both use the CC0 waiver.
For more information please contact • IAIN HRYNASZKIEWICZHead of Data and HSS Publishing, Open Research • M: +44 (0)7814 290576T: +44 (0)207 0146753E: iain.hrynaszkiewicz@nature.com Thank you