110 likes | 127 Views
Discover why exposing and curating dissertation data is crucial, the benefits it offers, and how to implement this value-adding practice within scholarly communication. Explore the limitations of current ETD workflows and how enhancing curation can boost research impact.
E N D
A Practice and Value Proposal for Doctoral Dissertation Data Curation Aaron Collie Digital Curation Librarian Michigan State University Michael Witt Assistant Professor of Library Science Purdue University
We are going to answer: • Why expose data? • Why dissertation data? • What’s in it for us? • Why can’t we do it now? • How could we do it? • Why are you telling us this?
Proposition Summary • By reengineering ETD workflows to include research data, we can add value to both the process and the products of research • The Electronic Theses & Dissertation (ETD) lifecycle is a scale model of the scholarly communication lifecycle • By enabling curation at scale, we can grapple with the digital curation lifecycle model from within a subsystem of scholarly communication
Why dissertation data? “The role of the library in data-intensive research is important and a strategic repositioning of the library with respect to research support is now appropriate.” Alma Swan & Sheridan Brown. Skills, Role & Career Structure of Data Scientists & Curators: Assessment of Current Practice & Future Needs. JISC, July 2008. "The term ‘digital curation’ is increasingly being used for the actions needed to maintain and utilise digital data and research results over their entire life-cycle for current and future generations of users.” Joint Information Systems Committee (June 2003). JISC Circular 6/03 (Revised). http://www.dcc.ac.uk/docs/6-03Circular.pdf
Value Proposal: What’s in it for us? • Ensures research integrity • Collection of Digital Assets • Option to self-archive data as Open Access • Availability of Data Citations • Preparation for grant writing • Improves research impact
Limiting Context: Why can’t we do it now? Current ETD workflows allow for the students to affix data as a supplementary and disjointed attachment to the theses or dissertation as a final step in the deposition process. Format and file-size restrictions limit the type and amount of data that ETD submission systems will accept. Because data is not hosted until after the final draft has been deposited, persistent unique identifiers are not able to co-link document and data. Because data is treated as supplementary to the ETD, data is only acquired if provided by the author. Examination committee’s are limited by not having data readily (automatically) available during the review process. Data currently inherit the restrictions placed upon the ETD package
ETD Management Systems • ETD Management Systems are content management systems with a specialized suite of services to support ETDs • ETD Management Systems include submission packages • Examples: openETD, vireo, VALET, HYDRA, FEZ
Deposition vs. Curation: How could we do it? Current Proposed
ETDs: A scale-model digital curation environment Similar Route Different Scale Flickr user: contemplative imaging Flickr user: loadstone
Conclusions • ETDs: A scale-model digital curation environment • That is a subsystem of Scholarly Communication • That is a tractable fragment of the research lifecycle • That contains a captive audience • Dissertation data: A low hanging fruit for building and curating data collections • That is aligned with the mission of the university and library • That will help populate fledgling data collections • That has a significant value proposition