Pilots to Program: UC San Diego Research Data Curation Pilots and the Library Research Data Curation Program
Mary Linn Bergstrom, Matt Critchlow, Arwen Hutt, Declan Fleming, David Minor, Don Sutton
ALCTS Scholarly Communications Interest Group, ALA MidWinter 2014
Background
• UC San Diego Research Cyberinfrastructure (RCI) Pilots
  • selection
  • lessons learned
• UC San Diego Library Research Data Curation Program
  • services
  • collaborations
  • future directions
• Questions & Discussion
2008: campus-wide needs assessment
• What do campus users need today?
• What do they think they need tomorrow?
• What is hindering their research?
• 70% indicated need for short-term storage (1-3 years)
• 64% indicated need for long-term preservation of data sets
• and… data management help, metadata creation, tools for sharing, etc.
2009, 2010: reports
• April 2009: Blueprint for the Digital University
  • Publicly available: http://rci.ucsd.edu/_files/Blueprint.pdf
  • ‘Provides rationale and design for a campus-wide research cyberinfrastructure’
• April 2010: Cyberinfrastructure Planning and Operations Committee Report
  • Business plan for operationalizing the Blueprint
  • Plans, budgets, projections
RCI launch Jan 2011
• RCI Oversight Committee established, charged
• And funded!
• Elements:
  • High-performance Computing
  • Data Center Colocation
  • Storage
  • Networking & other services
  • Data Curation
Data Curation
• 2-year pilot phase
• Use existing tools whenever possible
  • Storage at San Diego Supercomputer Center
  • Chronopolis digital preservation network
  • Digital Asset Management System at UC San Diego Library
• Research Data Curation tools & services
  • Metadata consultation
  • Workshops [DMPTool]
  • DOIs [EZID]
Pilots
• The Brain Observatory
• Preserve and curate the digital version of the brain of patient HM, the most studied neuropsychological patient in modern medicine.
The Brain Observatory
• Aspects of image preservation
• Interaction with a commercial site
• Combinations of physical slides, images, pyramidal structures
NSF OpenTopography
• OpenTopography facilitates community access to high-resolution topography data and related tools and resources.
NSF OpenTopography
• Preservation of raw data
• Provide DOIs for complete datasets
• Information passing between portals
Levantine Archaeology Laboratory
• Focuses on archaeological investigations concerning the evolution of societies in the southern Levant from the Neolithic to Islamic periods.
Levantine Archaeology Laboratory
• Cyber-archaeology
• Tools for uniting field work, objects in cold storage, and digital imagery
• Develop the infrastructure needed to curate cultural heritage data that is enriched by new visualization and analysis tools.
Scripps Institution of Oceanography Geological Collections
• The Cored Sediment Collection contains samples collected since 1916. The collection is a growing archive of sea-floor samples and associated data supporting a wide variety of scientific research.
Scripps Institution of Oceanography Geological Collections
• Work with local data and a national community.
• Assist with the creation of a standards-based access, discovery and preservation system for one of the largest collections of marine geology samples in the United States.
Laboratory for Computational Astrophysics
• Advancing the state of the art of astrophysical simulation through the development and dissemination of community codes, and through large-scale simulations of astrophysical and cosmological systems.
Laboratory for Computational Astrophysics
• Support publishing simulations of astrophysical phenomena in cosmology, star formation and turbulence
• Provide data management and curation to improve collaborations with other researchers
• Provide metadata support
Data Curation
Digital Asset Management System (DAMS)
• Existing technology framework
• House and deliver digital objects and associated metadata
• Data model challenges
• Research data = greater depth and complexity
• Categorization and ownership of research datasets
• Filter on main application landing page allows browsing of Research or Library collections
• Rollout pending, user assessment to follow
• Branding and complex collection display
• New data model: objects as linked data entities in an RDF triplestore
• Hydra framework supplemented to support relationships and nested structure
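
The "objects as linked data entities" bullet can be pictured with a small sketch. It is not the DAMS schema, which the slides do not show: every identifier and predicate name below (`ex:hasPart`, `ex:title`, `ex:collection1`, and so on) is hypothetical, and plain Python tuples stand in for a real triplestore. It only illustrates how subject-predicate-object statements can carry the relationships and nested structure the slide mentions.

```python
# Hypothetical predicates and identifiers, for illustration only.
HAS_PART = "ex:hasPart"
TITLE = "ex:title"

# Each statement is a (subject, predicate, object) triple.
triples = [
    ("ex:collection1", TITLE, "Example research collection"),
    ("ex:collection1", HAS_PART, "ex:object1"),
    ("ex:object1", TITLE, "Example dataset"),
    ("ex:object1", HAS_PART, "ex:component1"),
    ("ex:object1", HAS_PART, "ex:component2"),
    ("ex:component1", TITLE, "Raw data"),
    ("ex:component2", TITLE, "Processed data"),
]


def objects_of(subject, predicate):
    """Return every object linked to `subject` by `predicate`, in list order."""
    return [o for s, p, o in triples if s == subject and p == predicate]


def outline(subject, depth=0):
    """Walk the hasPart links recursively and return an indented outline."""
    titles = objects_of(subject, TITLE)
    label = titles[0] if titles else subject
    lines = ["  " * depth + label]
    for child in objects_of(subject, HAS_PART):
        lines.extend(outline(child, depth + 1))
    return lines


if __name__ == "__main__":
    print("\n".join(outline("ex:collection1")))
```

Because nesting is expressed as ordinary statements rather than a fixed record layout, a dataset with deeper structure only needs more `ex:hasPart` triples, not a new schema.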
DAMS interface
Metadata Processes
• Pilot phase: extensive consultation
• What is an object?
• Understandable, usable, reusable
• What should be displayed and shared?
• Where should Digital Object Identifiers (DOIs) be assigned to support citation?
• Best practices and assistance with data organization
• Logical, intentional collocation of files, data and metadata
• Unique identifiers or naming protocols to clarify relationships & linking
• Metadata functionality
• Discoverability: controlled vocabularies
• Usability: what programs or scripts are required
Research Data Curation Program
• The Research Data Curation Program supports Core Direction 3 of the Library Strategic Plan 2011-2014:
  • Engage with partners to make digital scholarly work and data openly discoverable and accessible for the long term.
  • In response to the growing campus-wide data management and data preservation challenges, the Library will actively support open data and open access by collaborating with faculty, researchers, students and other partners to ensure the long-term curation and accessibility of scholarly works in all formats.
UC San Diego Libraries 2011
UC San Diego Library 2013
Research Data Curation Program
• Services
  • metadata services for complex research data
  • repository services via the Library DAMS
  • long-term preservation in Chronopolis [TRAC certified]
  • Digital Object Identifiers (DOIs)
  • training on data management, including the use of the Data Management Planning (DMP) Tool
Research Data Curation Program
• In collaboration with
  • UC San Diego Research Cyberinfrastructure (RCI)
  • Chronopolis digital preservation network
  • University of California Curation Center (UC3)
Metadata Services
• General data and metadata management expertise and advice
• Modular information resources: web based, workshops
• Researchers are subject experts
• Consultations
Data Management Plans
• Resources and contacts available to UC San Diego researchers
• Examples of DMPs from submitted proposals
• Recommended language from Office of Contracts and Grants
• Guidance, tips and recommendations for DMP preparation
EZID subscription
• EZID (easy-eye-dee), a UC3 service, makes it easy to create & manage unique, long-term identifiers
• store citation metadata for identifiers
• update current URL locations so citation links are never broken
• use EZID's programming interface for automated operation at scale
• choose from a variety of persistent identifiers, including DataCite DOIs
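
As a rough illustration of the "programming interface" bullet: to the best of my knowledge, EZID's API exchanges metadata as ANVL text, one `name: value` pair per line, with `%`, `:`, and line breaks percent-encoded. The sketch below only builds such a request body and sends nothing over the network. The helper names (`anvl_escape`, `build_anvl`) are mine, the element names (`_target`, `datacite.title`, and so on) are as I recall them from the EZID documentation, and every value is a placeholder, so all of it should be checked against the current API documentation before use.

```python
import re


def anvl_escape(text):
    """Percent-encode characters that would break the one-pair-per-line layout."""
    return re.sub(r"[%:\r\n]", lambda m: "%%%02X" % ord(m.group(0)), text)


def build_anvl(metadata):
    """Serialize a dict of metadata as ANVL 'name: value' lines."""
    return "\n".join(
        "%s: %s" % (anvl_escape(name), anvl_escape(value))
        for name, value in metadata.items()
    )


if __name__ == "__main__":
    # Placeholder values; element names are assumptions to verify against EZID docs.
    record = {
        "_target": "http://example.org/dataset/1",
        "datacite.creator": "Example Researcher",
        "datacite.title": "Example dataset",
        "datacite.publisher": "Example Library",
        "datacite.publicationyear": "2014",
    }
    print(build_anvl(record))
```

Updating `_target` on an existing identifier is what keeps citation links working when a dataset moves: the identifier stays fixed and only the stored location changes.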
In the works
• Researcher profiles
• Electronic lab notebooks
• Ingest functionality
• Visualization and integration tools [GIS, etc.]
• Data management tools, e.g. DataUp
• Data Information Literacy standards
• Social media data [Twitter, etc.]
• Communication, Education, Collaboration
• Assessment
Questions & Discussion
Mary Linn Bergstrom
Liaison Librarian
Science & Engineering, Research Data Curation
UC San Diego Library
mlbergstrom@ucsd.edu