Explore the evolving data environment and the importance of data stewardship and management plans, with insights on repositories, metadata, and access capacity building. Learn about international initiatives and the need for investment in national data platforms. Discover the role of collections, repositories, and infrastructure in effective data management.
The Changing Data Environment
Train the Trainers, Montreal, February 2010
Chuck Humphrey, University of Alberta
Data’s shameful neglect (Nature, Sep 10, 2009)
• Metadata
• Life cycle
• Data citation
• Data sharing
• Data repurposing
• Data archiving
• Data repository
• Data management plans
• Access
• Capacity building
• Preservation
Data stewardship infrastructure
• International initiatives that make me envious
  • U.S. NSF DataNet
  • Europe’s Digital Repository Infrastructure Vision for European Research (DRIVER)
  • DRIVER II: Federated data repositories
• Where’s Canada?
  • We need an investment by CFI into a national platform for data stewardship
e-Infrastructure for repositories and e-Infrastructure of repositories (figure; each layer is listed with its qualities, then its examples)
• Information (authenticity, quality, longevity): collections of data, work-flows, publications, learning materials, etc.
• Repositories services (ease of use, availability, reliability): deposit, annotation, delivery, visualisation, search, help, etc.
• Repositories (trusted, open, well managed): repository management, curation, physical security, etc.
• Access (standardised, stable, flexible): authentication, authorisation, logical security, federation, portals, etc.
• Management (transparent, responsive, informed): Grids, Virtual Organisations, etc.
• Physical infrastructure (available, scaleable, reliable): networks, computing, HPC, physical storage, etc.
Source: Mário Campolargo, Open Grid Forum, Barcelona, 3 June 2008; eSciDR study (adapted)
Data: the power of collections
• Large science data centres, like the Chandra Data Archive, are successful because they consist of an organized, managed and preserved collection.
• Statistics Canada discovered the power of collections through academic library participation in DLI and the RDC Network.
No data file left behind
• Data centre model
• Astronomy example
• The data centre is part of the instrumentation infrastructure
• Data repository model
• Islandora
• Scholars Portal ODESI
• Domain archives
• Omnibus collections like the ICPSR & UKDA
Data Services
Source: http://www.dcc.ac.uk/lifecycle-model/
Data Services
This table lists changes to the stages in the DCC model, re-aggregating activities in the lifecycle to create a data library viewpoint.
Data stewardship lifecycle (figure labels)
• Reference
• Data Discovery
• Data Repurposing
• Data Dissemination
• Data Production
• Collections management
• Data Repository
Data management services
• International initiatives that make me envious
  • Australia ANDS
  • UK RDS
• What should Canada be doing?
  • Developing data management planning methods and protocols
  • Providing national services, such as DataCite
  • Building capacity
Research Data Strategy Working Group
• Initiated by Pam Bjornson, CISTI Executive Director
• Cross-sector working group consisting of members from government departments & agencies and research libraries
• Stewardship of Research Data in Canada: A Gap Analysis (January 2009). Uses a lifecycle model to identify data problems in Canada.
• RDSWG reorganized in anticipation of the release of the Gap Analysis in fall 2008.
  • Task Group 1: Engagement strategy
  • Task Group 2: Policies, funding and reward systems
  • Task Group 3: Infrastructure and services
  • Task Group 4: Capacity
Gap analysis summary
Source: The Stewardship of Research Data in Canada: a gap analysis, Table 2, page 17.
CARL Data Management Working Group
Members
• Marnie Swanson, Chair (U of Victoria)
• Pam Bjornson (CISTI)
• Lynn Copeland (SFU)
• Michelle Edwards (U of Guelph)
Observers
• Bernie Gloyn (Statistics Canada)
• Margaret Haines (Carleton U)
• Janine Schmidt (McGill U)
• Kathleen Shearer (CARL consultant)
Produced the Data Management Awareness Toolkit
Metadata: making data smart
• Standards for documenting the lifecycle
  • CASRAI: the Consortia Advancing Standards in Research Administration Information
  • DDI 3.0
  • COPPUL’s Table-to-Data (T2D) initiative
• Tools to exploit metadata
  • Metadata needed to automate aspects of asset management
  • Example of the risk management of confidential data
  • Track the lifecycle documenting risk factors
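As a rough illustration of the last point, tracking the lifecycle while documenting risk factors, here is a minimal Python sketch. It is not drawn from DDI, CASRAI or any tool named in the slides; the class names, stage labels, risk descriptions and the review rule are all hypothetical, chosen only to show how lifecycle metadata could let a repository flag confidential files automatically.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List


@dataclass
class LifecycleEvent:
    """One step in a dataset's life (hypothetical stage names)."""
    stage: str
    when: date
    risk_factors: List[str] = field(default_factory=list)


@dataclass
class DatasetRecord:
    """Minimal metadata record for one data file (illustrative only)."""
    identifier: str
    confidential: bool
    events: List[LifecycleEvent] = field(default_factory=list)

    def log(self, stage, when, risk_factors=()):
        # Append a lifecycle event together with any risks noted at that stage.
        self.events.append(LifecycleEvent(stage, when, list(risk_factors)))

    def recorded_risks(self):
        # Every risk factor logged so far, in order, without duplicates.
        seen = []
        for event in self.events:
            for risk in event.risk_factors:
                if risk not in seen:
                    seen.append(risk)
        return seen

    def needs_review(self):
        # Assumed rule: a confidential file with any logged risk gets reviewed.
        return self.confidential and bool(self.recorded_risks())


record = DatasetRecord("survey-001", confidential=True)
record.log("production", date(2010, 1, 15), ["direct identifiers present"])
record.log("deposit", date(2010, 2, 1))
record.log("dissemination", date(2010, 2, 20), ["small cell counts"])

print(record.recorded_risks())
print(record.needs_review())
```

Because the risks are stored with the lifecycle events rather than in a separate note, a script can ask the same question of every file in a collection, which is the sense in which metadata helps automate asset management.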
DLI Training: making data service providers smarter
• Throughout these environmental changes, one mission of DLI Training is to support data service providers as they make the transition to new ways of providing access to research data and of preserving data.