Data Curation at NEES
Claude Trottier, Data Curator
Site Operations Managers Workshop

Presentation Outline
• Data curation accomplishments
• What data curation looks like
• Data curation workflow
• How researchers and local site personnel can contribute to data curation
• Proposed new Data Archiving and Sharing Plan (Attachment E of the RPA)
• Future goals and tasks

NEES Data Curation Accomplishments Since 9/2006
• Data Curation Application designed, coded, and implemented in NEEScentral
• Surveyed and "registered" all existing projects in NEEScentral
• Identified an initial set of "mature projects/experiments" to curate
• Curated 19 experiments from 6 projects
• Curated 636 files

Definition of Data Curation
Preparing the contents of NEES research projects, and the publications that follow from them, so that they remain accessible indefinitely and the research can be replicated.

Data Conduit Reference Model
[Figure: flowchart of the data conduit, from pre-award planning through publication. Labels recoverable from the figure are listed here.]
Phases
• Before: -1 Pre-award; 0 Plan
• During experiment: 1 Acquire
• After: 2 Post-process; 3 Encapsulate; 4 Curate; 5 Publish
Tasks
• 0.1 Goals; 0.2 Design; 0.3 Build Data Archiving & Sharing Plan
• 1.1 Record DAQ; 1.2 Organize; 1.3a Initial Repository; 1.3b Backup Repository
• 2.1 Verify, Aggregate, Convert; 2.2a Interim Repository+ (uncurated); 2.2b Document
• 3.1 Encapsulate; 3.2 Central Repository (uncurated)
• 4.1 Register; 4.2 Assign Tags; 4.3 Certify; 4.4 Permanent Repository (curated)
• Other labels: Approval; Release
Data states
• Raw data; converted data; structured data; curated data
• Mapping to Local Data Model; mapping to Global Data Model
Notes
• All decisions at the time of the experiment that do not affect Equipment Site safety are the responsibility of the Researcher.
• Repeat for each Trial; repeat for each Experiment.
• Deadline: by 6 months from the end of the experiment or simulation.
• * Responsibility is variable, depending on project and Equipment Site practices. Responsibilities are to be defined in the Research Participation Agreement.
• + The Interim Repository may reside at the Equipment Site, the Researcher Institution, or the Central Repository.
• Deviations are to be specified in the Research Participation Agreement.
Legend
• Roles (primary role in performing a task; primary responsibility to assure task completion): Data Librarian, Researcher, Equipment Site
• Symbols: document, process, predefined process, direct data, stored data

Data Curation Workflow
• Experiment executed
• Researchers prepare data for uploading to NEEScentral
• Researchers curate experiment data
• Experiment data uploaded to NEEScentral within six months after experiment completion
• Data Curator curates the experiment files
• Data curation report sent to researcher
• Researcher reviews the curation and sends feedback to the Data Curator
• Data Curator revises experiment data curation
• Researcher reviews experiment data curation and publishes or returns to Data Curator

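The workflow can be read as a small state machine with a review loop between the researcher and the data curator. The following is a minimal sketch under that reading, not the NEEScentral implementation: the stage names, the `ALLOWED` table, and the helpers `advance` and `upload_deadline` are all hypothetical, and the ordering of the stages is an interpretation of the flowchart.

```python
import calendar
from datetime import date
from enum import Enum, auto


class Stage(Enum):
    """Hypothetical stage names for the curation workflow."""
    EXECUTED = auto()   # experiment executed
    PREPARED = auto()   # researchers prepared and curated their data
    UPLOADED = auto()   # data uploaded to NEEScentral
    CURATED = auto()    # data curator curated the experiment files
    REPORTED = auto()   # curation report sent to the researcher
    FEEDBACK = auto()   # researcher reviewed and sent feedback
    REVISED = auto()    # data curator revised the curation
    RETURNED = auto()   # researcher returned the curation to the curator
    PUBLISHED = auto()  # researcher published the experiment


# Assumed transitions; RETURNED -> REVISED models the review loop.
ALLOWED = {
    Stage.EXECUTED: {Stage.PREPARED},
    Stage.PREPARED: {Stage.UPLOADED},
    Stage.UPLOADED: {Stage.CURATED},
    Stage.CURATED: {Stage.REPORTED},
    Stage.REPORTED: {Stage.FEEDBACK},
    Stage.FEEDBACK: {Stage.REVISED},
    Stage.REVISED: {Stage.PUBLISHED, Stage.RETURNED},
    Stage.RETURNED: {Stage.REVISED},
    Stage.PUBLISHED: set(),
}


def advance(current: Stage, target: Stage) -> Stage:
    """Return the target stage, or raise if the move is not allowed."""
    if target not in ALLOWED[current]:
        raise ValueError(f"cannot move from {current.name} to {target.name}")
    return target


def upload_deadline(completed: date) -> date:
    """Date six calendar months after experiment completion.

    The day is clamped to the last day of the target month.
    """
    month_index = completed.month - 1 + 6
    year = completed.year + month_index // 12
    month = month_index % 12 + 1
    last_day = calendar.monthrange(year, month)[1]
    return date(year, month, min(completed.day, last_day))
```

A tracking tool built along these lines could refuse to mark an experiment as published until the curator's revision has been reviewed, and could flag experiments whose upload deadline has passed.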
Data Curation Checklist
• Guides researchers on the data curation they should do before uploading experiment files to NEEScentral
• Not all items on the checklist apply to every project
• Can be used as a communication tool between the researchers and the data curator

Data Curation Checklist Form
The full form is available in NEEScentral under the NEES Data Curation project / Public Directory / Data Curation Checklist folder.

Detailed Data Curation Task List
• Survey the experiment's content in NEEScentral
• Read the documentation
• Determine ontology terms
• Examine the contents of files to decide on title, description, and ontology terms
• Form descriptive data file titles based on the experiment and/or trial name or title
• Associate relevant ontology terms with the file
• Associate converted, corrected, and derived files with the source unprocessed (raw) data file

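The last three tasks amount to attaching metadata to each file and linking every processed file back to its raw source. The following is a minimal sketch of such a record, assuming a simple in-memory model: `CuratedFile`, `make_title`, and `raw_source` are hypothetical names, and the example paths and ontology terms are invented for illustration. It does not describe the Data Curation Application's actual schema.

```python
from dataclasses import dataclass, field
from typing import Optional, Set


@dataclass
class CuratedFile:
    """Hypothetical curation record for one experiment file."""
    path: str
    title: str
    description: str = ""
    ontology_terms: Set[str] = field(default_factory=set)
    # The file this one was converted, corrected, or derived from.
    # None means this is an unprocessed (raw) data file.
    source: Optional["CuratedFile"] = None


def make_title(experiment: str, trial: str, kind: str) -> str:
    """Build a descriptive title from experiment, trial, and file kind."""
    return " - ".join(part for part in (experiment, trial, kind) if part)


def raw_source(f: CuratedFile) -> CuratedFile:
    """Follow source links back to the unprocessed (raw) data file."""
    while f.source is not None:
        f = f.source
    return f


# Usage: a raw file, a converted file, and a file derived from the conversion.
raw = CuratedFile(
    path="trial1/daq_output.dat",
    title=make_title("Experiment 1", "Trial 1", "raw data"),
    ontology_terms={"acceleration"},
)
converted = CuratedFile(
    path="trial1/converted.csv",
    title=make_title("Experiment 1", "Trial 1", "converted data"),
    ontology_terms={"acceleration"},
    source=raw,
)
derived = CuratedFile(
    path="trial1/peaks.csv",
    title=make_title("Experiment 1", "Trial 1", "derived data"),
    source=converted,
)
print(raw_source(derived).path)
```

Storing only the immediate source on each record keeps the links easy to maintain, while the chain can still be walked back to the raw file when needed.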
Proposed New Data Archiving and Sharing Plan (DASP)
• Captures more structured information than the current DASP form
• Tells researchers early on what they need to do to conform to the data curation standards
• Sets file type standards for curation and archiving

Data Curation: The Future
• Curate all currently completed experiments
• Enhance the Data Curation Application
• Publish and present data curation information to the NEES community and the wider earthquake engineering community
• Establish a productive communication channel with the researchers
• Provide NEES management and NSF with visibility into experiment and curation status
• Develop a long-term archiving strategy