1 / 22

The Data Management Plan (DMP) and your NSF proposal

The Data Management Plan (DMP) and your NSF proposal. Raleigh L. Martin - AAAS S&T Policy Fellow, NSF Geosciences AUGUST 2, 2019 NSF Early Career Workshop. This presentation does not reflect an official position of AAAS or of the U.S. National Science Foundation.

shawnb
Download Presentation

The Data Management Plan (DMP) and your NSF proposal

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. The Data Management Plan (DMP) and your NSF proposal Raleigh L. Martin - AAAS S&T Policy Fellow, NSF Geosciences AUGUST 2, 2019 NSF Early Career Workshop This presentation does not reflect an official position of AAAS or of the U.S. National Science Foundation.

  2. Introduction: What is a Data Management Plan (DMP)? Data collection overview • Dataset 1: • Data type • Formats & standards • Access & sharing • Policies for reuse • Archiving & preservation • Plan for generating, sharing, and archiving project data • Required 2 page supplementary document for all NSF proposals (single DMP across collaborative) • NSF PAPPG (Proposal & Awards Policies & Procedures Guide) gives general policy, GEO Divisions & Programs define specifics Dataset 2 Dataset 3 ... Sample template for DMP

  3. Presentation Overview • Preparing the DMP for your proposal • Executing the DMP for your awarded project • Additional considerations for DMPs

  4. I. Preparing The DMP for your Proposal • Outline your DMP – What datasets to be generated? Where and how to manage them? • Plan for each data object – Data standards, access timelines, preservation • Integrate DMP with your broader proposal – Feasibility of management, budget, etc.

  5. Preparing The DMP for your Proposal • Outline your DMP – What datasets to be generated? Where and how to manage them? • Plan for each data object • Integrate DMP with your broader proposal

  6. Data collection overview A. Outline YOUR DMP • Dataset 1: Raw sensor records • Only needed for short-term analysis • Store on university server • List out the data to be generated – e.g., raw data, processed data, software, physical samples, model output, curricula, ... • Determine the value of each data object – What is required to support publications? What is valuable for long-term reuse? What can be discarded? • Identify resources for managing data – e.g., university server for short-term, disciplinary data repository for long-term preservation of specialized data, general purpose repository for other data • Dataset 2: Analysis software • Necessary for reproducibility • Store in general-purpose repository • Dataset 3: Processed dataset • Valuable for long-term reuse • Store on disciplinary data repository

  7. What “data” Should be shared? “Investigators are expected to share with other researchers, at no more than incremental cost and within a reasonable time, the primary data, samples, physical collections and other supporting materials created or gathered in the course of work under NSF grants.” • Policy does NOT require that all data be shared forever • Exceptions/accommodations for “privileged or confidential information” • Specific division data policies and/or program solicitations provide further details NSF 19-1 Proposal & Award Policies & Procedures Guide (Jan 2019), Chapter XI.D.4.b

  8. Where can the data go? • “Domain” repository (e.g., Arctic Data Center, Biological and Chemical Oceanography Data Management Office, Hydroshare) – This is ideal if your data fit within the discipline of the repository, and it may be required for certain programs. • Institutional repository (e.g., university, museum) – This may be required by your university of research institution. • General repository (e.g., Figshare, Zenodo, Dryad) – A good backup option. • Journal article: • Tables and figures – good for very high-level information, but not machine readable • Article supplements – not advised: very difficult to access, often behind paywall

  9. NSF/GEO Division-specific requirements Directorate for Geosciences—Data Policies (https://www.nsf.gov/geo/geo-data-policies/) Note that many GEO programs specify additional data requirements through their solicitations.

  10. Preparing The DMP for your Proposal • Outline your DMP • Plan for each data object – Data standards, access timelines, preservation • Integrate with your broader proposal

  11. B. Plan for each data OBJECT in your DMP NSF PAPPG (Section II.C.j) – suggested elements for DMP • Type of datato be produced (e.g., raw data, analyzed data, model outputs, software, physical samples, curricula) • Data and metadata standards to be used (i.e., file formats, disciplinary standards, coding language, etc.) • Policy for access and sharing (i.e., repository selection, protocol for access and citation, timeline of availability) • Policy for reuse and distribution (i.e., licenses, reuse limitations) • Plan for archiving and preservation (i.e., forever or a finite period?)

  12. Example: Processed Dataset Project: Integrating sensor data to understand wind-driven sediment transport Data collection overview Dataset 1: Raw sensor records Dataset 2: Analysis software Dataset 3: Calibrated sediment flux time series Type of data: Processed sediment flux time series data, derived from raw sensor measurements Data standard: Comma-delimited text file (.csv) Data access: Freely available on Zenodo.org upon publication of accompanying article or within 2 years of collection (whichever is sooner), to be assigned DOI for citation Data reuse: CC-BY license (anyone may reuse with attribution) Data preservation: Long-term (>10 years), responsibility of Zenodo.org Wind Sediment flux

  13. Preparing The DMP for your Proposal • Outline your DMP • Plan for each data object • Integrate DMP with your broader proposal – Feasibility of project management, budget, etc.

  14. C. Integrate DMP with your broader proposal • Results of Prior NSF Support– If you have previously been supported on an NSF award, products of your past data sharing should be listed in this section of the Project Description (see PAPPG II.C.2.iii.(e)) • Biographical Sketch (“Biosketch”)– “Products” listed on your biosketch can include data sets and other digital products (see PAPPG II.C.2.f.(c)) • Budget Justification–This should indicate appropriate allocation of time and resources for data management. NOTE: Budget may include data deposit fees (though most NSF-supported repositories do not charge such fees) • Letters of Collaboration – Not usually needed, but could be helpful if DMP indicates use of data facility resources beyond typical capacity

  15. Presentation Overview • Preparing the DMP for your proposal • Executing the DMP for your awarded project • Additional considerations for DMPs

  16. II. Executing the DMP for your awarded project • Initiating your project– Coordinate with your project team and stakeholders • Award reporting – Document data progress and final data publication • Things to consider after award completion – delayed data release, future proposals

  17. A. Initiating Data management for your project • Coordinate with your project team – Make sure that project roles for data management are clearly allocated to fulfill the objectives of your DMP • Connect to target data facilities – Early on, discuss steps for submitting datasets, including fulfillment of data standards. Sharing your DMP may help. • Get help from your institution – Your library likely offers dedicated research data services to support you on your project. https://arcticdata.io/submit/

  18. B. Award Reporting for your DMP • Where to report – Report on data activities in the “Products-Websites” section of your annual and final project reports • Annual Project Report – Provide a status update on progress toward goals of your DMP, including datasets that have been made publicly available within the last year. If your plans have changed, please explain. • Final Project Report – List datasets that have been made publicly available. When listing data, provide a citation with a links back to the repository (e.g., via DOI); this will streamline compliance checking by Program Director

  19. C. Things to consider after award completion Certain Division data policies allow a finite period of time between final data collection and subsequent data publication, which may extend beyond the award completion date. In such cases: • Final Project Report – State plans for future data sharing. It may be possible to pre-populate repository with metadata entry before full data release. • When dataset is published – Inform your managing Program Director • In subsequent NSF proposals – Refer to published data products in “Results of Prior NSF Support” and in your Biosketch

  20. Presentation Overview • Preparing the DMP for your proposal • Executing the DMP for your awarded project • Additional considerations for DMPs

  21. III. Additional considerations for DMPs • Think of the audience – Be explicit and keep it succinct • Keep it organized – I suggest describing each dataset to be generated. DMP template tools can help (e.g., DMPtool, ezDMP) • Get help – Data management experts at your institution’s library or at relevant data facilities can be great resources https://dmptool.org/

  22. Questions? NOTE:All statements here are my own and do not necessarily reflect official NSF policy. Talk to your Program Director and examine the PAPPG (Proposal & Awards Policies & Procedures Guide), Division data policies, and program solicitations for definitive statements on NSF policy.

More Related