130 likes | 342 Views
ORNL DAAC: Data and Services. Robert Cook and Suresh SanthanaVannan Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN Presentation at DataONE EVA Working Group Meeting Albuquerque, NM November 17-19, 2009. ORNL DAAC: What’s that?.
E N D
ORNL DAAC:Data and Services Robert Cook and Suresh SanthanaVannan Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN Presentation at DataONE EVA Working Group Meeting Albuquerque, NM November 17-19, 2009
ORNL DAAC: What’s that? • Oak Ridge National Laboratory Distributed Active Archive Center • Archive data products produced by projects within NASA’s Terrestrial Ecology Program • Mission: • assemble, distribute, and provide data services for a comprehensive archive of terrestrial biogeochemistry and ecological dynamics observations and models to facilitate research, education, and decision-making in support of NASA’s Earth science. ORNL DAAC’s Web Site www.daac.ornl.gov
LBA LBA BOREAS S2K S2K LAI/fPAR NPP LAI/fPAR NPP ORNL DAAC: Data Collections (Number of Data Sets = 826) 1. Field Campaigns (647) 2. Validation of Remote Sensing Products (21) • 6-9 year intensive study of a region: • Amazon (LBA) • Northern Canada (BOREAS) • Southern Africa • (SAFARI 2000) In-situ Observations Remote Sensing ? 4. Model Code (9) 3. Regional and Global Studies (147) • Benchmark Models • IBIS, BIOME-BGC, LSM • Manuscript Models • PNeT, Century, Biome-BGC • Climate • Soils • Vegetation • Hydroclimatology
Total = ~500 GB Number of Data Sets Median = 488 KB Mean = 512 MB ORNL DAAC Data Holdings (2009)
Data CharacteristicsNumber of files (granules) per data set Median = 2 granules per data set Mean = 263 granules per data set
Tools and Services: For the highly diverse ORNL DAAC community Global land surface modelers • Discovery: FTP browse, OPeNDAP catalog • Formats: ASCII Grid, netCDF, HDF, binary • Tools: FTP Download, OPeNDAP servers(THREDDS (Thematic Real-time Environmental Distributed Data Services) Data Server catalog) Spatial data / Remote Sensing users • Discovery: FTP, Metadata catalog, Catalog of Spatial Data • Formats: ASCII Grid, GeoTIFF, shape files • Tools: MODIS Subsetting Tools, Spatial data download, WebGIS Field investigators • Discovery: Metadata catalog, Google Search • Formats: ASCII tables • Tools: Databases, spreadsheets, visualization (WebGIS, MODIS Time Series, SPEC/ ISIS), on-line services (MODIS Subsets) Spectrum of users is wide
User Working Group • Board of Directors function • not FACA (Federal Advisory Committee Act) • Serves a peer review function for an on-going project (began in 1992) • Represent the scientific interests of the research community – members are data providers and data users • Scientists funded by NASA and other agencies (NSF LTER) • ORNL DAAC’s NASA Program Scientist (Diane Wickland) • Assist in defining the DAAC's science goals, setting priorities • Driving force for DAAC evolution over the past decade • Provide guidance on DAAC activities, including data set acquisition, development of tools and services, incorporation of new technologies
Observations • Data centers should be a partnership of data managers, data providers, and data users • Seek formal guidance from “User Working Group” • Data centers should facilitate new science • Metric is citations to data sets • Changing research demands and advances in information technologies are creating new roles for data centers
Close coordination / communication among data managers, those making the measurements, modelers, and other data users is critical Data management coordination through User Working Group Users Collectors Data Management and Analysis System
Metadata needed to Understand Data The details of the data …. parameter name Measurement sample ID location date For those on Investigator’s team, amount of metadata required to understand the data is small
Metadata Needed to Understand Data: 20-year perspective words, words units method Parameter def. lab field Method def. method Units def. parameter name Units media date words, words. QA def. QA flag Measurement Record system records generator sample ID location date GIS org.type name custodian address, etc. coord. elev. type depth Sample def. type date location generator From Raymond McCord, ORNL
Provide tutorial on “Best Practices for Preparing Ecological Data to Share” • Cook et al. 2001 Bull. ESA. 82: 138 – 141 • Best Practices include: 1. Assign Descriptive File Names 2. Use Consistent and Stable File Formats 3. Define the Parameters 4. Use Consistent Data Organization 5. Perform Basic Quality Assurance 6. Assign Descriptive Data Set Titles 7. Provide Documentation • Update on-line: http://daac.ornl.gov/PI/pi_info.html Best Practices for Preparing Ecological and Ground-Based Data Sets to Share and Archive Robert B. Cook, Richard J. Olson, Paul Kanciruk, and Leslie A. Hook Environmental Sciences Division Oak Ridge National Laboratory