1 / 31

Unidata 2008: Shaping the Future of Data Use in the Geosciences

Unidata 2008: Shaping the Future of Data Use in the Geosciences. Expanding Horizons: Using Environmental Data for Education, Research, and Decision Making 23 June 2003 Boulder, CO Mohan Ramamurthy Unidata Program Center UCAR Office of Programs Boulder, CO. Thank you, Dave.

gaston
Download Presentation

Unidata 2008: Shaping the Future of Data Use in the Geosciences

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Unidata 2008:Shaping the Future of Data Use in the Geosciences Expanding Horizons: Using Environmental Data for Education, Research, and Decision Making 23 June 2003 Boulder, CO Mohan Ramamurthy Unidata Program Center UCAR Office of Programs Boulder, CO

  2. Thank you, Dave • I wish to take this opportunity to extend my sincerest gratitude to Dave Fulker, the founding director of Unidata, for his distinguished service to the Unidata Community for nearly 20 years. • Unidata would not be what it is today without his vision, leadership, energy and his many extraordinary qualities. • And thank you to Ben Domenico for his excellent stewardship during the transition.

  3. The Word of the Day for Jun 23rd is • The Word of the Day for Jun 23 is: • bloviate \BLOH-vee-ayt\ verb : to speak or write verbosely and windily [Courtesy: Jo Hansen, Unidata Program Center] Example sentence: Mohan can bloviate on a par with the windiest of professors, but he's also capable of being concise and getting right to the point. (yeah, right)

  4. Expanding Horizons • New Strategic Plan • New Director • New 5-year proposal • Many new and exciting initiatives • New logo!

  5. Unidata • Mission Statement: Provide data, tools, and community leadership for enhanced Earth-system education and research. At the Unidata Program Center, we • Facilitate Data Access • Provide Tools • Support Faculty and Staff • Build and Advocate for a Community user workshops come under this activity

  6. Technology Portfolio 1) McIDAS: A client/server analysis and display package, originally developed by U. Wisconsin/SSEC, that emphasizes image processing of data from satellite-borne sensors; 2) GEMPAK: An analysis, display, and product generation package for meteorological data; 3) Local Data Manager: Software for capturing, disseminating, and organizing data in near-real time; It is the heart of the Internet Data Distribution (IDD) system; 4) NetCDF: Asoftware interface for platform-independent access to self describing datasets; 5) Integrated Data Viewer: Java-based, platform-independent data analysis and 3D visualization tools; 6) THREDDS: A project to facilitate remote access to thematic, distributed, interdisciplinary data servers;

  7. Unidata as a Diverse Community • About 150+ sites are participating in Unidata Internet Data Distribution (IDD) system • 120 or so of those sites are in academia and the rest in government and research labs • User community is interdisciplinary - 2/3rd of sites have users outside atmospheric sciences

  8. Internet Data Distribution • Approximately 2 GB of data injected/hour from distributed sources; • Unidata IDD/LDM uses more of the Internet2 than any other advanced application; • Approx. 5 Terabytes of data transmitted each week. (Amount varies with weather) Model Satellite Radar By design, the system has no data center.

  9. Proposed WSR-88D Data Flow(NWS Plans)

  10. Active, student-centered learning Earth-system science or “holistic” approach to education Learning science by doing science Observations (data) Tools (models, visualization) Discovery Education Drivers (a.k.a. A Community-Articulated Need)

  11. Science Drivers Grand Challenges in Environmental Sciences National Research Council NSF Director Rita Colwell, 1998: "Interdisciplinary connections are absolutely fundamental. They are synapses in this new capability to look over and beyond the horizon. Interfaces of the sciences are where the excitement will be the most intense... ."

  12. Multidisciplinary Problems • Fire Danger determination requires taking into account past, present and future weather, fuel types, and the state of both live and dead fuel moisture. • Dead Fuel Moisture • Live fuel moisture (NDVI) • Drought conditions • Atmospheric stability • Lightning maps • Lightning ignition efficiency • Airflow • Recent rainfall • Rainfall forecast

  13. Dual-Polarization Radar use in Fire Weather Management • The differential reflectivity (ZDR) values are noteworthy in the smoke signal • Many regions show ZDR >+6 dB. • Suggests flattened ash particles (like corn flakes) Source: CHILL Radar Group, CSU

  14. Flooding due to Tropical Storms Tropical Storm Allison Research studies and emergency management of hurricane-induced flooding involve integrating data from atmospheric sciences, oceanography, hydrology, geology, geography, and social sciences.

  15. Multidisciplinary Synthesis • Requires integration of disparate datasets and databases from diverse sources that are distributed geographically and disciplinarily; • Needs integration of Scientific Information Systems with Geographic Information Systems • The integration poses numerous challenges; • However, such integration is critical to solving societal problems and advancing science. • Metadata is crucial to achieving integration

  16. Remote Sensing & Data Explosion • In the next 10 years, about 100 new satellite instruments will be launched to monitor the environment • Five-order magnitude increase in satellite data is expected during that period • GIFTS (Geostationary Imaging Fourier Transform Spectrometer) will have about 1700 channels and a resolution of 4 km • Each NPOESS satellite will generate one terabyte of data each day • Advances in Radar technology • 28 fold increase in WSR-88D data volume in 5 years • Phased-array radars will generate 100 fold increase data By 2004, NOAA will ingest more data in one year than was contained in the total archive in 1998.

  17. Advances in Modeling • Shift from a purely deterministic to a more probabilistic approach, requiring the use of ensemble modeling techniques. • Growing emphasis on multidisciplinary studies, requiring coupled models: • e.g., Hurricane landfall flooding problem: Atmospheric model (WRF/MM5), Ocean model (ROMS), Hydrologic model (HMS)

  18. Local Modeling: A Notable Trend • Over 30 universities are now running mesoscale models locally. • One can think of this aggregation as a national forecasting instrument • However, only one or two groups initializing their model runs with local observations • As the scale of these local model runs becomes finer, there is a natural desire to integrate their output with information from other sources (e.g., hydrology, infrastructure, societal datasets in GIS form) Iowa St. Linux Cluster

  19. Technology Drivers • Object-oriented programming • Open Standards, Interoperability and Open Source Movement • Metcalfe's Law: the usefulness, or utility, of a network increases as the square of the number of users. • Web services (HTTP, Java, XML, SOAP, UDDI, …) • Digital libraries (Metadata, discovery, information services…) • Grid environments and distributed computing • Commodity microprocessors • Cluster computing • High bandwidth networks: 10GigE, Fast IP, … • Broadband access • Wireless networks: 802.11 networks, GPRS, 3G • IPv6: Next-generation internet protocol • Collaborative computing • Scientific data mining and knowledge discovery

  20. Services Services Users Users Metadata repository Metadata repository Collections (Data, tools, educational materials) Collections (Data, tools, educational materials) Web Services and the Wild and Wooly World of Markup Languages • Web services is a technology and process for discovery and connection. • The eXtended Markup Language, XML, is accepted as THE emerging standard for data interchange on the Web. • XML allows authors to create their own markup, which has led to the proliferation of “MyOwn Markup Language”

  21. Five-year Core Funding NSF Proposal • Title: Unidata 2008: Shaping the Future of Data Use in the Geosciences • Six endeavors are proposed, focusing on Community and Support Services and Data Services, Systems, and Tools • The proposed endeavors will enable the community to advance scientific exploration, education, and decision-making. We are moving from an era of data provision to one in which data- and related web-services are emphasized “The unanimous finding of the panel is that the Unidata Program Center program be supported as fully as possible by NSF for the years 2003-2008.”

  22. Proposed Endeavors • Endeavor 1.Responding to a broader and more diverse community. • Respond to increased emphasis on Earth-system science (e.g., bring new data sets to the community) • Establish new partnerships with related communities (e.g. with Hydrology via CUAHSI) • Support new tools in technically less-sophisticated institutions (e.g., community colleges) • Endeavor 2.Comprehensive support services • Deploy web-based training modules • Simplify installation and maintenance for all supported packages • Explore new technologies (e.g., Access Grid) to facilitate remote collaboration

  23. Endeavor 3: Real-time, self-managing data flows • More flexibility and control • Many more feed types for finer control over routing and subsetting • Configurable product priorities • Self-managing data flows (automatic dynamic routing) • Application-level multicast looks promising for hundreds of sites (IP multicast not suitable due to limitations) • NLDM: data flooding via Usenet protocols may provide practical routing solution (needs more testing) • Support for new standards • Use of IP version 6 protocols  • Internet2, Grid and e-services standards (authentication, resource use, ...) • Location-transparency for data

  24. LDM-5 Vs. LDM-6 Latencies CONDUIT Experience Average delivery time : ~20 seconds to top-tier sites

  25. Endeavor 4.Software to analyze and visualize geoscience data • Integrate diverse datasets • Support analysis and visualization of local and climate modeling efforts • Develop collaborative tools to make effective use of shared visualizations • Allow customized user experiences • Adapt to GIS frameworks • Cloud water isosurface from COMMAS storm model data (courtesy Adam Houston and Dan Bramer, NCSA/UIUC)

  26. People Discovery andPublication Tools Analysis and Visualization Tools THREDDS Middleware Discovery and Publication Services Data Services Data CatalogServices CatalogGeneration Tools Documents Data

  27. THREDDS Servers GIS Servers THREDDS Server GIS Server THREDDS Server Satellite, radar, forecast model output, … datasets GIS Server Demographic, infrastructure, societal impacts, … datasets THREDDS, GIS, DL Interoperability THREDDS Client Applications GIS Client Applications OGC or OPeNDAPADDE. FTP… protocols OpenGIS Protocols: WMS, WFS, WCS OGC or proprietary GISprotocols Metadatacrosswalk Metadatacrosswalk Open Archives Initiative (OAI) Metadata Harvesting Digital Library Discovery Systems

  28. NetCDF-HDF Integration File File Endeavor 6: Improved data access infrastructure Proposed Implementation Current Implementation • Extend netCDF to high-performance computing environment • Implement parallel I/O, large grids, etc. • Work will directly benefit WRF and CCSM communities Application Application netCDF netCDF HDF5(serial and/or parallel) POSIX I/O Split files Custom POSIX I/O Stream MPI-I/O User-defined device Network or to/from another application Parallel file system Meta-data Rawdata

  29. The Visual Geophysical Exploration Environment (VGEE) • The VGEE is an integrated framework in which students use visualization tools, data, and curricular materials to learn basic physical principles of atmospheric science • It includes: • A learner interface to the IDV • Java-based concept models to support physical insight • A curriculum to guide inquiry • A catalog of data (THREDDS)

  30. Identify RelateExplainIntegrate VGEE: An Integrated Framework Concept Models, which are used to explore relations in an idealized context. Students notice that the Western Pacific is considerably warmer than the East.

  31. Concluding Remarks • We live in an exciting moment in the history of the Earth sciences. • Workshops like this and the diversity of representation from academia are testimony to the vibrancy of the community and the program. • The portfolio of tools and technologies within Unidata, coupled with the energies of a creative and collaborative community, puts us in an ideal position to meet the important challenges facing the education and research communities in the atmospheric and related sciences.

More Related