220 likes | 303 Views
Water. EarthScope. Oceans Observations. Cyberinfrastructure in an Era of Observation and Simulation. Eva Zanzerkia, NSF GEO/EAR. Environmental Obs. Earth System Modeling. Satellites. This Talk. Context at the National and Agency scale CIF21 Data Management What this means for CUAHSI.
E N D
Water • EarthScope • Oceans Observations • Cyberinfrastructure in an Era of Observation and Simulation Eva Zanzerkia, NSF GEO/EAR • Environmental Obs • Earth System Modeling • Satellites
This Talk • Context at the National and Agency scale • CIF21 • Data Management • What this means for CUAHSI
Framing the Challenge:Science and Society Transformed by Data • Modern science • Data- and compute-intensive • Integrative, multiscale • Multi-disciplinary Collaborations for Complexity • Individuals, groups, teams, communities • Sea of Data • Age of Observation • Distributed, central repositories, sensor- driven, diverse, etc
NSF ACCI: Innovation and New Paradigms • Multiple communities must engage rapidly, integrate observations, expertise • New multidisciplinary research communities and teams • Teams notified of events by social networks, mobile • Simulations by global teams • New types of compute, data, collaboration and software infrastructure • Comprehensive Computational and Data-Enabled Science & Engineering (CDS&E) • Fundamental to support 21st century science and engineering
New Budget Thrust for 2012: CIF 21 • Broad Principles of CIF21 • Comprehensiveand integratedcyberinfrastructure to transform research, innovation and education • Focus on computational and data-intensive science to address complex problems • Increase of $117 million over FY 2010 enacted level • Four major components • Caveat: This is only the president’s budget request to congress
CIF21: Four Thrust Areas Organizations Universities, schools Government labs, agencies Research and Medical Centers Libraries, Museums Virtual Organizations Communities Community Research Networks Scientific Instruments Large Facilities, MREFCs,telescopes, Colliders, shake Tables, laboratory, field deployment, Sensor Arrays Expertise Research and Scholarship Education Learning and Workforce Development Interoperability and operations Cyberscience Data-Enabled Science Data Databases, Data repositories Collections and Libraries Data Access; storage, navigation management, mining tools, curation, privacy Computational Resources Supercomputers Clouds, Grids, Clusters Visualization Compute services Data Centers Discovery Collaboration Education New Computational Resources Networking Campus, national, international networks Research and experimental networks End-to-end throughput Cybersecurity Software Applications, middleware Software development and support Cybersecurity: access, authorization, authentication Access and Connections to Resources
Data-Enabled Science • Data Services Program (data) • Provide reliable digital preservation, access, integration, and analysis capabilities for science and/or engineering data over a decades-long timeline • Data Analysis and Tools Program (information) • Data mining, manipulation, modeling, visualization, decision-making systems • Data-intensive Science Program (knowledge) • Intensive disciplinary efforts, multi-disciplinary discovery and innovation Observations GIS Models Remote Sensing Climate
New Computational Infrastructure • Computational and data-enabled resources • HPC, Clouds, Clusters, Data Centers • Long-term software for science and engineering • Sustained software development and support • Discipline-specific activities • Services, tools, compute environments that serve specific research efforts and communities Wyoming SCS
Access and Connectivity • Network connections and engineering program • Real-time access to facilities and instruments; Begins to tie in MREFC activities • Integration and end-to-end performance to provide seamless access from researcher to resource • Cybersecurity – from innovation to practice • Deployment of identity management systems • Development of cybersecurity prototypes
Community Research Networks • New multi-disciplinary research communities • Address challenges beyond individuals and disciplinary research communities • Support and optimize collaboration across small, mid-level and large community networks • Advanced research on community and social networks • Structures, leadership, fostering and sustainability • “virtuous cycle” providing feedback through formal evaluation and program iteration
CIF21 – Geosciences Foci • GEO plans to leverage substantial CI investments • Connect Well-established facilities: ES, OOI, IRIS, UNAVCO, IEDA, NCAR, CIG, CUAHSI,etc.... • EAR Geoinformatics program • Participation in NSF-wide activities: PetaApps, SI2 • Partnerships with other agencies and industry • Long-term plan: connect resources, science, users through National cyberinfrastructure • New and enhanced computational platforms, tools, data centers to analyze, manipulate, visualize and share large and complex data sets • a framework for open and easy access of all geoscience data and integrationg with other domains • Infrastructure and technology for retrieving and sharing observational data • Sustained training programs to create a workforce capable of multi-disciplinary science
EarthCube • GEO and OCI are working in partnership to support the development of a geosciences wide cyberinfrastructure • DCL: nsf11065 http://www.nsf.gov/pubs/2011/nsf11065/nsf11065.jsp?WT.mc_id=USNSF_179 • Webinar July 11, 2011. • Early Fall: charrette meeting to bring the geosciences community together and focus a design • Prototypes (up to 3) will be supported in early Spring
Multiple Modes of Support Are Necessary for EarthCube and CIF21 • “Modes of support” that are essential to build CIF21 infrastructure and to engage in CIF21 activities. • Focused grants to individual PIs or small groups • Focused programs that are community driven • Small centers • Large national centers • Cyber-enhanced field programs • Cyber-enhanced observing facilities and MREFC projects • NSF-wide initiatives • Education, outreach, and training activities (EOT)
EarthCube Modes of Support Well-Connected through CIF21 Infrastructure Loosely or Not Connected Access and Connection New Computational Infrastructure Networks Data-Enabled Science 10-15 Year Timeline
CIF21 and CUAHSI • CIF21 and EarthCube is still being developed. There is the opportunity to impact implementation through participation. • CIF21 dovetail with CUAHSI’s Strategic Plan: • Observation and Synthesis • improving accessibility to observational networks • data discovery and community modeling • Data Access • maintain services for diverse data • develop citation and tracking mechanisms • emerging data types and needs • outreach for standards • What are the capabilities that water sciences researchers will need in the future?
Where can CI Enhance Partnerships? • Connect observing systems • Connect to other domains and resources: • DataNet; computational facilities; synthesis centers
Data Management Plans Long-standing policy states NSF’s expectations with respect to sharing of data and other research products: • Grant Conditions:“NSF expects investigators to share with other researchers, at no more than incremental cost and within a reasonable time, the data, samples, physical collections and other supporting materials created or gathered in the course of the work. It also encourages grantees to share software and inventions or otherwise act to make the innovations they embody widely useful and usable.”
Data Management Plan • All proposals must include, as a supplementary doc up to 2 pages, a Data Management Plan. • Plan should describe how the proposal will conform to NSF policy on dissemination and sharing of research results. • Plan will be reviewed as part of the intellectual merit and/or broader impacts of the proposal depending on the proposal intent. • FastLane will not permit submission of a proposal that is missing a data management plan.
What’s Included? • Anticipated data, samples, physical collections, software, curriculum materials, anything produced during the reseach. • Information on analytical standards, metadata, solutions or remedies where these do not exist. • Plans for data/sample access and sharing, including provision for protection of privacy/confidentiality/ security, intellectual property, or other rights, if appropriate. • Plans for re-use, re-distribution, and production of derived data sets. • Plans for archiving/preserving data, samples, and other research products and providing access to others.
Data Management Plan Resources • http://www.nsf.gov/geo/geo-data-policies/index.jsp • Data Policies and Community Standards • Templates • Review of DMP is an evolving process. More guidance on review will be forthcoming.
Opportunities for CUAHSI • CIF21: Integration and Partnerships • Take advantage of resources • Wyoming Supercomputing Center • Powell Center • TeraGrid (XD) • DataNets: DataOne, Data Conservancy • Upcoming Competitions • SI2: NSF 11-539 July 18, 2011 • EAR Geoinformatics (?): Will be tied to CIF21 and reports from recent Geoinformatics WS • Data management plans are an opportunity • Set community standards for peer review • Provide resources for a broad community • Articulate your science needs and tools • Be flexible: technology advance outpaces integration into science; • Unanticipated Scientific Discovery is still allowed!