140 likes | 272 Views
Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure CyberBridges Workshop June 25, 2012. 1. Framing the Challenge: Science and Society Transformed by Data. Modern science Data- and compute-intensive Integrative, multiscale
E N D
Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure CyberBridges Workshop June 25, 2012 1
Framing the Challenge:Science and Society Transformed by Data • Modern science • Data- and compute-intensive • Integrative, multiscale • Multi-disciplinary Collaborations for Complexity • Individuals, groups, teams, communities • Sea of Data • Age of Observation • Distributed, central repositories, sensor- driven, diverse, etc
Explosive Growth in Size, Complexity, and Data Rates • Enormous data sets are being generated by modern experiments and observations • Automatic extraction of new knowledge about the physical, biological and cyber world continues to accelerate • Infusion of data-intensive computation into science, engineering and education is revolutionizing research • Multi-cores, concurrent and parallel algorithms, virtualization and advanced server architectures will enable data mining and machine learning, and new approaches for innovation and discovery
Computer Architecture Trends • Continuing growth in number of cores • Increased use of hybrid accelerators • Advances in interconnect technologies will slow; more complex memory subsystems will be deployed • Power consumption becoming ever more important because of cost and performance • Application performance will be dominated by data movement • Clouds and data centers will play an increasingly larger role in data and compute infrastructure
Software Challenges • Simulation and model scalability is a major requirement for algorithm research and development • Parallel programming research is required to address order of magnitude changes in compute resources • New operating systems, architectures, file systems research, fault tolerance, verification and validation, complex simulation, and cybersecurity • Inadequate numbers of software workforce and expertise being produced • Focus on sustainability and usability is essential
Cyberinfrastructure Framework for 21st Century Science and Engineering CIF21: Grand Challenge Communities Learning & Workforce Development Scientific Instruments Innovation, Discovery Data Advanced Computational Infrastructure Campus Bridging, Cybersecurity Software
Scientific Data Challenges Square Kilometer Array Climate, Environment Exa Bytes Peta Bytes Tera Bytes Giga Bytes Volume/Growth Genomics Bytes per day Useful Lifetime Climate, Environment TeraGrid, Blue Waters LHC LHC LSST Distribution Genomics Many smaller datasets… 2012 2020 Data Access
NSF Data strategy • Establish a national data infrastructure to support science, engineering and education • Ensure that this infrastructure stays at the most advanced state of sophistication and is sustainable • Support transformative interdisciplinary and collaborative research stimulated by data • Development of the next generation of compute and data intensive workforce • Development of a suite of policies for data, software, publications and other digital outputs
Advanced Computing Infrastructure Strategy • Foundational research to fully exploit parallelism and concurrency through innovations • Applications research and development in high end computing resources • Building, testing and deploying innovative resources in a collaborative environment • Development of comprehensive education and workforce programs • Development of grand challenge community programs
Creating Scalable SoftwareDevelopment Environments • Create a software ecosystem that scales from individual or small groups of software innovators to large hubs of software excellence Focus on innovation Focus on sustainability
Cyber-infrastructure: EarthCube Goal: to transform the conduct of research in geosciences by supporting community-based cyberinfrastructure to integrate data and information for knowledge management across the Geosciences. Community: More than 900 members subscribed to EarthCube web site. Second Charette: June 12-14 GEO-OCI Partnership
Some observations • Science and Scholarship are team sports • Collaboration/partnerships will change significantly • Growth of dynamic coalitions and virtual organizations • International collaboration becomes ever more important • Innovation and discovery will be driven by analysis • Mining vast amounts of new and disparate data • Collaboration and sharing of information • Mobility and personal control will continue to drive innovation and research communities • Gaming, virtualization and social networking will transform the way we do science, research and education
Solicitations that build Cyberinfrastructure • Data Infrastructure Building Blocks (DIBBs) • Software Infrastructure for Sustained Innovation (SI2) • EarthCube • Core Techniques and Technologies for Advancing Big Data Science & Engineering (BIG DATA) • Computational and Data Intensive Science and Engineering in the Mathematical and Physical Sciences (CDS&E) • Campus Cyberinfrastructure - Network Infrastructure and Engineering Program (CC-NIE) • Science, Engineering and Education for Sustainability NSF-Wide Investment (SEES) • Integrative Graduate Education and Research Traineeship, or IGERT (CIF21 Track)