1 / 36

Grid Enabling Open Science

Grid Enabling Open Science. Ian Foster Computation Institute Argonne National Lab & University of Chicago. Thanks!. DOE Office of Science NSF Office of Cyberinfrastructure Colleagues at Argonne, U.Chicago, USC/ISI, and elsewhere Mary Kratz and my other colleagues involved in US HealthGrid.

zofiah
Download Presentation

Grid Enabling Open Science

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. GridEnabling Open Science Ian Foster Computation Institute Argonne National Lab & University of Chicago

  2. Thanks! • DOE Office of Science • NSF Office of Cyberinfrastructure • Colleagues at Argonne, U.Chicago, USC/ISI, and elsewhere • Mary Kratz and my other colleagues involved in US HealthGrid

  3. What is the Grid? “The Grid is an international project that looks in detail at a terrorist cell operating on a global level and a team of American and British counter-terrorists who are tasked to stop it” Gareth Neame, BBC's head of drama

  4. Well, Not Exactly! “The Grid is an international project that looks in detail at scientific collaborations operating on a global level and a team of computer scientists who are tasked to enable it” At least, that’s where it started …

  5. When power transmission became (quasi-)ubiquitous …

  6. We No Longer Had to Travel to Power Plants

  7. We Invented New Tools

  8. We Worked in New Ways

  9. As telecommunications … … become (quasi-)ubiquitous

  10. We Can Access Computing on Demand Public PUMA Knowledge Base Information about proteins analyzed against ~2 million gene sequences Back OfficeAnalysis on Grid Millions of BLAST, BLOCKS, etc., onOSG and TeraGrid Natalia Maltsev et al.,http://compbio.mcs.anl.gov/puma2

  11. We Can Invent New Tools Bennett Berthenthal et al., www.sidgrid.org

  12. We Can Workin New Ways Credit: Jonathan Silverstein, U.Chicago

  13. DATA ACQUISITION ADVANCEDVISUALIZATION ,ANALYSIS COMPUTATIONALRESOURCES IMAGING INSTRUMENTS LARGE-SCALE DATABASES Grid:Unifying Concept & Technology • Enable on-demand access to, and federation of, diverse resources • Computers, storage, data, people, … • Resources can be distributed, heterogeneous • Networks & protocols provide the connectivity • Software provides the “glue”

  14. Technology, Infrastructure, & Standards interoperability Applications User-level Middleware and Tools integration System-level Common Infrastructure Resources For example

  15. Towards Open eScience Disease Climate change Sustainable energy Natural disasters

  16. Empirical Theory Simulation Data Emergence of New Problem Solving Methodologies <0 1700 1950 1990 eScience: When brute force doesn’t work anymore (Szalay)

  17. First Generation Grids:Batch Computing Focus on aggregation of many resources for massively (data-)parallel applications EGEE Globus

  18. fMRI: A Broad Picture Globus

  19. fMRI Performance Study Yong Zhao and Ioan Raicu, U.Chicago

  20. Second Generation Grids:Service-Oriented Science • Empower many more users by enabling on-demand access to services • Grids become an enabling technology for service oriented science (and business) • Grid infrastructures host services • Grid technologies used to build services Science Gateways “Service-Oriented Science”, Science, 2005

  21. “Web 2.0” • Software as services • Data- & computation-richnetwork services • Services as platforms • Easy composition of services to create new capabilities (“mashups”)—that themselves may be made accessible as new services • Enabled by massive infrastructure buildout • Google projected to spend $1.5B on computers, networks, and real estate in 2006 • Many others are spending substantially • Paid for by advertising Declan Butler, Nature

  22. User Discovery tools Analysis tools Service-Oriented Science:E.g., Virtual Observatories Gateway Data Archives Figure: S. G. Djorgovski

  23. Service-Oriented Science People create services (data or functions) … which I discover (& decide whether to use) … & compose to create a new function ... & then publish as a new service.  I find “someone else” to host services, so I don’t have to become an expert in operatingservices & computers!  I hope that this “someone else” can manage security, reliability, scalability, … ! ! “Service-Oriented Science”, Science, 2005

  24. Skyserver Sessions Source: Alex Szalay

  25. Service-Oriented Science& Cancer Biology caBIG: sharing of infrastructure, applications, and data. Data Integration!

  26. Gene Database caArray Protein Database Grid Services Infrastructure (Metadata, Registry, Query, Invocation, Security, etc.) Image Microarray Tool 2 Tool 3 Cancer Bioinformatics Grid Grid-Enabled Client Analytical Service Tool 1 Tool 2 Research Center NCICB Grid Data Service Tool 3 Tool 4 Grid Portal Research Center Globus

  27. Grid PACS ( Image Storage ) GridFTP Server MEDICUS: Management of DICOM Images Grid Node Hospital Domain Imaging Technologist Imaging Modality . 1 Cache Image Storage Grid Node . Dicom Grid Interface Service ( DGIS ) Grid PACS Meta Catalog ( DICOM Image Attributes ) OGCE - DAI Globus Stephan Erberich, Manasee Bhandekar, Ann Chervenak, et al.

  28. Children’s Oncology Grid:A MEDICUS Deployment Globus

  29. DICOM images Send (publish) Query/Retrieve (discover) Grid Archive Fault tolerant Bandwidth Security Authentication Authorization Cryptography Access Web portal Applications Computing Data Mining DICOM Grid Interface Service (DGIS)+Meta Catalog Service (OGSA-DAI) Data Replication Service (DRS) X.509 Certificates + MyProxy Delegation Grid Web Portal, OGCE / GridSphere GRAM, OGSA-DAI MEDICUS Under the Covers Globus Globus Toolkit Release 4

  30. Grids are Communities of People as well as Computers • Based on (technology-mediated) trust • Common goals • Processes and policies • Reward systems • That share resources • Computers • Data • Sensor networks • Services • Supported by software, standards, infrastructure

  31. Grid Infrastructure Supports Communities interoperability Applications User-level Middleware and Tools integration System-level Common Infrastructure Resources For example

  32. GlobalCommunity

  33. Globus Downloads Last 24 Hours Globus

  34. Provider of commercial support, services, & products around open source Globus • Commercial distribution of GT4 & beyond • Univa Globus Cluster Edition • Univa Globus Enterprise Edition • Committed to open source & open standards • Univa is contributing to Globus open source • Big contributions to GT4 development, testing • New functionality: install shields, security configurator, GridFTP extensions • Additional contributions expected

  35. Megabytes & gigabytes Tarballs Journals Individuals Community codes Supercomputer centers Makefile Computational science Mostly physical sciences 1000s of computationalists Government funded  Terabytes & petabytes  Services  Wikis  Communities  Science gateways  Campus & national grids …  Workflow  Science as computation  All sciences (& humanities)  Millions of researchers  Gov & industry funded Science 1.0 Science 2.0

  36. Summary • Technology exponentials are transforming the nature of research • Data-, compute-, & communication-intensive approaches are increasingly influential • Grid is a unifying concept & technology • Federation & on-demand access to resources • An enabler of “service-oriented science” • Transforms how we conduct research & communicate results • Demands new reward structures, training, & infrastructure For more information: http://ianfoster.typepad.com

More Related