160 likes | 265 Views
Application Hosting Services — Enabling Science 2.0 —. Ian Foster Computation Institute Argonne National Lab & University of Chicago. “Web 2.0”. Software as services Data- & computation-rich network services Services as platforms
E N D
Application Hosting Services— Enabling Science 2.0 — Ian Foster Computation Institute Argonne National Lab & University of Chicago
“Web 2.0” • Software as services • Data- & computation-richnetwork services • Services as platforms • Easy composition of services to create new capabilities (“mashups”)—that themselves may be made accessible as new services • Enabled by massive infrastructure buildout • Google projected to spend $1.5B on computers, networks, and real estate in 2006 • Many others are spending substantially • Paid for by advertising Declan Butler, Nature
User Discovery tools Analysis tools Science 2.0:E.g., Virtual Observatories Gateway Data Archives Figure: S. G. Djorgovski
Science 2.0 People create services (data or functions) … which I discover … & compose to create a new function ... and then publish as a new service. I find “someone else” to host services, so I don’t have to become an expert in operatingservices & computers! I hope that this “someone else” can manage security, reliability, scalability, … ! ! “Service-Oriented Science”, Science, 2005
The Importance of “Hosting”and “Management” Tell me about this star Tell me about these 20K stars Support 1000sof users E.g., Sloan DigitalSky Survey, ~10 TB; others much bigger
Creating Web Services • Take an “application” • An arbitrary executables • A procedure in Java or another language • A Web service • And provide a “Web Service” interface • Address authentication & authorization • Pass input data (XML, files, …?) • Invoke the application • Permit monitoring & control • Return output data (XML, files, …?)
Creating Services:E.g.,Introduce Authoring Tool • Define service • Create skeleton • Discover types • Add operations • Configure security • Modify service See also: SOAPLab, OPAL, pyGlobus, Gannon, etc. Introduce: Hastings, Saltz, et al., Ohio State University
Data Service @ uchicago.edu <BPEL Workflow Doc> <Workflow Inputs> BPEL Engine link <Workflow Results> Composing Web Services:E.g., BPEL Workflow System link Analytic service @ duke.edu link link Analytic service @ osu.edu caBiG: https://cabig.nci.nih.gov/; BPEL work: Ravi Madduri et al.
Hosting & Management:Application Hosting Services Application providers Appln Code Application deployment Application Prep Tool(s) Provisioning Application client Resource Provider Appln Code Users Resource Provider Appln Code AHSmanagement Hosting Service Author ization Admins Persistence Policymanagement PDP
Apache Tomcat Service Container Service Service Service GetRP GetRP GetRP GetMultRPs GetMultRPs GetMultRPs EPR EPR EPR EPR EPR EPR EPR EPR EPR SetRP SetRP SetRP Resource Resource Resource QueryRPs QueryRPs QueryRPs RPs RPs RPs Subscribe Subscribe Subscribe SetTermTime SetTermTime SetTermTime ResourceHome ResourceHome ResourceHome Destroy Destroy Destroy Realization:We Have Much of This in GT4!! Security Authorization PIP PDP State Management Persistence WorkManager DB Conn Pool JNDI Directory GT4 Web Services Container Globus Toolkit Version 4: Software for Service-Oriented Systems, LNCS 3779, 2-13, 2005
Realization:We Have Much of This in GT4!! Credential management Service host(s) and compute element(s) Job events SEG GT4 Container Compute element GRAM services Local job control GRAM services Local scheduler Job functions sudo GRAM adapter Delegate Transfer request Client Delegation Delegate GridFTP User job RFT File Transfer FTP control FTP data Remote storage element(s) File staging GridFTP GT4 Web Services GRAM Globus Toolkit Version 4: Software for Service-Oriented Systems, LNCS 3779, 2-13, 2005
GT4 Application Hosting Service Prototype Joint work with Ravi Madduri
+ + + + + + + = Provisioning:Astro Portal Stacking Service • Purpose • On-demand “stacks” of random locations within ~10TB dataset • Challenge • Rapid access to 10-10K “random” files • Time-varying load • Solution • Dynamic acquisition of compute, storage Sloan Data S4 Web page or Web Service Joint work with Ioan Raicu & Alex Szalay
Preliminary Performance (TeraGrid, LAN GPFS) Joint work with Ioan Raicu & Alex Szalay
Next Steps • Integration with service authoring tools • More sophisticated provisioning • Dynamic, as in AstroPortal stacking service • More sophisticated management • Remote policy management • Auditing & accounting • Harmonization of different approaches • We welcome partnerships • On extending the technology • On applying the technology
Summary • “Science 2.0” or “service-oriented science” is not only sexy, it’s important! • Making it work requires that we get a handle on how to host & manage services • Application hosting services are an important part of the mix • GT4 provides most of the pieces needed to build a good application hosting service • We are looking for partners to extend & apply this technology Thanks to Carl Kesselman, Matei Ripeanu, Ravi Madduri for discussions