180 likes | 192 Views
What is the Matrix?. Arun swaran Jagatheesan San Diego Supercomputer Center. DAKS Meeting, 11/03/2003 SDSC . Credit / Acknowledgements. Participants Allen Ding Lucas Gilbert Reena Mathew Erik Vandiekieft ( IBM) Xi Cynthia Sheng Well Wishers Reagan Moore & SRB Team Kim Baldridge
E N D
What is the Matrix? Arun swaran Jagatheesan San Diego Supercomputer Center DAKS Meeting, 11/03/2003 SDSC
Credit / Acknowledgements • Participants • Allen Ding • Lucas Gilbert • Reena Mathew • Erik Vandiekieft ( IBM) • Xi Cynthia Sheng • Well Wishers • Reagan Moore & SRB Team • Kim Baldridge • YOU !!! • Sponsors • NSF GriPhyN, NSF SCEC, NPACI REU, NIH BIRN
Talk Outline • Workflow • Requirements for Grid Workflow • Data Grid Language • Matrix as a WfMS • Demonstrations • XQuery (CDL) • External Status Requests
Workflow • Automation of business process • Whole or Part • Documents/Information or tasks passed between participants • Based on a set of procedural rules • Scientific Computing Workflow • Computational research process as pathways or pipelines • Gather data, cleanse data, apply different combinations of transformations, simulations, visualization, publish in digital library, archive data, get Nobel prize (makes us also happy :-)
Workflow Description • Workflow description needed for processing • Current standards • BPEL4WS (Gaining Popularity) • Business Process Execution Language for Web Services • WSFL, XLANG; Merger of IBM, MS, … • XPDL • XML Process Definition Language • From Workflow Management Coalition • UML, WSCI,…
Grid Workflow • Granular Metadata • Context-based workflow, with control-based constructs • Query • Dynamic rule (ECA rules) to update milestones • Runtime Query on status of other activities • XQuery subset - Uniform query for data and process • Looping • Scientific Computing more looping structures • Interest in execution of each iteration and change of state • No support from existing standards • http://tmitwww.tm.tue.nl/research/patterns/standards.htm
Grid Workflow Description (Addl. Req.) • Facilitate modeling using a GUI as a workflow graph • Describe rules that can be updated dynamically • Embedded ECA-like rules within workflow description • Describe looping structures - where the state of each iteration is visible to applications • Dynamic query description for inter and intra-process coordination during workflow • Context based workflow (not purely control based) • Support import and export of definitions to be executed in P2P environment of workflow systems (and also support composition of services)
Data Grid Language • XML based asynchronous protocol • Describe Workflow control structures (Sequence, Parallel Split,..) • Describe data sets, collections, datagrid operations, ... • Access and manage data grids, data flow pipelines • Query on data resource (based on W3C XQuery subset) • Describe Rules and Context based Grid Workflow* • Query on Process meta-data, state* • Implementation • Reference Implementation by SDSC Matrix Project * Being Designed/developed as of the presentation date
Grid Workflow Process I Workflow Description Data Grid Language End User
Planner Concrete Workflow Grid Workflow Process II Abstract Workflow Data Grid Language
Grid Workflow Process III Grid Workflow Processor Concrete Workflow Export Workflow to Matrix P2P
Matrix Server • Acts as a Peer in WfMS P2P System * • Processes Data Grid Requests • Can maintain state an manage process steps • Can invoke SRB data grid processes, OGSA-Services, WSDL Services (OGSA Threads to be implemented) • Implemented as an Open-source Project * Being Designed/developed as of the presentation date
Implementation Status • Data Grid Language Schema for basic workflow constructs, Data Grid Operations • Matrix agents for executing data grid requests • Basic process pipeline management • Data Grid Language: Rules, Embedded query, OGSA operations to be added • Matrix: P2P, export/sharing of workflow to be added
SDSC Matrix Architecture SOAP Service Wrapper Abstraction Event Publish Subscribe, Notification JMS Messaging System JAXM Wrapper OGSA RPC-Style for SOAP Matrix Data Grid Request Processor Status Query Handler Pipeline Query Processor Transaction Handler Flow Handler and Execution Manager XQuery Processor Termination Handler Data flow pipeline Meta data Manager Matrix Agent Abstraction Persistence (Store) Abstraction Other Data Services OGSA Agent WSDL Agent SRB Agents JDBC In Memory Store
Conclusion • Data Grid Language • Grid Workflow Description • Basic Stuff or foundation ready • Solid Design to handle more complex stuff • Workflow Modeling not investigated (like Ptolemy?) • Matrix Server Implementation • Create, Query, Manage Grid Workflows • OGSA, Rules, P2P to be implemented • More Support will expedite R&D
Demos ? He is trying to escape. Where are the Demos?