260 likes | 390 Views
gUSE: grid User Support Environment. Peter Kacsuk, Krisztian Kar o czkai, Andras Schnautigel, Istvan Marton, Gabor Herman MTA SZTAKI www.lpds.sztaki.hu. Content. Motivations Lessons learnt from P - GRADE portal Lessons learnt from accessing production Grid infrastructures
E N D
gUSE: grid User Support Environment Peter Kacsuk, Krisztian Karoczkai, Andras Schnautigel, Istvan Marton, Gabor Herman MTA SZTAKI www.lpds.sztaki.hu
Content • Motivations • Lessons learnt from P-GRADE portal • Lessons learnt from accessing production Grid infrastructures • Lessons learnt from providing multi-grid service • The service-oriented architecture of gUSE • Services in gUSE • Workflow concept of gUSE • Parameter sweep support of gUSE • CancerGrid • Usage of gUSE • EDGeS • Conclusions 3rd EGEE User Forum
Lessons learnt from P-GRADE portal • Popular because it provides • Easy-to-use but powerful workflow system (graphical editor, wf manager, etc.) • Easy-to-use parameter sweep concept support • Easy-to-use MPI program execution support • Multi-grid/multi-VO access mechanism (job submission grid interoperability at workflow level) for LCG-2, gLite and GT2 • Its extension with GEMLCA enables • The usage of legacy codes as grid-enabled services • The usage of service/job repository • Access to SRB and OGSA-DAI • Multi-grid/multi-VO access mechanism for LCG-2, gLite, GT2 and GT4 • Data management level of grid interoperability 3rd EGEE User Forum
Popularity of P-GRADE portal • It has been used in many EGEE and EGEE-related VOs: • GILDA, VOCE, SEE-GRID, BalticGrid, BioInfoGrid, EGRID, etc. • It has been used in many national grids: • UK NGS (a GT2-based grid), Grid-Ireland, Turkish Grid, Croatian Grid, Ukrainan Grid, etc. • It has been used as the GIN VO Resource Testing Portal • It became OSS in the beginning of Januar 2008: https://sourceforge.net/projects/pgportal/ 3rd EGEE User Forum
Download of OSS P-GRADE portal 130 downloads within a month 3rd EGEE User Forum
Limitations of P-GRADE portal • Restricted workflow capabilities • No cycle construct, no if-then-else, no embedding • Static parameter sweep capabilities • PS can not be used inside a workflow • Single user view • Too simple for IT people • Too complicated for end-users • Lack of collaborative tools supporting user communities • Monolithic architecture and as a result problems with • Scalability: • simultaneous number of jobs in the range of 100s • simultaneous number of users in the range of 30-50 • Adaptivity: difficult to adapt to new grid services 3rd EGEE User Forum
Lessons learnt from accessing production Grid infrastructures • Production Grids do not enable you to modify anything, just use their services (no matter they are good or bad) • Usually they provide basic grid services • The user should construct higher level services • However, if you do not want to be locked with one particular grid the user-written service should be interoperable with many basic grid services provided by different grids 3rd EGEE User Forum
Motivations of creating gUSE • We wanted • to overcome the limitations of the current P-GRADE portal • To create a set of high-level grid services that can be used with many different grids • Therefore we have defined a new service-oriented grid layer that can be deployed • on a single machine • on a cluster • on different grid sites as Web Services • Performance comparison • P-GRADE portal monolithic architecture: 100-200 jobs • WS-PGRADE/gUSE SOA architecture: 10.000 jobs 3rd EGEE User Forum
Monolithic architecture of P-GRADE portal A single Web container Single computer Workflow save read WEB UI File storage special file formats WFS and file Storage Workflow submit special protocol Workflow Engine Read Workflow to run special protocol Built-in Grid API + Hack for non-supported APIs Grid ClientS 3rd EGEE User Forum
gUSE architecture Graphical User Interface: WS-PGRADE Gridsphere portlets gUSE Filestorage Workflowstorage Autonomous Services: high level middleware service layer informationsystem WorkflowEngine Applicationrepository Broker/ Meta-broker Submitters Logging gLite or Globus or Web service: low level middleware service layer gLite resources, Globus resources and Web services
Generic service communication scheme in gUSE Definition of server functions Definition of client functions Function definitions Service Interface Client Interface RPC Service request RPC server Front-end implementation Client Implementation Service Front-end Concrete implementation of Service calls Service Logic Service Back-end Function implementations 3rd EGEE User Forum
Distributed SOA architecture Web container WEB UI Workflow list and config descriptor 8 WF Storage Special file formats inside File Storage Special file formats inside Status back Workflow Submit WFS 1 3 2 WFE Workflow Executor 5 Workflow descriptor File Storage 7 Status back 6 Job Submit Job info 4 Grid ClientS Files needed for wf execution Grid Api 3rd EGEE User Forum
Application developers’ view • Users of gUSE can be either • grid application developers • or end-users. • Application developers can develop sophisticated workflow applications where workflows can be • embedded into each other at any depth • recursive workflows are allowed • gUSE supports the following workflow types • graphs (abstract workflows) • workflow templates • concrete workflows • workflow instances • Parametric sweep nodes and normal nodes can be used in a mixed way. 3rd EGEE User Forum
Collaboration support between user communities • Application developers can • publish • incomplete wf applications (projects), wf parts (templates, graphs, concrete wf, wf instances) into a workflow repository for the use of other developers • ready-to-run wf applications for end-users • import workflows from the repository and can continue the work on them even if they were published by other developers • End-users can • import ready-to-run wf applications from the repository • execute ready-to-run wf applications imported from the repository based on a simplified portal interface hiding grid details • Grid is exposed only for application developers. 3rd EGEE User Forum
User activities New Edit, CopyDelete Edit Graph Template Constraints,Comments,Form Generators Jobs,Edges,Ports New New New Export Configure,Copy, Delete Repository Item Concrete Workflow Applications, Projects, Workflow part(G,T,CW,WI) Algorithms,Resource references,Inputs Import Submit Observe,Download,Suspend,Delete Workflow Instance Running state,Outputs 3rd EGEE User Forum
The workflow concept of gUSE • The workflow concept of gUSE is much more flexible than P-GRADE portal and many other workflow systems • Its DAG topology is extended with • embedded WFs • recursive embedded WFs • parameter sweep nodes • conditional control mechanism • special workflow starting control mechanisms based on • external events or • periodic timing 3rd EGEE User Forum
Workflow Graph: Overview Input Port Node: job, service call (WS, legacy), wf Output Port The Workflow Editor as it appears for the user 3rd EGEE User Forum
*K Legend: Cross Product Dot Product Configuring the Workflow: Overview Determine number of accepted files on external input Ports m n h Generator job produces multiple data on the output port within one job submission step Determine Dot or Cross product relation of Input ports to define the number of job submissions 1 Determine Job to be Collector by defining a Gathering Input Port. The Job execution will be postponed until all input files to that Port have arrived 3rd EGEE User Forum
*K h m n m*n*h*K h S S S m*n Animation the number of generated output files In case of Generator job the number of job submissions may differ from the number of files on Output Ports m*n h*K In case of dot product the Job is submitted with input files having a common index number in each input Ports m*n h*K m*n h*K m*n*h*K S S=max(m*n,h*k) 1 1 S In case of cross product individual Job submission is generated for each possible input file combination S S S 3rd EGEE User Forum S
Generator job Generator job An example CancerGrid workflow N = 20e-30e, M = 100 => very large number of executions and files x1 NxM x1 xN xN xN NxM x1 xN xN xN NxM NxM 3rd EGEE User Forum
Interoperability support • gUSE supports: • grid interoperability • workflow interoperability • gUSE can easily be connected to any known grid middleware. It is already connected to GT2, GT4, LCG-2, gLite and WS based grid systems • gUSE can also be connected to local systems like clusters or supercomputers • It contains a built-in grid broker that can automatically distribute the jobs of a workflow into any of the connected grids • It can use other grid brokers like the gLite broker or GridWay 3rd EGEE User Forum
Appl. Repository WS-PGRADE gUSE Service Grid EGEE GlobalDEG LocalDEG LocalDEG LocalDEG Interoperability support: EDGeS • EDGeS: Enabling Desktop Grids for e-Science • To integrate EGEE with Desktop Grids • gUSE can provide the transparent access of EGEE and DGs University DG Volunteer DG LocalDEG 3rd EGEE User Forum
P-GRADE portal P-GRADE/GEMLCA portal 1st generation WS-PGRADE portal 2nd generation Family of user support products • P-GRADE portal and gUSE/WS-PGRADE represent a family of user support products • They support the whole range of user types: • Novice application developers: 1st generation P-GRADE portals Advances application developers: 2nd generation WS-PGRADE portal developer view • End-users without grid knowledge: 2nd generation WS-PGRADE portal end-user view 3rd EGEE User Forum
Family of P-GRADE products and their use • P-GRADE • Parallelizing applications for clusters and grids • P-GRADE portal • Creating simple workflow and parameter sweep applications for grids • P-GRADE/GEMLCA portal • Creating workflow applications using legacy codes and community codes from repository • gUSE/WS-PGRADE • Creating complex workflow and parameter sweep applications for clusters, service grids and desktop grids • Creating workflow applications using embedded workflows, legacy codes and community workflows from workflow repository 3rd EGEE User Forum
Conclusions / Future plans • gUSE solves all the limitation problems of P-GRADE portal: • Implementation of gUSE is highly scalable, can be distributed on a cluster or even on different grid sites. • Stress tests show that it can simultaneously serve thousands of jobs • Its workflow concept is much more expressive than in P-GRADE portal (recursive wf, generic PS support, etc.) • Its user interface called as WS-PGRADE provides a graphical workflow editor that is much faster than the one in P-GRADE portal • gUSE provides a workflow repository and its use by end-users and application developers • gUSE solves grid interoperability at workflow level • among service grids • between service grids and desktop grids (see EDGeS project) 3rd EGEE User Forum
Roadmap of gUSE • First version was demonstrated at SC’07 • First version will be released in March 2008 with full support for EGEE, GT2 and GT4 • Second version will be released in July 2008 with full support for desktop grids • Third version solving interoperability between EGEE and desktop grids will be released by SC’08 3rd EGEE User Forum