500 likes | 628 Views
Parallel Job Deployment and Monitoring in a Hierarchy of Mobile Agents. Munehiro Fukuda Computing & Software Systems, University of Washington, Bothell Funded by. Outline. Introduction Execution Model System Design Performance Evaluation Related Work Conclusions. 1. Introduction.
E N D
Parallel Job Deployment and Monitoring in a Hierarchy of Mobile Agents Munehiro Fukuda Computing & Software Systems, University of Washington, Bothell Funded by CSS Speaker Series
Outline • Introduction • Execution Model • System Design • Performance Evaluation • Related Work • Conclusions CSS Speaker Series
1. Introduction • Problems in Grid Computing • Background of Mobile Agents • Objective • Project Overview CSS Speaker Series
Quiet Laboratories • UW1-320 and UW1-302 at 3pm on a weekday • No more computing resources needed? CSS Speaker Series
Demands for Computing ResourcesIn Teaching • I'm in the 320 lab testing my program, and for some reason, whenever I attempt to use 15 hosts, it asks me for passwords for hosts 31 - 18 and then freezes and does nothing. • I noticed that uw1-320-20 is being bogged down by zombie processes that someone left going on it: • I went around looking on some of the other computers, and its all over the place: user A has almost 40 processes running on uw1-320-30 since April 23rd, user B has about 10 on host 29, and there’s a ton more on almost every host. • I got tired of manually running a bunch of ssh commands to run rmi on many different machines. • I have narrowed the problem down to three machines: 16, 20, and 30. First of all uw1-320-20 is dead. It drops all incoming ssh connections. The other two, uw1-320-16 and uw1-320-30, both have a mysterious problem that I don't know how to solve. CSS Speaker Series
Demands for Computing ResourcesIn Research • http://setiathome.berkeley.edu/ • http://boinc.bakerlab.org/rosetta/rah_about.php • These are an effective way to collect numerous computing resources from all over the world. • But, here is a question: • Why don’t they use idle machines on their campuses first? CSS Speaker Series
Grid-Computing Brokers • Desktops • Buyers: a desktop user • Sellers: hardware components • Brokers: Windows, Linux • Clusters • Buyers: multiple users (e.g., CSS434 students) • Sellers: cluster computing nodes • Brokers: PBS, LSF • Grid computing • Buyers: Seti@home, Rosseta@home, etc. • Brokers: Globus, Condor, Legion (Avaki), NetSolve, Ninf, Entropia, etc. • Okay, no need to implement any more? CSS Speaker Series
Problems in Grid Computing • Targeting large business models • A central entry point • A lot of installation work http://www.globus.org/toolkit/docs/4.0/ • Little system faults • Too gigantic CSS Speaker Series
Our Target • Targeting a group of computer users • No central entry point • No central managers • No programming model restrictions • Easy installation work • Easy participation but necessity of fault tolerance Network CSS Speaker Series
Background of Mobile Agents • An execution model previously highlighted as a prospective infrastructure of distributed systems. • Static job deployment and result collection: No more than an alternative approach to centralized grid middleware implementation • Our goal: Let mobile agents do unique tasks in grid computing FTP Cycle Cycle Cycle Central manger HTTP Server Server Server RPC User Internet CSS Speaker Series
Focus on a group of independent computers Turned on and off independently Not controlled by a scheduler such as PBS and LSF Not managed by a central server Let mobile agents do unique tasks in grid computing Runtime job migration: Moving a program from a faulty/busy site to an active/idle site Seeking for fault tolerance and better load balancing Negotiation: Negotiating with other agents about computing resources Seeking for better load balancing Inherent parallelism: Deploying and monitoring jobs in parallel Decentralized job management Objective CSS Speaker Series
Project Overview • Funded by: NSF Middleware Initiative • Sponsored by: University of Washington • In Collaboration of: Ehime University • In a Team of: UWB Undergraduates CSS Speaker Series
2. Execution Model • System Overview • Execution Layer • Programming Environment CSS Speaker Series
Snapshot Methods Snapshot Methods GridTCP GridTCP User program wrapper User program wrapper Results Results snapshot snapshot snapshot User A User B FTP Server snapshots snapshots System Overview User A’s Process User A’s Process User B’s Process TCP Communication Snapshot Methods GridTCP User program wrapper Sentinel Agent Sentinel Agent Sentinel Agent Commander Agent Resource Agent Resource Agent Commander Agent CSS Speaker Series BookkeeperAgent Bookkeeper Agent
mpiJava-S mpiJava-A Execution Layer Java user applications mpiJava API Java socket GridTcp User program wrapper Commander, resource, sentinel, and bookkeeper agents UWAgents mobile agent execution platform Operating systems CSS Speaker Series
MPI Java Programming public class MyApplication { public GridIpEntry ipEntry[]; // used by the GridTcp socket library public int funcId; // used by the user program wrapper public GridTcp tcp; // the GridTcp error-recoverable socket public int nprocess; // #processors public int myRank; // processor id ( or mpi rank) public int func_0( String args[] ) { // constructor MPJ.Init( args, ipEntry, tcp );// invoke mpiJava-A .....; // more statements to be inserted return 1; // calls func_1( ) } public int func_1( ) { // called from func_0 if ( MPJ.COMM_WORLD.Rank( ) == 0 ) MPJ.COMM_WORLD.Send( ... ); else MPJ.COMM_WORLD.Recv( ... ); .....; // more statements to be inserted return 2; // calls func_2( ) } public int func_2( ) { // called from func_2, the last function .....; // more statements to be inserted MPJ.finalize( );// stops mpiJava-A return -2; // application terminated } } CSS Speaker Series
3. System Design • Mobile Agents • Job Coordination • Distribution • Resource allocation and monitoring • Resumption and migration • Programming Support • Language preprocessing • Communication check-pointing • Inter-Cluster Job Deployment (Current Research Topic) • Over-gateway agent migration • Over-gateway communication • Job distribution CSS Speaker Series
UWInject: submits a new agent from shell. id 0 Agent domain (time=3:30pm, 8/25/05 ip = medusa.uwb.edu name = fukuda) id 0 -m 4 -m 3 id 1 id 1 id 2 id 3 id 2 User A user job UWPlace id 12 Agent domain (time=3:31pm, 8/25/05 ip = perseus.uwb.edu name = fukuda) id 4 id 5 id 6 id 7 id 8 id 9 id 10 id 11 UWAgents – Concept of Agent Domain CSS Speaker Series
User snapshot snapshot Job Distribution Job Submission Commander id 0 XML Query Spawn eXist Resource id 1 Sentinel id 2 rank 0 Bookkeeper id 3 rank 0 Sensor id 4 Sensor id 5 Sentinel id 8 rank 1 Sentinel id 9 rank 2 Sentinel id 10 rank 3 Sentinel id 11 rank 4 Bookkeeper id 12 rank 1 Bookkeeper id 13 rank 2 Bookkeeper id 14 rank 3 Bookkeeper id 15 rank 4 id: agent id rank: MPI Rank Sentinel id 32 rank 5 Sentinel id 33 rank 6 Sentinel id 34 rank 7 Bookkeeper id 48 rank 5 Bookkeeper id 49 rank 6 Bookkeeper id 50 rank 7 CSS Speaker Series
Sensor id 4 Sensor id 5 User Node 1 Node 0 Node 2 Node 3 Node 4 Node5 Performance data ttcp Sensor id 16 Sensor id 17 Sensor id 18 Sensor id 19 Sensor id 20 Sensor id 21 Sensor id 22 Sensor id 23 ttcp ttcp Resource Allocation and Monitoring Job submission Our own XML DB total nodes x multiplier eXist Resource id 1 Commander id 0 An XML query A list of available nodes CPU Architecture OS Memory Disk Total nodes Multiplier Spawn Case 1: Total nodes = 2 Multiplier = 1.5 Sentinel id 2 rank 0 Sentinel id 8 rank 1 Sentinel id 2 rank 0 Bookkeeper id 12 rank 5 Sentinel id 8 rank 1 Bookkeeper id 2 rank 0 Bookkeeper id 12 rank 5 Bookkeeper id 2 rank 0 Case 2: Total nodes = 2 Multiplier = 3 Future use Future use Future use CSS Speaker Series
(2) Search for the latest snapshot (3) Retrieve the snapshot (4) Send a new agent (1) Detect a ping error New Sentinel id 11 rank 4 (5) Restart a user program (0) Send a new snapshot periodically Job Resumption by a Parent Sentinel Sentinel id 2 rank 0 MPI connections Sentinel id 8 rank 1 Sentinel id 9 rank 2 Sentinel id 10 rank 3 Sentinel id 11 rank 4 Bookkeeper id 15 rank 4 CSS Speaker Series
(12) Restart a new resource agent from its beginning Commander id 0 (11) Detect a ping error (13) Detect a ping error and follow the same child resumption procedure as in p9. (10) Send a new agent (6) No pings for 2 * 5 (= 10sec) (7) Search for the latest snapshot New Resource id 1 Sentinel id 2 rank 0 (2) Search for the latest snapshot (8) Search for the latest snapshot (1) No pings for 8 * 5 (= 40sec) (9) Retrieve the snapshot No pings for 12 * 5 (= 60sec) (5) Send a new agent (3) Search for the latest snapshot (4) Retrieve the snapshot Job Resumption by a Child Sentinel Commander id 0 New Resource id 1 Sentinel id 2 rank 0 Bookkeeper id 3 rank 0 Sentinel id 8 rank 1 Bookkeeper id 12 rank 1 CSS Speaker Series
User Program Wrapper User Program Wrapper Source Code func_0( ) { statement_1; statement_2; statement_3; return 1; } func_1( ) { statement_4; statement_5; statement_6; return 2; } func_2( ) { statement_7; statement_8; statement_9; return -2; } statement_1; statement_2; statement_3; check_point( ); statement_4; statement_5; statement_6; check_point( ); statement_7; statement_8; statement_9; check_point( ); int fid = 1; while( fid == -2) { switch( func_id ) { case 0: fid = func_0( ); case 1: fid = func_1( ); case 2: fid = func_2( ); } } check_point( ) { // save this object // including func_id // into a file } Preprocessed Cryptography CSS Speaker Series
Pre-proccesser and Drawback Preprocessed Code Preprocessed Source Code int func_0( ) { statement_1; statement_2; statement_3; return 1; } int func_1( ) { while(…) { statement_4; if (…) { statement_5; return 2; } else statement_7; statement_8; } } int func_2( ) { statement_6; statement_8; while(…) { statement_4; if (…) { statement_5; return 2; } else statement_7; statement8; } } statement_1; statement_2; statement_3; check_point( ); while (…) { statement_4; if (…) { statement_5; check_point( ); statement_6; } else statement_7; statement_8; } check_point( ); • No recursions • Useless source line numbers indicated upon errors • Still need of explicit snapshot points. Before check_point( ) in if-clause After check_point( ) in if-clause CSS Speaker Series
User Program Wrapper rank ip 1 n1.uwb.edu user program 2 n2.uwb.edu n3.uwb.edu incoming TCP ougoing backup User Program Wrapper rank ip 1 n1.uwb.edu user program 2 n3.uwb.edu incoming TCP ougoing backup GridTcp – Check-Pointed Connection User Program Wrapper user program TCP outgoing backup incoming Snapshot maintenance n1.uwb.edu n2.uwb.edu • Outgoing packets saved in a backup queue • All packets serialized in a backup file every check pointing • Upon a migration • Packets de-serialized from a backup file • Backup packets restored in outgoing queue • IP table updated n3.uwb.edu CSS Speaker Series
Inter-Cluster Job DeploymentCurrent Research Topic Sentinel id 2 Commander id 0 • Over-gateway agent deployment • Over-gateway TCP communication • Over-gateway agent tree creatioin How? Internet medusa.uwb.edu Sentinel id 8 Sentinel id 9 Private domain uw1-320-00.uwb.edu uw1-320-01.uwb.edu 10.0.0.3 10.0.0.4 10.0.0.7 CSS Speaker Series
UWAgents – Over Gateway Migration • Parent and children keep track of a route to each other’s current position. • A daemon maintains where a gateway is. talk( ) hop( ) spawnChild( ) id 1 id 1 id 0 Internet hop( ) medusa.uwb.edu hop( ) id 1 id 1 Private domain uw1-320-00.uwb.edu uw1-320-01.uwb.edu mnode0 mnode1 mnode4 CSS Speaker Series
User Program Wrapper User Program Wrapper User Program Wrapper rank rank rank dest dest dest gateway gateway gateway 0 0 0 uw1-320-00 uw1-320-00 uw1-320-00 medusa - - 1 1 1 mnode0 mnode0 mnode0 medusa - - 2 2 2 medusa medusa medusa - - - user program user program user program GridTcp – Over-Gateway Connection Sentinel id 9 rank 2 Sentinel id 2 rank 0 Commander id 0 Internet medusa.uwb.edu Sentinel id 8 rank 1 Private domain uw1-320-00.uwb.edu uw1-320-01.uwb.edu mnode0 mnode1 mnode4 CSS Speaker Series
User Over-Gateway Agent Tree CreationPossible Solutions Commander id 0 Partition 1 Partition 2 Resource id 1 Sentinel id 2 rank 0 Bookkeeper id 3 rank 0 Cluster 0 Sentinel id 8 rank 1 Sentinel id 9 rank 2 Sentinel id 10 rank 3 Sentinel id 11 rank 4 Cluster 2 Sentinel id 32 rank 5 Sentinel id 33 rank 6 Sentinel id 34 rank 7 Sentinel id 35 rank 8 Sentinel id 46 rank 19 Sentinel id 47 rank 20 Cluster 1 CSS Speaker Series
User Over-Gateway Agent Tree CreationFinal Solution Commander id 0 Sentinel id 2 Resource id 1 Bookkeeper id 3 rank 0 Cluster gateway 0 Desktop computers Sentinel id 8 rank -8 Sentinel id 9 rank X Cluster gateways 1, 2, and 3 Sentinel id 32 rank 0 Sentinel id 33 rank -33 Sentinel id 34 rank -34 Sentinel id 35 rank -35 Sentinel id 36 rank X+1 Sentinel id 37 rank X+2 Sentinel id 38 rank X+3 Sentinel id 39 rank X+4 Sentinel id 128 rank 1 Sentinel id 129 rank 2 Sentinel id 130 rank 3 Sentinel id 131 rank 4 Sentinel id 132 rank 6 Cluster 1 Cluster 3 Cluster 2 Cluster 0 Sentinel id 512 rank 5 Sentinel id 528 rank 7 Sentinel id 529 rank 8 Sentinel id 530 rank 9 Sentinel id 531 rank 10 CSS Speaker Series
4. Performance Evaluation • Evaluation Environment: • A 8-node Myrinet-2000 cluster: 2.8GHz pentium4-Xeon w/ 512MB • A 24-node Giga-Ethernet cluster: 3.4GHz Pentium4-Xeon w/512MB • Computation Granularity • Java Grande MPJ Benchmark • Process Resumption Overhead • File Transfer CSS Speaker Series
Computational Granularity 1 Master-slave computation Master Communication Slave Slave Slave Slave Slave CSS Speaker Series
Computational Granularity 2 Heartbeat communication Process Process Process Process Process Communication CSS Speaker Series
Computational Granularity 3 All to all broadcast Communication Process Process Process Process Process CSS Speaker Series
Performance Evaluation - Series Master-slave computation CSS Speaker Series
Performance Evaluation - RayTracer All reduce communication but few data to send CSS Speaker Series
Performance Evaluation – MolDyn All to all broadcast CSS Speaker Series
Overhead of Job Resumption CSS Speaker Series
User File Transfer Commander id 0 Resource id 1 Sentinel id 2 rank 0 Bookkeeper id 3 rank 0 Sentinel id 8 rank 1 Sentinel id 9 rank 2 Sentinel id 10 rank 3 Sentinel id 11 rank 4 AgentTeamwork vs NFS Pipelined Transfer in AgentTeamwork Sentinel id 32 rank 5 Sentinel id 33 rank 6 Sentinel id 34 rank 7 Sentinel id 35 rank 8 Sentinel id 46 rank 19 Sentinel id 47 rank 20 CSS Speaker Series
5. Related Work From the viewpoints of: • System Architecture • Fault Tolerance • Job Deployment and Monitoring CSS Speaker Series
System Architecture • Difference from Catalina/J-SEAL2 • They are not fully implemented. • They are based on a master-slave model CSS Speaker Series
Fault Tolerance CSS Speaker Series
Job Deployment and Monitoring CSS Speaker Series
6. Conclusions • Project Summary • Next Two Years CSS Speaker Series
Project summary • Applications • Computation granularity: 40,000 doubles x 10,000 floating-point operations • Message transfer: Any types except all-to-all communication • Entire application size: 3+ times larger than computation granularity • Current status • UWAgent: completed • Agent behavioral design: basic job deployment/resumption implemented • User program wrapper: completed including security features • GridTcp/mpiJava: in testing • Preprocessor: almost completed CSS Speaker Series
Next Two Years • Application support • Fault tolerance in file transfer • GUI improvement • Agent algorithms • Over-gateway application deployment • Dynamic resource allocation and monitoring • Priority-based agent migration • Performance evaluation • Dissemination CSS Speaker Series
AgentTeamwork Can AgentTeamwork Become Their Competitor? Nimrod CSS Speaker Series
Questions? CSS Speaker Series
MPJ.Send and Recv Performance CSS Speaker Series
Mobile Agents CSS Speaker Series