250 likes | 406 Views
ApGrid Demonstration. Putchong Uthayopas, Sugree Phatanapherom, Parallel Research Group, Department of Computer Engineering Faculty of Engineering, Kasetsart University Bangkok, Thailand. Introduction. Motivation
E N D
ApGrid Demonstration Putchong Uthayopas, Sugree Phatanapherom, Parallel Research Group, Department of Computer Engineering Faculty of Engineering, Kasetsart University Bangkok, Thailand.
Introduction • Motivation • Explore technical problem in the deployment of grid system in Asia pacific level using resources provided by ApGrid test-bed • Develop initial ad hoc infrastructure that allows researcher to start using ApGrid test bed • Explore some innovative ideas and applications of the international grid • Stimulate the collaboration among Asia Pacific Grid researchers • Participants • HKU (China) • AIST (Japan) • KU, KMITNB, NECTEC (Thailand)
About the ApGrid Demo • Hardware • 14 nodes 42 processors in 3 countries and 5 organizations are used. • SUN, Alpha, x86 (Intel and AMD) • Software • Globus Toolkit 2.0 • Job Manager • Condor, PBS, and SQMS • Signed Certificate by AIST are used • /C=JP/O=AIST GTRC/CN=<full name>/Email=<e-mail> • /C=JP/O=AIST GTRC/CN=host/<fqdn> • /C=JP/O=AIST GTRC/CN=ldap/<fqdn>
Software Infrastructure Applications (CFD, Rendering, Monitoring) Globus 2 AIST HKU ThaiGrid KU KMITNB NECTEC
MDS mds.apgrid.org ApGrid AIST HKU Thai NECTEC KOUME (4/4) Grid (4/4) RDC4 (1/1) PALM (1/1) Alpha (1/2) AMATA (1/28) KMITNB (Node/CPU) PRG (2/2) KU
Demonstration Applications • Grid Movie Rendering • Computation • Matrix Multiplication • Equation solver using Gauss Elimination • Computational Fluid Dynamic (Heat Transfer)
query available jobmanager submit dispatch SCE /Grid gatekeeper GASS server Job Manager grid-enabled povray submit animation povray grid-enabled povray Local Scheduler source Grid Movie Rendering stage exe / io dispatch stage output stage inputs povray animation povray source output frames output frames
Bandwidth Measurement • Using Globus to help running bandwidth measurement • Algorithm: • Start Iperf server on one Grid Point • Start Iperf client on all grid point and measure the bandwidth to iper server • Repeat the measurement for all point
Task Execution Model and Parameters • Tqg - Grid queuing latency time • Depend on Policy, Number of Grid level jobs • Tsl – Launcher staging time • Tql - Local queuing latency time • Tsexec – Executable staging time • Tsinput – Input staging time • Texec - Execution time • Tsoutput – Output staging time
Application Characteristics Grid Movies Rendering Matrix Multiplication, Gauss Elimination CFD (Heat Conduction)
Results • MRB (Minimum required Bandwidthe) • Minimum Bandwidth that the communication overhead time is less than execution time of a problem • Depend on Bandwidth and remote computing power • Can be used to decide which application is practical to run on the grid • Example: • MRB between KU and HKU for each test application • Povray - 128 Kbps • MMul - 8 Mbps • Gauss - 4 Mbps • Heat - 16 Mbps
Recommendation 1 • Minimize traffic • Scheduler • Dispatch grid jobs at low-traffic time to get the maximum bandwidth • Traffic aware scheduler • Continuous traffic monitoring is important • Launcher • Stage output back at low-traffic time • Co-scheduling the input/executable/output staging task among multiple jobs to fully utilized network bandwidth
Recommendation 2 • Minimize Execution Staging Latency • Schedule the pre-staging of executable • Reuse the executable code • Break application into dynamics link module and pre-stage DLL module
Problems • MDS • Unstable (2.1) • Firewall • gatekeeper 2119 • giis/gris 2135 • gsiftp 2811 • other tcp range gass server gass client establish send request disconnect listenforreply establish reply disconnect HKU 40000-40050 KU 2001-10000 NECTEC ????
Problems • Bandwidth Variation • KU-HKU • Maximum 1 Mbps • Minimum 1 Kbps or less • AIST-KMITNB • AIST->KMITNB 2.5 Kbps • KMITNB->AIST 365 Kbps
Summary • This demo is a crucial step in • Establishing a workable infrastructure that utilize ApGrid Resources • Explore the challenging issues on the real system • Grid is working but appropriate for application with high computation/communication ratio
Next Step • Establish the Monitoring Infrastructure and a monitoring portal that allows human and software entities on Apgrid to access the traffic information. • ApGrid FAQ • Testing more applications on more site • Drug Design, Chemical Reaction ( Gamess) • Find a way to increase bandwidth between participating test site
Acknowledgement • Yoshio Tanaka, AIST • Choli Wang,Roy C. Ho, HKU • Sissades Tongsima,Kittinarong Laongvaree, NECTEC • Vara Varavidthaya, KMITNB