100 likes | 274 Views
AMUN. Authors: R. Baker (Cornell University, Ithaca, NY USA) L. Zhou (University of Florida, Gainesville, FL USA) J. Duboscq (Ohio State University, Columbus, OH USA) Presented by: D. Mimnagh (University of Texas, Austin, TX USA).
E N D
AMUN Authors: R. Baker (Cornell University, Ithaca, NY USA) L. Zhou (University of Florida, Gainesville, FL USA) J. Duboscq (Ohio State University, Columbus, OH USA) Presented by: D. Mimnagh (University of Texas, Austin, TX USA) A Practical Application Using the Nile Distributed Operating System CHEP2000
Overview • What is Nile? • What is AMUN? • Results • Conclusions CHEP2000
What is Nile? • Nile: Distributed computing solution for CLEO • fault-tolerant (recover from resource failure) • self-managing (sophisticated resource scheduling) • heterogeneous (will run anything anywhere) • Designed for HEP • track reconstruction • data analysis • simulation • But very generic CHEP2000
Nile Architecture CHEP2000
What is AMUN? • Advanced Monte Carlo Under Nile • CLEO II.V signal Monte Carlo • τ lepton pair events • Testbed • Nile control system using RMI (see E272) • Borrowed workstation program CHEP2000
Managing Loaned Workstations • Prototype • csh scripts • list of machine owners • Must be easy and honest • simple configuration files creation • monitor usage remotely and locally • allow preemption for unexpected usage • need local space for intermediate results • Will be integrated with Nile in Java CHEP2000
Nile performance Results • Very stable • weeks of uninterrupted use • Heterogeneity • as many as 60 machines, Alpha Linux + Unix • SpecInt ranging from 1 to 25 • Scaling • linear • Network topology issues can break linearity • 1-3 second to reschedule CPU CHEP2000
Scaling with Total SpecInt CHEP2000
Events Generated • Job construction requirements: • choose subjob size • collection script • 25 million τ events generated • as many as 1 million a day CHEP2000
Conclusion • Successful implementation of Nile in RMI • CPU resources used efficiently • loaned CPU • To do: • rewrite scripts in Java • admin tools • GUI tools CHEP2000