170 likes | 297 Views
GENI Science Shakedown. Paul Ruth RENCI / UNC Chapel Hill pruth@renci.org. Motivation. Motivation. Are the current GENI technologies capable of effectively supporting the computing needs of domain science? . Motivation.
E N D
GENI Science Shakedown Paul Ruth RENCI / UNC Chapel Hill pruth@renci.org
Motivation Are the current GENI technologies capable of effectively supporting the computing needs of domain science?
Motivation Are the current GENI technologies capable of effectively supporting the computing needs of domain science? What are the benefits and limitations of using GENI for domain science?
Observations GENI has matured to the point where it is now time to evaluate its capabilities with respect to domain science applications “at scale” and use the results of this evaluation to inform the direction of future GENI development.
Observations GENI has matured to the point where it is now time to evaluate its capabilities with respect to domain science applications “at scale” and use the results of this evaluation to inform the direction of future GENI development. There is an opportunity to introduce domain scientists to the GENI-way-of-thinking about networking and distributed systems in order to seed the dissemination of GENI technologies to existing and future high throughput and high-performance computing centers
Personnel • Computer Scientists: • Paul Ruth, pruth@renci.org • AnirbanMandal, anirban@renci.org • Domain Scientists: • Jeffery L. Tilson, jtilson@renci.org • Brian Blanton, bblanton@renci.org
Project Goals • Apply the GENI experiment workflow to domain science applications and evaluate GENI’s ability to run domain science experiments • Performance • Ease-of-use. • Provide feedback to GENI rack developers on the current capabilities with respect to science applications as well as target areas for improvement. • Build a tool for domain scientists to create RSpec from high-level descriptions of high-throughput and high-performance applications.
Project Goals • Apply the GENI experiment workflow to domain science applications and evaluate GENI’s ability to run domain science experiments (performance and ease-of-use). • Build a tool for domain scientists to create RSpec from high-level descriptions of high-throughput and high-performance applications. • Provide feedback to GENI rack developers on the current capabilities with respect to science applications as well as target areas for improvement. Bonus Goal: Attract domain scientists to GENI
Domain Science Applications • MotifNetwork • Jeffery L. Tilson (RENCI) • High Throughput Computational Genomics • Identification of functional domains and the identification of conserved functional relationships across large numbers of genomes. • Workflow ensemble (Pegasus/HTCondor) • Task data sets on the order of 50-100 GB. • Implicit iteration • Scales to 1000s of tasks
Domain Science Applications • ADCIRC • Brian Blanton (RENCI) • Storm surge and tide model • Finite element model • MPI tightly coupled • Urgent Computing • Approved by FEMA for computing storm surge flood hazard simulations • Used for Digital Flood Insurance Rate Maps (DFIRMs) • Scales to 10000+ MPI processes
Experiments • Provisioning overhead • creating, destroying, and modifying slices. • GENI for big-data. • Large workflow inputs/output • GENI for MPI. • Tightly coupled
Outreach to Domain Science • Tool for deploying domain science applications easily. • RSpec generation of a Pegasus/HTCondor cluster and MPI cluster. • Extendable to other science slice types. • Introducing domain scientists to GENI. • Feedback from domain scientists to inform future GENI development
Final thoughts on what we will find • Most problems will be on the edges • Disk I/O • Local disk • Network storage • Network virtualization mechanism • Tradeoffs of: virtio, OVS, linux bridges, SR-IOV • More overhead than we realize • Performance isolation • Something unexpected
First year goals • GEC 19 • Applications working on InstaGENI and ExoGENI • Initial performance profile • GEC 20 • Deeper performance profile • Initial suggestions for rack developers