140 likes | 271 Views
Virtual Machine Appliances for Ad-hoc, Opportunistic Grids. Renato Figueiredo ACIS Laboratory University of Florida. Overview. Goal: plug-and-play, easy to install software for opportunistic computing Use cases: Desktop campus grids Ad-hoc lab clusters Pooling across multiple domains
E N D
Virtual Machine Appliances for Ad-hoc, Opportunistic Grids Renato Figueiredo ACIS Laboratory University of Florida
Overview • Goal: plug-and-play, easy to install software for opportunistic computing • Use cases: • Desktop campus grids • Ad-hoc lab clusters • Pooling across multiple domains • Technologies: • Virtual machines (VMs, Xen, VMware) • Virtual networks (a la VPN) • Batch schedulers (Condor)
Virtual machines VM image Context WOWs • Wide-area • Virtual machines • Self-organizing overlay IP tunnels, P2P routing NOWs, COWs • Local-area • Physical machines • Self-organizing switching (e.g. Ethernet spanning tree) Installation image Switched network Physical machines
SURAGrid context • This can be a vehicle for dissemination of Grid middleware to institutions • E.g. facilitate the deployment of desktop campus grids • Can pool resources together across multiple sites through Condor flocking • Complementary to the existing SURAgrid setup of more traditional cluster infrastructures
1) System Virtual Machines • Virtualization of instruction sets (ISAs) • Language-independent, binary-compatible (not JVM) • VMware, Microsoft, Xen, Parallels, … Intel VT, AMD Pacifica • ISA + OS + libraries + software = execution environment • Time-share multiple O/Ss • Near-native performance for CPU-intensive workloads
Networking VMs • System VM isolates user from host • Great! Now how do I access it? • Users: want full TCP/IP connectivity • Facilitate programming and deployment • But cross-domain communication subject to NAT, firewall policies • Providers: want to isolate traffic • Users with admin privileges inside VM still pose security problems: viruses, DoS
2) Virtual networking • Isolation: dealt with similarly to VMs • Multiple, isolated virtual networks time-share physical network • Key technique: tunneling • Similar to VPNs • Our approach: peer-to-peer network tunnelling • Virtual network should be self-configured • Avoid administrative overhead of VPNs • Including cross-domain NAT traversal • Virtual network should be isolated • Virtual private address space decoupled from Internet address space
Example – physical machines Hosts: 2.4GHz Xeon, Linux 2.4.20, VMware GSX Host: 1.3GHz P-III Linux 2.4.21 VMPlayer Host: 1.7GHz P4, Win XP SP2, VMPlayer Wide-area Overlay of virtual Workstations (WOW) 34 compute nodes, 118-node PlanetLab P2P routers
Example: virtual view Looks like a cluster Heterogeneous hardware, but homogeneous software PBS scheduler and NFS server on head node, 32 worker nodes 4000 jobs 1 job/second PBS head node NFS server WOW worker nodes
3) Condor • High-throughput computing scheduler • University of Wisconsin – circa 1988 • Fault-tolerant, scalable, flexible • Pools of few machines to 1000s exist today • Standard and vanilla worlds • Condor-linked – checkpointing & migration • Vanilla – unmodified applications • Great for long-running, parameter sweeping sequential jobs • Easy to submit large # jobs from single command • Flocking • Enable jobs submitted to a local pool that is 100% utilized to “flock” to remote pools
Use case: putting things together • A VM “appliance” for Condor-based opportunistic computing • We created and tailored the appliance • Takes expertise and time • E.g. a Condor appliance that self-configures ad-hoc pools with flocking • Can complement with well-packaged examples and documentation for users to get started quickly • Users download and boot VM up • This takes zero configuration • VM acquires a virtual IP address and becomes routable on the overlay, becoming a resource • E.g. allowing Condor jobs to run and to be submitted
Demonstration • You can try out this software on any x86-based Windows or Linux PC • http://www.acis.ufl.edu/~ipop/grid_appliance • Follow README file: • Should be able to install VMware and boot up appliance in 15-30 minutes • And run a demo Condor job on an ad-hoc pool with machines at UF right after boot up
Demo screenshot VMware Condor pool access Debian VM Windows XP host
For further information • http://wow.acis.ufl.edu • Code, papers • http://www.cs.wisc.edu/condor • Plentiful info on Condor • http://www.vmware.com • Free VMs for x86 Linux/Windows (Player, Server)