370 likes | 487 Views
Inferring the Topology and Traffic Load of Parallel Programs in a VM environment. Ashish Gupta Peter Dinda Department of Computer Science Northwestern University. Overview. Motivation behind parallel programs in a VM environment Goal: To infer the communication behavior
E N D
Inferring the Topology and Traffic Load of Parallel Programs in a VM environment Ashish Gupta Peter Dinda Department of Computer Science Northwestern University
Overview • Motivation behind parallel programs in a VM environment • Goal: To infer the communication behavior • Offline implementation • Evaluating with parallel benchmarks • Online Monitoring in a VM environment • Conclusions
Motivation • A distributed computing environment based on Virtual Machines • Raw machines connected to user’s network • Our Focus: Middleware support to hide the Grid complexity
Motivation • A distributed computing environment based on Virtual Machines • Raw machines connected to user’s network • Our Focus: Middleware support to hide the Grid complexity • Our goal here: Efficient execution of Parallel applications in such an environment
Parallel Application Behavior Intelligent Placement and virtual networking of parallel applications Virtual Networks With VNET VM Encapsulation
VNET • Abstraction: A set of VMs on same Layer 2 network • Virtual Ethernet LAN
Goal of this project Application Topology An online topology inference framework for a VM environment ? Low Level Traffic Monitoring
Approach Design an offline framework Evaluate with parallel benchmarks If successful, design an online framework for VMs
An offline topology inference framework Goal: A test-bed for traffic monitoring and evaluating topology inference methods
The offline method Synced Parallel Traffic Monitoring Traffic Filtering and Matrix Generation Matrix Analysis and Topology Characterization
The offline method Synced Parallel Traffic Monitoring Traffic Filtering and Matrix Generation Matrix Analysis and Topology Characterization
The offline method Synced Parallel Traffic Monitoring Traffic Filtering and Matrix Generation Matrix Analysis and Topology Characterization
The offline method Synced Parallel Traffic Monitoring Traffic Filtering and Matrix Generation Matrix Analysis and Topology Characterization
The offline method Synced Parallel Traffic Monitoring Traffic Filtering and Matrix Generation Matrix Analysis and Topology Characterization PVMPOV Inference
Synced Parallel Traffic Monitoring Traffic Filtering and Matrix Generation Matrix Analysis and Topology Characterization Infer.pl
Parallel Benchmarks Evaluation Goal: To test the practicality of low level traffic based inference
Parallel Benchmarks used 1 2 3 • Synthetic benchmarks: Patterns • N-dimensional mesh-neighbor • N-dimensional toroid-neighbor • N-dimensional hypercubes • Tree reduction • All-to-All • Scheduling mechanism to generate deadlock free and efficient schemes
Application benchmarks • NAS PVM benchmarks • Popular benchmarks for parallel computing • 5 benchmarks • PVM-POV : Distributed Ray Tracing • Many others possible… • The inference not PVM specific • Applicable to all communication . • e.g. MPI, even non-parallel apps
Patterns application 3-D Toroid 3-D Hypercube 2-D Mesh Reduction Tree All-to-All
PVM NAS benchmarks Parallel Integer Sort
Traffic Matrix for PVM IS benchmark Placement of host1 is crucial on the network
An Online Topology Inference Framework: VTTIF Goal: To automatically detect, monitor and report the global traffic matrix for a set of VMs running on a overlay network
Overall Design • VNET • Abstraction: A set of VMs on same Layer 2 network • Virtual Ethernet LAN
A VNET virtual layer A Virtual LAN over wide area VNET Layer Physical Layer
Overall Design • VNET • Abstraction: A set of VMs on same Layer 2 network • Extend VNET to include the required features • Monitoring at Ethernet packet level • The Challenge here • Lacks manual control • Detecting interesting parallel program communication ?
Detecting interesting phenomenon • Certain address properties • Based on Traffic rate • Etc. Provide support for queries by external agent Rate based monitoring Non-uniform discrete event sampling What is the Traffic Matrix for the last n seconds ? Reactive Mechanisms Proactive Mechanisms Like a Burglar Alarm Video Surveillance
Physical Host VM VNET daemon VNET overlay network Traffic Analyzer Rate based Change detection Traffic Matrix Query Agent To other VNET daemons VM Network Scheduling Agent
Traffic Matrix Aggregation • Each VNET daemon keeps track of local traffic matrix • Need to aggregate this information for a global view • When the rate falls, the local daemons push the traffic matrix (When do you push the traffic matrix ?) • Operation is associative: reduction trees for scalability The proxy daemon
Evaluation • Used 4 Virtual Machines over VNET • NAS IS benchmark
Conclusions A Traffic Inference Framework for Virtual Machines Ready to move on to future steps: Adaptation for Performance Possible to infer the topology with low level traffic monitoring
Current Work • Capabilities for dynamic adaptation into VNET • Spatial Inference Network Adaptation for Improved Performance • Prelim Results: Improved performance upto 40% in execution time • Looking into benefits of Dynamic Adaptation
For more information • http://virtuoso.cs.northwestern.edu • VNET is available for download • PLAB web site: plab.cs.northwestern.edu