80 likes | 223 Views
Disruptive Technologies in HPC Interconnect. HPC User Forum April 16, 2012. Why Virtualization in HPC?. Benefits of Virtual Clusters Efficient c luster management Snapshots (checkpoint/restart ) Multi-tenant environments Live workload migration. High Performance Computing
E N D
Disruptive Technologies in HPC Interconnect HPC User Forum April 16, 2012
Why Virtualization in HPC? • Benefits of Virtual Clusters • Efficient cluster management • Snapshots (checkpoint/restart) • Multi-tenant environments • Live workload migration • High Performance Computing • Single tasks across multiple CPUs • Traditional virtualization in enterprise • Many tasks across multiple CPUs • Workload consolidation on less machines • Oversubscription of resources HPC App HPC App HPC App HPC App HPC App HPC App HPC App HPC App Operating System Operating System Virtualization Layer
High Performance Virtualization over SR-IOV • HCA shared among Guest OS’es • Virtual Functions assigned to VM • Physical Function controlled by Hypervisor • Complete RDMA software stack run in VMs • Very little throughput and latency overhead • Work in progress to support advance features such as live migration over SR-IOV VM VM VM RDMA Stack RDMA Stack RDMA Stack Hypervisor VF VF VF PF InfiniBand HCA
Co-Design Strategy for Scalable Computing Server/Storage Server/Storage Application Application MPI SHMEM/PGAS Communication Libraries Communication Libraries Bottlenecks Extension of I/O communications (RDMA, collectives, synchronization etc) Network Throughput Latency Message Rate Transport/Collective Offloads Network
Utilizing Co-Design – Mellanox ScalableHPC Mellanox Co-Design MXM - MellanoX Messaging FCA – Fabric Collective Accelerations