240 likes | 347 Views
Live migration of Virtual Machines. Nour Stefan, SCPD. Introduction Related work Design Writable Working Sets Implementation Issues Evaluation Future work Conclusions. Introduction. OS virtualization Data centers Cluster computing Live OS migration
E N D
Live migration of Virtual Machines Nour Stefan, SCPD
Introduction • Related work • Design • Writable Working Sets • Implementation Issues • Evaluation • Future work • Conclusions
Introduction • OS virtualization • Data centers • Cluster computing • Live OS migration • Avoid problem of “residual dependencies” • In-memory state can be transferred in a consistent and efficient way • Kernel-internal state • Application-level state • Separation of concerns between Users and Operator of a data center or cluster • Separation of hardware and software considerations, and consolidating clustered hardware into a single coherent management domain • High-performance migration support for Xen
Related work • Collective project • For slow connections and longer time spans • Stop the OS execution while transfer • Zap • NomadBIOS • Pre-copy migration • Not adapting to the writable working set
Design • Migrating memory • Balancing Downtime and Total migration time • Push phase • Stop-and-copy phase • Pull phase • Local resources • Connections to local devices(disks , network interfaces) • Single switched LAN • Generate an unsolicited ARP reply from migrated host, advertising that the IP has moved to a new location • Network-Attached Storage
Implementation Issues • Managed migration • Performed largely outside the migratee • Migration daemons running in the management VM of the source and destination (new VM on destination) • Rounds of copying (dirtied during the previous round) • Dirty bitmap copied from Xen at start of each round • Shadow page tables (read-only page-tables entries => page fault trapped by Xen)
Implementation Issues • Self migration • Implemented within the migratee OS • Migration stub on destination machine • Consistent OS checkpointing • Two-stage stop-and-copy phase • Disables all OS activity except for migration => final scan of dirty bitmap => shadow buffer • Transfer shadow buffer
Implementation Issues • Dynamic Rate-Limiting • Rapid Page Dirtying
Evaluation • Test setup • Dual Intel Xeon 2GHz CPU and 2GB memory • TG3 broadband
Future work • Cluster management • Wide Area Network redirection • Migrating Block Devices
Conclusions • By integrating live OS migration into the Xenvirtual machine monitor we enable rapid movement of interactive workloads within clusters and data centers. Our dynamic network-bandwidth adaptation allows migration to proceed with minimal impact on running services, while reducing total downtime to below discernable thresholds. • Our comprehensive evaluation shows that realistic server workloads such as SPECweb99 can be migrated with just 210ms downtime, while a Quake3 game server is migrated with an imperceptible 60ms outage.