1 / 23

Evaluation of Delta Compression Techniques for Efficient Live Migration of Large Virtual Machines

Evaluation of Delta Compression Techniques for Efficient Live Migration of Large Virtual Machines. Petter Svärd, Benoit Hudzia , Johan Tordsson and Erik Elmroth Umeå University, Dept of Computing Science VEE 2011, Newport Beach, CA, USA. Live migration.

micheal
Download Presentation

Evaluation of Delta Compression Techniques for Efficient Live Migration of Large Virtual Machines

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Evaluation of Delta Compression Techniques for Efficient Live Migration of Large Virtual Machines Petter Svärd, Benoit Hudzia, Johan Tordsson and Erik Elmroth Umeå University, DeptofComputing Science VEE 2011, Newport Beach, CA, USA

  2. Live migration “Transfer a VM from one host to another without disrupting services.” • The VM:s state (memory pages) is transferred in the background with the VM still running • The file system is typically located on a NFS and is not moved

  3. Live migration- Typicalalgorithm The time between the VM is suspended and resumed is defined as the downtime Our goal is to reduce the downtime

  4. Live migration- Problems with the typical algorithm When migrating memory intensive VM or over slow NW links: • Memory pages can be dirtied faster than they are transferred over the network • The VM has to be suspended for an extended period of time -> long downtime • Network connections time out and drop / triggers fail Leads to disruption of services

  5. Live migration- Problems with the typical algorithm (cont) Problem dirtying rate >migration throughput Possible Solutions Decrease dirtying rate or increase migration throughput • Decreasing dirtying rate might hurt server performanceand disrupts services Increase migration throughput!

  6. Delta compression- Increasing migration throughput Overall idea: transfer changes to pages instead of the full page contents thus increasing migration throughput • Store sent pages in a cache • When transferring, if the page is cached, compute an XOR delta page • Compress the delta page

  7. Delta compression- continued Vanilla (no compr.) • Wasting time on cache misses • Efficient caching scheme and compression algorithm is vital! Delta compression

  8. Delta compression- caching Desired properties: • Lean • Constant seek time regardless of size L2 caching scheme

  9. Delta compression- compression Desired properties: • Lean (low cpu usage) • Effective (high compression ratio) • General purpose The XOR delta page is suitable for RLE compression • (Symbol)(Repetitions) → AAAAABBBCCCCC = 5A3B5C XOR BinaryRunLengthEncoding -> XBRLE

  10. XBRLE compression- Sourcesidealgorithm

  11. XBRLE compression- Destination sidealgorithm

  12. XBRLE compression- conceptual illustration

  13. Implementation Modified version of qemu-kvmuserspace code to support the XBRLE migration algorithm. Lean, ~500 LoC Evaluation done on version 0.11.2

  14. DemoMigrating streaming video over 10Mbit/s Before migration:

  15. DemoMigrating streaming video (cont) After migration:

  16. DemoMigrating streaming video (cont)

  17. Evaluation- Test cases • Memory write benchmark (lm_bench) • 1 GB RAM, 1 vcpuVM • Near ideal case • Transcoded HD Video • 1 GB RAM, 1 vcpuVM • Real-world, non-ideal case • SAP ERP application • 8 GB RAM, 4 vcpus VM • Large business application • Relies on transactions and is thus sensitive to extended downtime

  18. Evaluation- Experimental setup Benchmark and HD Video 2x 2,66GHz core2quad 16GB RAM NFS share on source machine 100Mbit/s Network SAP ERP 2x 3,0GHz Xeon dual-core 32GB RAM 16TB Raid 5, 6Gbits/s trunked NFS server 1000Mbit/s Network

  19. Evaluation- Benchmark • Downtime reduced by a factor of 100 • Throughput increased by 63 %

  20. Evaluation- Streaming video • UDP downtime reduced from 8 s to 1 • Migration is transparent using XBRLE

  21. Evaluation- SAP ERP Vanilla XBRLE • The ERP application was non-responsive on resume using the vanilla algorithm but survived using XBRLE • “Rule of thumb” is that more than 0.5 s of downtime might hurt the system. Measured downtime was 0.2 for XBRLE and 2 for vanilla.

  22. Conclusion • Delta compression works well migrating • VMs running workloads with a highly compressible working set • VMs running heavy workloads with large working sets • and/or over slow networks (i.e., WANs).

  23. Futurework • Page priority algorithm • Avoid re-sends of pages that are dirtied frequently • Promising early results

More Related