350 likes | 548 Views
Protecting VMware Data Off-site. “Tape vs. Cloud Options” Bill Evans, Arkeia Software “Case Study from University of Chicago” Tom Indelli, Senior System Administrator. Data Loss and Data Protection. Causes of Data Loss Strategies for Data Protection Replication Backup.
E N D
Protecting VMware Data Off-site “Tape vs. Cloud Options” Bill Evans, Arkeia Software “Case Study from University of Chicago” Tom Indelli, Senior System Administrator
Data Loss and Data Protection • Causes of Data Loss • Strategies for Data Protection • Replication • Backup
Causes of Data Loss Source: Kroll Ontrack Inc., 2011
Data Protection Strategy #1: Replication • Replication • Additional copy of current data (files, images, objects) • Replication Options • Disk (or RAID) • Synchronous • Expensive: data replicated before transaction completes • Asynchronous • Less expensive: data replication lags behind • Replication Benefits • Offsite data storage • Immediate failover Protected Unprotected Replication Faithfully Copies All Errors; Over 50% Of Data Loss is Unprotected
Data Protection Strategy #2: Backup • Backup • Multiple Point-in-time “Restore Points” • Backup Options • Tape or Disk or Cloud • Hourly, Daily, Weekly, Monthly, Quarterly, Yearly • Backup Benefits • Recovery to time in the past • Offsite data storage
Backup Requirements • Secure • Off-site • Off-line • Frequent Restore Points • Restore Point Objectives (RPO) to minimize data loss-1 -2 -7 -14 -30 -60 -180 -365 days • Rapid Restore Time • Restore Time Objectives (RTO) to minimize down-timehours +4 +3 +2 +1 +0.5 +0.1
Off-site Storage • How to choose? • Costs • Fixed • Variable • Backup window • Time-to-restore (RTO) • Reliability • Convenience Backup Agent Backup Agent Backup Server Backup Server Backup Agent Backup Agent Backup Agent Backup Agent WAN Backup Agent Backup Server Backup Agent Backup Agent WAN Backup Server
Off-site Storage Copy is moved offsite Backup Agent Backup Agent Backup Server Backup Server Backup Agent Backup Agent Backup Agent Backup Agent Backup Agent Backup Server Backup Agent Backup Agent Backup Server Copy is moved offsite
Off-site Storage Strategies • Why is Off-site Storage Important? • Loss, theft, site destruction • Strategies • Tapes on trucks • Replication to the cloud • Costs Cloud Tape Data Volume Protected
University of Chicago: VMware Backup Strategy Tom Indelli Senior Systems Administrator University of Chicago
Organization • University of Chicago • Physical Sciences Division • Activities • Theoretical Chemistry (e.g. Molecular Dynamics) • Theoretical Physics • Science Education
Deployment #1: Data & Servers Analyses are computationally-intensive;Physical platforms deliver best performance • Data • Theoretical Chemistry & Molecular Dynamics • Simulations of atoms using “trajectory files” • 20,000 atoms to 100,000 atoms • Jobs run up to 48 hours • Simulate less than 50 nanoseconds of interactions • Most operation is “batch”, performed on 100-node compute clusters • Protected Servers • 2 Red Hat and 1 MacOS file servers • File servers hold inputs to and results of simulations • 44TB source data
Deployment #1: Data Protection Compression occurs in Arkeia agent, before backups are moved on the LAN Red Hat EL 6.0 Red Hat EL 6.0 MacOS X Arkeia Backup Server v9 on RHEL 2Gbps LAN • Backup Server Solution • Arkeia Network Backup v9 on Red Hat 6.0 • 100TB disk (backup target DAS) • Backup Strategy • Backup to Disk • Weekly full, nightly incremental • Agents backed up concurrently • Offsite Strategy • None
Deployment #2: Data & Servers • Data • Web servers • Management software & data • Support software & data • Uninterrupted operation is critical • Protected Servers • 2 ESXi 4.1 hosts with vCenter 4.1 (facilitates upgrades) • 15 - 20 virtual machines • 3TB source data
Deployment #2: Data Protection Compression occurs in Arkeia agent, before backups are moved over LAN VM A.1 VM A.2 VM A.3 VM B.1 VM B.2 ANB VM Hypervisor #A Hypervisor #B Arkeia Backup Server v9 on RHEL 2Gbps LAN • Backup Server Solution • Backup Strategy • Backup to Disk (20TB EqualLogic SAN) • Weekly full, nightly incremental • Three groups of backups performed in sequence • Replicate to Tape Library (Dell Powervault PL-2000 with LTO4 drive) • Offsite Strategy • Tapes moved to another office
Deployment #2: Backups = 19 LTO4 Cartridges
Deployment #2: vStorage Usage • Backups via vCenter • Backups use Changed Block Tracking (CBT) • Full backups (“Thin full” with CBT) • Incremental backups • Restores • Perform occasional full-image restores • Have tested single-file restores
Costs of Tape vs. Cloud for 18TB Does Not Include Costs of Bandwidth • Tape • 22 LTO4 tapes (18TB) @$30/cartridges = $660 • 1 TL-2000 = $10,000 (amortized over 3 years) • One year costs = $4,000 + tape shuffling • Public Cloud • 18TB @$0.125/GB/month (Amazon) = $2,300/month • One year costs = $28,000
Summary • UChicago has both virtual and physical environments • Physical systems are a better fit for some workloads • Want one backup solution to protect both environments • Off-site storage is required • Off-line is a bonus • vSphere Changed Block Tracking • Accelerates incremental backups • Reduces storage
Thank You Tom Indelli Senior Systems Administrator tindelli@uchicago.edu
Hybrid Cloud Backup • Why Hybrid? • Data Volume Limits • Cloud Infrastructure Requirements
“Hybrid” Cloud Backup • Perform backup on LAN • Fast backups, fast restores • Replicate backups to cloud for safe-keeping • Secure data Backup Agent Backup Server Backup Agent Backup Agent Step 1 Step 2
“Hybrid” Cloud Backup • Full Backup • If time < one week: Over the WAN • If time > one week: Via portable media • Daily Incremental Backup • If time < 24 hours: Over the WAN • If time > 24 hours: Impossible Backup Agent Backup Server Backup Agent Backup Agent Incremental Backup Size Limits Cloud Backup: Incremental Size Is 0.01% to 20% of Full Backup
Cloud Strategies: Replication Window Backup Agent Incremental (1%) Backup Backup Server Backup Agent Full Backup Backup Agent
Role of Deduplication in Backup • Shrinks Data • Reduces Storage • Shortens Backup Window • Data Scenarios • Primary Data • Secondary Data Across/Within Files (e.g. PPT files) Over Time(e.g. outlook.pst) Across computers (e.g. word.exe)
Hybrid Cloud Recovery • Storage-only v.s. Storage-and-Server • File Recovery vs. Disaster Recovery
Cloud Recovery Strategies • Data are Secure • Deduplicated • Compressed • Encrypted • How to Recover/Extract? Backup Agent Backup Server Backup Agent Backup Agent
Cloud Recovery Strategies • How to Recover/Extract? • Restore (via big pipe) to servers in cloud • Restore (via portable media) to new location Backup Agent Backup Server Backup Agent Backup Agent Backup Server
Hybrid Cloud Backup Summary • Alternative to Tape • …But Maximum Data-Protection Limit • Imposed by incremental backup size • Primary Cost of Hybrid Cloud • Bandwidth • (Then target disk) • Pay Attention to Recovery Strategy • Instantiate in Cloud • Recovery on Portable Media
Arkeia Software • Company • Founded 1996; HQ in San Diego • Products • Arkeia Network Backup Suite • Backup/Recovery • Disaster Recovery • Virtual and Physical Environments • vSphere (with CBT), Hyper-V, XenServer • Linux, Windows…AIX, BSD, HP-UX, MacOS, Netware, Solaris (200+ platforms) • Software, Appliances, Virtual Appliances • Disk, Tape, Cloud • Customers • 7,000 mid-market customers in 70 countries • Enterprises, Governments, Service Providers
Please Contact Me Bill EvansArkeia Softwarebill.evans@arkeia.com Resources for last-mile internet for data centers and enterprises • ManonBuettner, Principal • Nuvalo • manon@nuvalo.com • +1 408-605-6455 • Jo Peterson, Regional Manager • Teleproviders • jo@teleproviders.com • +1 949-268-2633
Detail 1 of 3: “Incrementals Forever” Traditional Backup Policy • How Does it Work? • Initially, one full backup • Subsequently, “incrementals forever” t • How to recover disk space at target? • “Synthetic backups” Day 0 1 2 3 4 5 6 7 8 9 …
Detail 2 of 3: Multiple Sources • Deduplication consolidation • Static storage cannot resolve duplicates Backup Agent Backup Agent Backup Server Backup Server • Deduplication vs. Encryption • Dedupe → Compress → Encrypt Backup Agent Backup Agent Backup Agent Backup Agent X X
Detail 3 of 3: WAN Bandwidth • Data Compression • File-grain compression • Examples: LZ-77, JPEG, MPEG • Inter-file deduplication • Examples: SIS, fixed-block, variable-block, progressive-dedupe • TCP Optimization? Warnings: Latency Optimization Bandwidth Optimization No compression of compressed or random data