Automating Disaster Recovery with Site Recovery Manager
Alec Felgemaker, Sr Systems Engineer, VMware
Agenda
• Overview
• Site Recovery Manager
  • Product Description
  • Benefits
• SRM Walkthrough
• Conclusion: VMware’s Business Continuity Vision
Virtual Datacenter OS from VMware
VMware Infrastructure -> Virtual Datacenter OS
• Applications: SaaS, Web 2.0, .Net, Windows, Linux, J2EE, Grid
• Application vServices: Availability, Security, Scalability, ...
• Infrastructure vServices: vCompute, vStorage, vNetwork
• Cloud vServices
• vCenter (Application Management, Infrastructure Management): Site Recovery Manager, Lifecycle Manager, ConfigControl, Orchestrator, Capacity IQ, Chargeback
VMware Infrastructure: The Safest Place To Run Applications
All available across physical hardware, operating systems, and applications
Challenges of Traditional Disaster Recovery
• Complex Recovery Processes and Infrastructure
• Dependent on Perfect Training, Documentation, and Execution
• Failure to Meet Recovery Requirements
  • Recovery takes days to weeks
  • Recovery tests often fail
  • Significant IT time and resources consumed
Key Features of Virtualization for DR
• Hardware Independence
• Partitioning and Consolidation
• Resource Pooling
• Encapsulation
VMware Infrastructure for Disaster Recovery
“Without VMware Infrastructure, it would have taken us weeks to recover our critical systems when Hurricane Katrina hit our datacenter. VMware Infrastructure enabled us to get our critical systems up and running within 24 hours.” (Scott Fontenette, Hancock Bank)
• Best Disaster Recovery Product 2006, SearchWinComputing.com
• 55% of customers using virtualization for business continuity
VMware Site Recovery Manager
Site Recovery Manager leverages VMware Infrastructure to deliver advanced disaster recovery management and automation
• Simplifies and automates disaster recovery workflows: setup, testing, failover
• Turns manual recovery run books into automated recovery plans
• Provides central management of recovery plans from VirtualCenter
• Works with VMware Infrastructure to make disaster recovery rapid, reliable, manageable, affordable
[Diagram: Production and Recovery sites, each running App/OS virtual machines on VMware Infrastructure]
Site Recovery Manager Use Cases
• Target scenarios
  • Restart of tens or hundreds of VMs in another datacenter
  • Restart can be unplanned (disaster) or planned (migration)
  • Can tolerate RTO of minutes to hours
• Requirements
  • Second site running VirtualCenter and ESX
  • Replicated Fibre Channel or iSCSI LUNs from supported storage vendors
• SRM is not
  • A replication product
  • Geo-clustering for applications in VMs
Site Recovery Manager Key Components
• Site Recovery Manager
  • Manages and monitors recovery plans
  • Tightly integrated with VirtualCenter
• VMware Infrastructure
  • Requires ESX Server 3.0.2 or 3.5
  • Requires VirtualCenter 2.5 or later
• Storage
  • iSCSI or Fibre Channel storage
• Storage Partner Replication
  • Integrated via replication adapters created, certified and supported by replication vendor
[Diagram: VirtualCenter and SRM Servers at each site, App/OS virtual machines on VMware Infrastructure, Storage linked by Partner Replication]
Disaster Recovery Setup
• Integrate with replication
  • Identify which virtual machines are protected by replication configuration
• Map recovery resources
  • Network resources, server resources, management objects
• Create recovery plans
  • For virtual machines, applications, business units
• Convert manual runbook to pre-programmed response
  • Customizable with scripting and callouts
Failover Automation
• Detect site failures
  • Raise alert when heartbeat lost
• Initiate failover
  • User confirmation of outage
  • Granular failover initiation
• Manage replication failover
  • Break replication
  • Make replica visible to recovery hosts
• Execute recovery process
  • Use pre-programmed plan
  • Provide visibility into progress
• Manage networking
  • Put VMs on right VLAN
  • Change IP addresses
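
The detection and initiation bullets above can be read as a two-condition gate: a lost heartbeat only raises an alert, and failover starts once a person confirms the outage. A minimal Python sketch of that reading follows. `SiteMonitor`, `should_fail_over`, and the timeout value are hypothetical names chosen for illustration; they are not part of SRM or any VMware API.

```python
from dataclasses import dataclass


@dataclass
class SiteMonitor:
    """Hypothetical model of heartbeat tracking for the protected site."""
    timeout_s: float            # assumed threshold; not an SRM setting
    last_heartbeat: float = 0.0

    def record_heartbeat(self, now: float) -> None:
        # Remember when the protected site was last heard from.
        self.last_heartbeat = now

    def heartbeat_lost(self, now: float) -> bool:
        # The alert condition: no heartbeat for longer than the timeout.
        return now - self.last_heartbeat > self.timeout_s


def should_fail_over(monitor: SiteMonitor, now: float, user_confirmed: bool) -> bool:
    # A lost heartbeat alone never starts failover; the outage must be confirmed.
    return monitor.heartbeat_lost(now) and user_confirmed
```

Keeping confirmation as a separate input mirrors the slide: detection is automatic, initiation is a human decision.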
Testing
• Replication Management
  • Snapshot replicated LUNs before test
  • Delete snapshots of replicated LUNs after test
• Network Management
  • Change all virtual machines to a test port group before powering them on
• Customization/extensibility
  • Same breakpoints and callouts as failover sequence
  • Extra breakpoints and callouts around the test bubble
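
The ordering in the testing bullets (snapshot first, isolate the network before any power-on, always delete the snapshots afterwards) can be sketched in Python as below. This is an illustration under stated assumptions, not SRM's implementation: `recovery_test_bubble`, `run_recovery_test`, and the logged step strings are hypothetical, and real storage and network calls are replaced by log entries.

```python
from contextlib import contextmanager


@contextmanager
def recovery_test_bubble(replicated_luns, log):
    """Snapshot replicated LUNs for a test and always delete the snapshots after."""
    snapshots = [f"{lun}-snap" for lun in replicated_luns]
    for snap in snapshots:
        log.append(f"create snapshot {snap}")
    try:
        yield snapshots
    finally:
        # Cleanup runs even if a step inside the test raises.
        for snap in snapshots:
            log.append(f"delete snapshot {snap}")


def run_recovery_test(replicated_luns, vms, test_port_group):
    """Return the ordered list of steps a test run would perform."""
    log = []
    with recovery_test_bubble(replicated_luns, log):
        # Every VM is moved to the test port group before any VM is powered on.
        for vm in vms:
            log.append(f"move {vm} to {test_port_group}")
        for vm in vms:
            log.append(f"power on {vm}")
    return log
```

For example, `run_recovery_test(["lun1"], ["web", "db"], "test-net")` yields a log that starts with the snapshot creation and ends with its deletion, with both port-group moves ahead of the first power-on.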
Failback
• Set up DR protection from DR site back to primary site
  • Failover makes VMs reside at the DR site
  • Provide the failed-over VMs with protection
  • Same setup as was done for initial protection
  • Work with storage to reverse replication
• Test failback
  • Test repeatedly, using the same mechanism as test failover
  • Only set the failback date after the plan is perfect
• Failback to primary site
  • Just hit the failover button: failback is failover in the reverse direction
SRM Benefit Summary
1. Accelerate Recovery
2. Ensure Reliable Recovery
3. Simplify Planning and Recovery
4. Expand Disaster Recovery Protection
5. Reduce Cost
6. Enable Compliance
VMware Site Recovery Manager walkthrough
[Diagram: Site A and Site B. Each site has a client to access Virtual Infrastructure Services including SRM protected services, Protected VMs, Infrastructure Services (AD Server, SQL Server, VC + SRM, VDM Server) running on ESX Servers, and Non-Replicated Datastores.]
• Infrastructure Services always on for Site A
• Site A: SRM protected VMs will be recovered to Site B
• Site B: SRM protected VMs are offline in Site B, ready for an SRM failover from Site A
• Datastore replication from Site A to Site B, which is required for SRM protection
SRM Failover Overview
• Shut down protected VMs in Site A
  • If online, orchestrates the controlled shutdown of protected VMs
  • If offline, no action taken against protected VMs in Site A
• Promote the storage in Site B
  • Replicated datastores are promoted to be Read/Write enabled
• Suspend non-critical VMs in Site B
  • VMs identified to be non-critical are shut down during failover
• Protected VMs from Site A powered up in Site B
  • High priority VMs start up first
  • Followed by Normal and Low Priority VMs
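
The failover sequence above can be sketched as a function that emits steps in order. This is a rough Python illustration, not SRM's actual logic: `failover_steps`, `power_on_order`, the tuple layout, and the step strings are hypothetical, and the only behavior taken from the slide is the step order and the High, Normal, Low start-up priority.

```python
# Start-up priority as listed on the slide: High first, then Normal, then Low.
PRIORITY_ORDER = {"high": 0, "normal": 1, "low": 2}


def power_on_order(protected_vms):
    """protected_vms is a list of (name, priority) tuples; returns names in start order."""
    # sorted() is stable, so VMs with equal priority keep their listed order.
    ranked = sorted(protected_vms, key=lambda vm: PRIORITY_ORDER[vm[1]])
    return [name for name, _priority in ranked]


def failover_steps(site_a_online, protected_vms, non_critical_vms):
    """Return the ordered failover steps as plain strings."""
    steps = []
    if site_a_online:
        # A controlled shutdown is only possible when Site A is still reachable.
        steps += [f"shut down {name} at Site A" for name, _priority in protected_vms]
    steps.append("promote replicated datastores at Site B to read/write")
    steps += [f"suspend {vm} at Site B" for vm in non_critical_vms]
    steps += [f"power on {name} at Site B" for name in power_on_order(protected_vms)]
    return steps
```

Calling `failover_steps(False, [("app", "low"), ("db", "high"), ("web", "normal")], ["batch"])` skips the Site A shutdown and powers on `db`, then `web`, then `app`.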
SRM Setup Overview
• Protected Site
  • Pairing of Site A with Site B
  • Array Manager Configuration
  • Inventory Preferences
  • Protection Group Setup
• Recovery Site
  • Recovery Plan Setup
• Test your Recovery Plan
  • SRM allows you to test your recovery plans without impacting production services
Practice makes perfect, so test and test again
Inventory Preferences
Site A -> Site B
• Network
• Resource Pools
• VM Folders
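
One way to picture the inventory preferences is as a lookup table from Site A objects to their Site B counterparts, one table per object type. The Python sketch below assumes that reading. All object names (`Prod-VLAN10`, `DR-Pool`, and so on) and the `map_to_recovery_site` helper are invented for illustration and do not come from SRM.

```python
# Hypothetical Site A -> Site B mappings, one table per inventory object type.
INVENTORY_MAPPINGS = {
    "network": {"Prod-VLAN10": "DR-VLAN110"},
    "resource_pool": {"Prod-Pool": "DR-Pool"},
    "vm_folder": {"Finance": "Finance-Recovered"},
}


def map_to_recovery_site(vm_placement, mappings):
    """Translate a VM's Site A placement into its Site B placement.

    vm_placement maps an object type ("network", "resource_pool",
    "vm_folder") to the Site A object the VM currently uses.
    """
    recovered = {}
    for kind, site_a_name in vm_placement.items():
        try:
            recovered[kind] = mappings[kind][site_a_name]
        except KeyError:
            # An unmapped object would leave the VM without a place at Site B.
            raise KeyError(f"no Site B mapping for {kind} {site_a_name!r}") from None
    return recovered
```

Failing loudly on a missing mapping is a design choice for this sketch: a gap is cheaper to find during setup or a test run than during a real failover.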
Recovery Plan for Complete Site Failover
• Protected VM shutdown
• Prepare Storage
• External scripts
• Suspend non-critical VMs
• Protected VM Recovery: High / Normal / Low
• Recover “No Power On” VMs
Integrated, Automated, and Policy-Based
Instructions
• Name=eCommerce
• Only port 80 is used
• 100 ms web response
• VRM: Encrypt w/ SHA-1
• DR RPO: 1 hour
• Decommission in 1 month
[Diagram: App/OS virtual machines (Load Balance, Firewall, Tomcat, IIS, Oracle) layered as Application and Infrastructure VMs, with HA/FT, Data Protection, and Disaster Recovery, on VirtualCenter Cluster Management over Physical Hardware]
Bringing Automation to the Datacenter
• Separate: Test and Development
• Consolidate: Server Consolidation
• Aggregate: Capacity On Demand
• Automate: Self-Managing Datacenter
• Liberate: Computing Clouds On and Off Premise
Bringing Automation to the Datacenter
Automate: Self-Managing Datacenter
• Facilitates cooperation between VI farms at multiple sites managed by different people
• Fundamentally SRM is a VM migration tool
  • Planned migrations for datacenter moves or consolidation
  • Unplanned migrations in case of disaster
• Strategic relationships with service providers show the beginnings of the “DR to the cloud” movement
Summary
VMware Infrastructure is the safest place to run applications
• Site Recovery Manager brings DR to your server and desktop workloads, today!
• VMware’s partnerships with storage ecosystem help you leverage your existing storage investments and get even more value from them
• VMware brings simplified and reliable DR to the most demanding enterprise applications – as shown here with SAP
• VMware’s vision will continue to drive up availability and drive down cost
For More Information…
Resources on vmware.com:
• Disaster Recovery VMbook
• Site Recovery Manager Evaluator’s Guide
• Site Recovery Manager Compatibility Matrix