Learn about the rising importance of business continuity and the risks of not having a solid disaster recovery plan. Discover key industry terms, IT risks, and the elements of business continuity and disaster recovery. Explore the evolution of HA/DR solutions and the impact of changes in IT infrastructure.
Rethinking Data Protection/Disaster Recovery: A Business Continuity IT Primer. Paul Mallon, Technical Account Manager, UK Public Sector
Market Indicators Show Attention to Business Continuity is Rising* • 59% of CIOs surveyed have increased spending on, and focus on, Business Continuity initiatives • Only 26% of all organisations have calculated the cost of downtime • 66% of enterprises don’t test their disaster recovery plan yearly • 70% of organisations that cannot access their data within 7-10 days are at risk of going out of business • Current practice focuses only on compliance with Disaster Recovery requirements *Based on Financial Times paper, “Business Continuity and Disaster Recovery”, Symantec customer research, EMS reports
Industry Terms • Disaster – Any unplanned event that significantly impacts business functions • Disaster Recovery – Activities and programs designed to return the compute power of an organization to an acceptable condition; the ability to respond to an interruption in services by implementing a disaster recovery plan to restore an organization's critical business applications
Industry Terms • Business Continuity Management – A holistic management process that identifies potential impacts that threaten an organization and provides a framework for building resilience with the capability for an effective response that safeguards the interests of its key stakeholders, reputation, brand and value creating activities
IT Industry Terms • Recovery Point Objective (RPO) • Data Loss Management, or how old can the data be before there is significant impact to the business? • Recovery Time Objective (RTO) • Recovery Time Management, or how long can end users wait to get data and applications back before there is significant impact to the business? • Measured from time of interruption to resumption
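The two measures above can be sketched with plain timestamps. This is a minimal illustration, assuming a hypothetical incident; the function names, dates, and objectives are invented for the example and do not come from any product.

```python
from datetime import datetime, timedelta


def data_loss_window(last_recoverable_copy: datetime, interruption: datetime) -> timedelta:
    """Age of the newest recoverable data when the interruption happened (compare with RPO)."""
    return interruption - last_recoverable_copy


def downtime(interruption: datetime, resumption: datetime) -> timedelta:
    """Time from interruption to resumption (compare with RTO)."""
    return resumption - interruption


def meets_objective(actual: timedelta, objective: timedelta) -> bool:
    """An objective is met when the actual value does not exceed it."""
    return actual <= objective


# Hypothetical incident: nightly backup at 02:00, outage at 14:30, service back at 18:30.
last_backup = datetime(2024, 1, 10, 2, 0)
outage_start = datetime(2024, 1, 10, 14, 30)
service_restored = datetime(2024, 1, 10, 18, 30)

loss = data_loss_window(last_backup, outage_start)
outage = downtime(outage_start, service_restored)

# Hypothetical objectives: lose at most 4 hours of data, be back within 8 hours.
rpo_met = meets_objective(loss, timedelta(hours=4))
rto_met = meets_objective(outage, timedelta(hours=8))
```

In this made-up incident the service returns within the 8-hour RTO, but 12.5 hours of data are at risk, so a 4-hour RPO is missed. The two objectives are independent and need to be set separately.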
List of IT Risks That Create Business Outages is Growing
Non-IT Risks
• Business Risk: Market risk, Credit risk, Interest rate risk, Currency risk
• Operational Risks / Other Risks: Business process, People and talent, Environment, Physical infrastructure
IT Risks
• Security Risk: Computer crimes, Internal breaches, Cyber terrorism
• Availability Risk: Configuration changes, Lack of redundancy in architectures, Human errors
• Performance Risk: Distributed architectures, Peak demand, Heterogeneity in the IT landscape
• Scalability Risk: Business growth, Provisioning bottlenecks, Silo-ed architectures
• Recoverability Risk: Hardware and/or software failures, External threats such as security, Natural disasters
• Compliance Risk: Government regulations, Corporate governance guidelines, Internal policy
Chart labels: Business Continuity Elements, Disaster Recovery Elements
Sound Familiar? This is how much it will cost… How much time and data can you afford to lose? Guess I don’t need it that fast! Needs vs. Wants
HA/DR Problems 2001 - Now (diagram: Local Datacenter, DR Site) • Duplication is Very Expensive • Hardware Failure • Data Loss
HA/DR Solutions Circa 2001-2006 (diagram: Local Datacenter, DR Site) • Proliferation of Two Node Clusters • Data Replication • Tape Backup • Poor Server Utilization • Complicated Management • Low RPO and RTO • Tapes were difficult to recover • Data at DR site, but application still down • Maturation of Company’s Disaster Recovery Strategies
Changes in IT and Impact on HA/DR Everything is Mission Critical Scale of IT (Larger Data Centers) Human Impact
Data Center Complexity: 100+ Tools Required
Layers: Databases, Middleware, Applications, Network, Storage, Servers, Virtual Machines
Tool categories: Data Protection, Storage Management, Server Management, Application Performance
Tools shown: NetWorker Galaxy ArcServe Media Mirror DiskXtender EmailXtender TSM SAM-FS Data Migrator RSS NearStore BrightStor Mobile Backup ECC AppIQ Creekpath HiCommand TPM SAN Copy MirrorView RepliStor TrueCopy DoubleTake PPRC SRDF MPIO Sun SRM ReiserFS SAN Navigator Aperi ShadowImage InstantImage SnapView Shadow Copy FlashCopy TimeFinder Ext3 SANFS PowerPath ServiceGuard Sun Cluster MSCS HA-CMP TrueCluster IBM TPM / TIO BMC HP OpenView CA Jumpstart Opsware Bladelogic Tivoli Data Protector EDM NT Backup OnTap NetVault LiveVault SyncSort Retrospect Ultrabac Tapeware DLM DLM LVM SVM ASM MDUX SVC LDM OCFS DFM UFS ZFS JFS GPFS Altiris ClusterFrame Polyserve GeoSpan Qlusters SteelEye Kickstart N1 Grid HP UDC ADS, SMS Marimba AppManager OEM Patrol Foglight DBArtisan DGI Topaz CCMS PAC Optane Silk TheGuard eHealth Vantage PathFinder Introscope JProbe Sitraka MOM Performasure Tivoli Patrol Corefirst Appsight
Manage Data Center Complexity • Built-in management tools: Location, Platform, Physical or Virtual • Same tools across every platform • Centralized Management • Powerful Reporting
HA/DR for Virtualization Increased need for Production Level HA/DR for business critical applications Taking Virtualization to another level: Mixed Physical and Virtual Environments Management of Physical and Virtual Data Centers
Virtualization Means Even More Platforms • vPars • nPars • Virtual Machines • Secure Resource Partitions • Zones • Containers • LPARs • Micro-Partitions Application Availability, Disaster Recovery, Server Provisioning, Config Management Storage Management
Symantec Solutions for Physical & Virtual Servers • vPars • nPars • Virtual Machines • Secure Resource Partitions • Zones • Containers • LPARs • Micro-Partitions Veritas HA/DR Solutions Application Availability, Disaster Recovery, Server Provisioning Storage Management
Changes in IT and Impact on HA/DR Everything is Mission Critical Scale of IT (Larger Data Centers) Human Impact
The Human Impact • Interdependent web of applications, databases, services • Changes to one application impact another • Configuration drift in clustered environments prevents proper failover • “80% of mission critical application downtime is caused by people and process issues (versus technology issues).” Leading IT Analyst
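Configuration drift between cluster nodes can be illustrated with a simple comparison of settings. The sketch below is only an illustration: the setting names, values, and the `find_drift` helper are hypothetical, and real tools collect far more than a flat dictionary.

```python
MISSING = "<not set>"  # placeholder for a setting present on only one node


def find_drift(primary: dict, standby: dict) -> dict:
    """Return {setting: (primary_value, standby_value)} for every mismatch."""
    drift = {}
    for key in sorted(set(primary) | set(standby)):
        left = primary.get(key, MISSING)
        right = standby.get(key, MISSING)
        if left != right:
            drift[key] = (left, right)
    return drift


# Hypothetical settings gathered from two nodes of the same cluster.
node_a = {"db_version": "11.2", "mount_point": "/data", "listener_port": 1521}
node_b = {"db_version": "11.1", "mount_point": "/data"}

drift = find_drift(node_a, node_b)
for setting, (on_a, on_b) in drift.items():
    print(f"{setting}: primary={on_a} standby={on_b}")
```

Any entry in the result is a difference that could stop the standby node from taking over cleanly, which is why such checks are worth running before a failover is needed.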
Proactive HA/DR through Configuration Management • Identifying problems • Proactively determine the impact of a change • Tracking changes over time DISCOVERY DEPENDENCY MAPPING CHANGE CONTROL
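Determining the impact of a change from a dependency map can be sketched as a graph walk. This is a minimal illustration, assuming a hypothetical dependency map; the component names and the `impacted_by` helper are invented for the example.

```python
from collections import deque

# Hypothetical dependency map: each component lists what it depends on.
DEPENDS_ON = {
    "web_portal": ["app_server"],
    "app_server": ["orders_db", "message_queue"],
    "reporting": ["orders_db"],
    "orders_db": ["san_volume"],
    "message_queue": [],
    "san_volume": [],
}


def impacted_by(changed: str, depends_on: dict) -> set:
    """Return every component that directly or indirectly depends on `changed`."""
    # Invert the map so we can walk from a component to its dependents.
    dependents = {}
    for name, deps in depends_on.items():
        for dep in deps:
            dependents.setdefault(dep, []).append(name)

    seen = set()
    queue = deque([changed])
    while queue:
        current = queue.popleft()
        for parent in dependents.get(current, []):
            if parent not in seen:  # also guards against cycles
                seen.add(parent)
                queue.append(parent)
    return seen


affected = impacted_by("orders_db", DEPENDS_ON)
print(sorted(affected))
```

With this made-up map, a change to the database reaches the application server, the reporting service, and the web portal, even though only two of them talk to the database directly.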
Non Disruptive Testing (diagram labels: Primary Site, Secondary Site, Prod SG, FD SG, Snapshot) • Never Tested DR Plan 78% • Testing Impacts Production • Testing is Inconvenient
Changes in IT and Impact on HA/DR Everything is Mission Critical Scale of IT (Larger Data Centers) Human Impact
Everything is Business Critical Government Regulations End User Expectations Importance of IT in the Business Expectations are no downtime or data loss, however be mindful of costs!
Making HA/DR Work and Affordable Backup for All Employees Larger Clusters Increase Server Utilization Remote Office Data Center Server and Storage Tiering Zero Data Loss That’s Affordable Tier One Tier Two Tier One
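The link between larger clusters and server utilization can be shown with simple arithmetic. The sketch below assumes an active/passive layout in which spare nodes sit idle until a failure; that layout and the node counts are assumptions for illustration, not details from the slides.

```python
def productive_fraction(active_nodes: int, spare_nodes: int) -> float:
    """Share of servers doing productive work when spares idle until a failover."""
    total = active_nodes + spare_nodes
    if active_nodes < 0 or spare_nodes < 0 or total == 0:
        raise ValueError("node counts must be non-negative and not both zero")
    return active_nodes / total


# Two-node cluster: one active server, one idle standby.
two_node = productive_fraction(active_nodes=1, spare_nodes=1)

# Larger cluster: seven active servers sharing a single spare.
eight_node = productive_fraction(active_nodes=7, spare_nodes=1)

print(f"two-node cluster: {two_node:.1%}")
print(f"eight-node cluster: {eight_node:.1%}")
```

Under these assumptions, sharing one spare across seven active servers raises the productive share from half of the hardware to seven eighths, while still tolerating a single server failure.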
Dual Use DR Sites Production Site Test/Dev Site LOCAL CLUSTER WIDE-AREA CLUSTER -Site Outage- Test/Dev Servers Reprovisioned • Architecture graphic? • Intuit example? MIRRORING REPLICATION
DR Requirements (time scale: SECONDS, MINUTES, HOURS, DAYS)
• RECOVERY TIME (application): Manually rebuild servers • Automated server provisioning (Provisioning Manager) • Wide-area clustering (Cluster Server)
• RECOVERY POINT (data): Tape backup (NetBackup) • Asynchronous replication (Volume Replicator) • Synchronous replication (Storage Foundation or Volume Replicator)
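Matching a requirement to one of the options above can be sketched as a lookup. The durations below are illustrative placeholders chosen only to preserve the ordering on the chart; they are not vendor figures, and the `slowest_option_that_fits` helper and its assumption that slower options cost less are hypothetical.

```python
from datetime import timedelta

# Illustrative worst-case values ONLY (assumed), listed fastest first.
DATA_OPTIONS = [
    ("Synchronous replication", timedelta(0)),
    ("Asynchronous replication", timedelta(minutes=5)),
    ("Tape backup", timedelta(days=1)),
]
APPLICATION_OPTIONS = [
    ("Wide-area clustering", timedelta(minutes=5)),
    ("Automated server provisioning", timedelta(hours=4)),
    ("Manually rebuild servers", timedelta(days=3)),
]


def slowest_option_that_fits(options, objective):
    """Return the slowest option that still meets the objective, or None.

    Assumes `options` is sorted fastest first and that slower options
    are cheaper, so the last fitting entry is the most economical one.
    """
    fitting = [name for name, worst_case in options if worst_case <= objective]
    return fitting[-1] if fitting else None


# Hypothetical requirement: lose at most 1 hour of data, recover within 8 hours.
data_choice = slowest_option_that_fits(DATA_OPTIONS, timedelta(hours=1))
app_choice = slowest_option_that_fits(APPLICATION_OPTIONS, timedelta(hours=8))
print(data_choice, "+", app_choice)
```

Picking the slowest option that still meets the objective reflects the "needs vs. wants" point made earlier: paying for seconds of recovery makes sense only when the business impact justifies it.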
Business Continuity Solutions from Symantec • Plan: Design and implement Business Continuity and Disaster Recovery Requirements and Strategies • Protect: Protect against security threats and avoid outages by proactively monitoring configurations and doing regular testing • Respond: Ensure that service delivery objectives are met through data protection, replication, clustering, and data recovery
Thank you • Paul Mallon • Technical Account Manager • UK Public sector • Paul_Mallon@Symantec.com