Business Continuity For busy IT people GOETEC seminar 16th February 2012
A bit about me David Hayling
Kent MAN operations manager for 10 years • Microwave radio links (rain, trees) • ATM (LANE, clock) • BT circuits, first wavestream (spares) • The BT ‘excuse book’ (back breaking)
Christ Church Infrastructure Manager • One or two interesting experiences: flood, fire, pestilence …
“In theory, theory and practice are the same. In practice, they are not.” (attributed to Albert Einstein)
City University fire 2001 [The Guardian, Tuesday 22 May 2001] “Around 300 people had to be evacuated from City University's college building in central London last night, after a fire gutted the roof and fourth floor offices. Students continued to sit their examinations today.”
Causes of outage BCS – BC in practice
Five golden rules of business continuity British Computer Society
Understand the business requirements • Institutional DR / BC plan • Make friends with the auditor • Insurance officer • Check with fellow service providers • Estates • Senior managers • Your manager
Five golden rules • Understand the business requirements • Commit time and effort from across the business • Internal communications is critical • Documentation should match the organisation • Test the plan
Hardware fault BCS – BC in practice
Hardware fault • Look at your key business systems • Network • AAA • Key services – web, mail, teaching
Hardware fault • Identify single points of failure • Risk assess • Mitigate / accept • RAID 1, 5, 10, … • SAN • Virtualisation
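The identify, assess, mitigate / accept steps above can be sketched as a small risk register. This is an illustrative sketch only, not something from the talk: the component names, the instance counts, the 1 to 5 likelihood and impact scales, and the threshold of 10 are all assumptions chosen for the example.

```python
from dataclasses import dataclass


@dataclass
class Component:
    name: str
    instances: int   # number of independent copies in service
    likelihood: int  # assumed scale: 1 (rare) to 5 (frequent)
    impact: int      # assumed scale: 1 (minor) to 5 (severe)

    @property
    def is_single_point_of_failure(self) -> bool:
        # Anything with fewer than two independent copies has no failover.
        return self.instances < 2

    @property
    def risk_score(self) -> int:
        return self.likelihood * self.impact


def assess(components, threshold=10):
    """Return (name, score, decision) for each single point of failure."""
    results = []
    for c in components:
        if not c.is_single_point_of_failure:
            continue
        decision = "mitigate" if c.risk_score >= threshold else "accept"
        results.append((c.name, c.risk_score, decision))
    # Highest risk first, so the list doubles as a priority order.
    return sorted(results, key=lambda r: r[1], reverse=True)


# Hypothetical inventory, for illustration only.
inventory = [
    Component("core switch", instances=1, likelihood=2, impact=5),
    Component("mail server", instances=2, likelihood=3, impact=4),
    Component("print server", instances=1, likelihood=2, impact=2),
]

for name, score, decision in assess(inventory):
    print(f"{name}: score {score} -> {decision}")
```

The threshold is a business decision rather than a technical one, which is one reason to agree it with the people listed under "Understand the business requirements".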
Hardware fault • Hierarchy of Database Needs (www.brentozar.com)
Hardware fault • Test your backups • Can you recover the data? • How long does it take? • Maintenance contracts • What do they cover: just break/fix, or replacement? • Cold spares • Check you can deploy
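The two backup questions above (can you recover the data, and how long does it take) can be answered by a restore test that is timed and then checked file by file. The following is a minimal sketch under stated assumptions: the function names are hypothetical, `restore` stands in for whatever command or script actually performs the restore, and the comparison assumes the original data is still available to check against.

```python
import hashlib
import time
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Hash a file in 1 MiB chunks so large files do not fill memory."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(1024 * 1024), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_restore(original_dir, restored_dir):
    """Return relative paths that are missing or differ after a restore."""
    original_dir, restored_dir = Path(original_dir), Path(restored_dir)
    problems = []
    for src in sorted(original_dir.rglob("*")):
        if not src.is_file():
            continue
        rel = src.relative_to(original_dir)
        restored = restored_dir / rel
        if not restored.is_file() or sha256_of(src) != sha256_of(restored):
            problems.append(rel.as_posix())
    return problems


def timed_restore(restore, original_dir, restored_dir):
    """Run the restore callable, then return (seconds taken, problem list)."""
    start = time.monotonic()
    restore()  # hypothetical: wraps the real restore procedure
    elapsed = time.monotonic() - start
    return elapsed, verify_restore(original_dir, restored_dir)
```

An empty problem list together with an elapsed time inside the agreed recovery window is the result to aim for; anything else is worth finding out in a test rather than during an outage.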
Human error BCS – BC in practice
Human error • Change control • Don’t change unless you know (and have written down): why, what, when, to what, who to tell, what success looks like, backout plan, test plan • Working mobile phones • Normally used
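The change control checklist above maps naturally onto a record that refuses to proceed while any item is blank. This is a sketch for illustration rather than a prescribed tool: the class name, field names and example values are assumptions derived from the eight items on the slide.

```python
from dataclasses import dataclass, fields


@dataclass
class ChangeRequest:
    # One field per item on the checklist; blank means "not written down".
    why: str = ""
    what: str = ""
    when: str = ""
    to_what: str = ""
    who_to_tell: str = ""
    success_criteria: str = ""
    backout_plan: str = ""
    test_plan: str = ""


def missing_items(change: ChangeRequest):
    """List the checklist items that are still empty or whitespace only."""
    return [f.name for f in fields(change) if not getattr(change, f.name).strip()]


def ready_to_proceed(change: ChangeRequest) -> bool:
    return not missing_items(change)


# Hypothetical example: a change with no backout or test plan yet.
change = ChangeRequest(
    why="Vendor security patch",
    what="Firmware upgrade",
    when="Saturday 06:00",
    to_what="Core switch",
    who_to_tell="Service desk, Estates",
    success_criteria="All uplinks up, no packet loss",
)

print(missing_items(change))
print(ready_to_proceed(change))
```

The same check works equally well as a paper form; the point is that the change does not start until every item has an answer.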
Software malfunction BCS – BC in practice