1 / 34

an e-business perspective Digitask Consultants, Inc. digitask@digitask (212) 682-6652

High Availability High Performance Systems. an e-business perspective Digitask Consultants, Inc. digitask@digitask.com (212) 682-6652. What is High Availability?. Uptime Levels (%). Annual Downtime. Availability Classification. Fault Tolerant 99.9999 < 1 minute

Download Presentation

an e-business perspective Digitask Consultants, Inc. digitask@digitask (212) 682-6652

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. High AvailabilityHigh Performance Systems an e-business perspective Digitask Consultants, Inc. digitask@digitask.com (212) 682-6652

  2. What is High Availability? Uptime Levels (%) Annual Downtime Availability Classification Fault Tolerant 99.9999 < 1 minute Extremely High Availability 99.999 5 minutes Fault Resilient High Availability 99.99 53 minutes High Availability 99.9 8.8 hours Commercial Availability 99.5 43.8 hours Sources: Gartner Group, Transaction Processing Performance Council, Compaq

  3. Opportunity! Why Should I Care?

  4. Why Should I Care?

  5. Average Cost per Hour of Downtime Industry Application Cost of Downtime Financial Brokerage operations ??????? $ 6,500,000 Financial Credit Card Sales ??????? $ 2,600,000 Media Pay Per View ??????? $ 150,000 Retail Home Shopping (TV) ??????? $ 113,000 Retail Catalog Sales ??????? $ 90,000 Transportation Airline Reservations ??????? $ 89,500 Source: Gartner Group and Contingency Planning Research

  6. Cost of Downtime - Equity Value

  7. Cost of Downtime - Equity Value

  8. Top 6 Reasons for Server Failure • Software defects/failures • Planned administrative downtime OS upgrades, DB administration, etc. • Operator error • Hardware outage/maintenance • Building/site disaster fires, sprinkler systems • Metropolitan disaster storm, floods Survey of IS managers Source: Gartner Group

  9. The Solution... Compaq Clusters OpenVMS Tru64 UNIX TruClusters Windows NT

  10. Performance & Availability Production Server Available Server Availability TRU64 UNIX AlphaServer Systems Foundation TruClusters - Yesterday V 1.x Each builds upon the other

  11. HSZx0 HSZx0 Private Disks System Disk Private Disks System Disk TruCluster 1.x Memory Channel Interconnect

  12. TruCluster 5 Memory Channel Interconnect

  13. TruCluster Server Version 5.0 • Single system image cluster • Shared file system • Dramatically easier management • Simpler application availability and scalability

  14. / /usr /var /... /... /... /... /... /... TruCluster 5.0 Feature Summary • Easier management • Clusterwide file system • Cluster alias • Application availability facility • Cluster wide storage • Support for larger & more flexible configurations • No requirement for symmetric configurations • No need for private storage (all storage can be on shared buses)

  15. UNIX Workstation X11 Web / Java PC Tru64 UNIX System Management (LAN) SingleSystem SNMP SNMP WBEM WBEM Cluster (LAN / WAN) CLI Script Tru64 UNIX Management Tru64 UNIX Management (LAN / WAN)

  16. / /usr /var /... /... /... /... /... /... Cluster Management The best cluster management is the management you NEVER have to do! TruCluster V5.0 Traditional UNIX Clusters

  17. / /usr /var /... /... /... /... /... /... Cluster File System • Single cluster-wide namespace with a single shared root • Same view from all cluster members • Mechanism to address member-specific files • Client/Server model initially • Layers on existing file system • AdvFS, NFS, UFS (r/o), CDFS • Transparent file system failover and recovery • Integrated with cluster alias for NFS server

  18. System and Storage Management • CFS is an enabling technology • Most management operations “just work” • Single copy of most configuration files • Device names are consistent cluster-wide • Storage devices are available everywhere • Fewer things to manage • Operating system and applications installed once per cluster • Automatic disk and file system failover • Single security domain • Base and enhanced security

  19. System and Storage Management • BUT… Still must manage some things separately • Kernel tuning, process tuning • Network adapter, tty configuration • Licensing

  20. Cluster Alias Client Client Client Router • Cluster appears as single system to network • Can support multiple aliases • Single host name to clients • Transparent handling of node and adapter failures • Dynamic load balancing • Network services • Efficient forwarding over cluster interconnect Cluster - canine 1.1.1.0 Retriever AlphaServer labrador 1.1.1.1 AlphaServer golden 1.1.1.2 AlphaServer basset 1.1.1.3 AlphaServer bluetick 1.1.1.4 Hound

  21. Application Support • Applications need only be installed once in the cluster although may be licensed per node • Single instance applications • May only run on one member of a cluster at a time • Multiple copies would conflict with each other • Typical old-style ASE applications

  22. Application Application Application Single Instance Applications Channel Memory Interconnect

  23. Application Support • Multiple instance applications • May run on multiple or all cluster members • Multiple copies don’t conflict • Some ASE applications can now run on multiple members

  24. Application Application Application Application Multi-Instance Applications Channel Memory Interconnect

  25. Cluster Application Availability • Provides application failover or restart within the cluster • Application and resource dependencies • Application profile determines failover policy and dependencies • Mechanism for application-specific monitoring • Monitoring of applications via ‘check’ entry in action script • Command line and GUI-based management • ASE application start/stop scripts easily migrate with minimal changes

  26. Application Support • Cluster aware applications • Use cluster features such as the Distributed Lock Manager • Coordinate storage r/w access from multiple nodes

  27. Multi-Instance Applications Application Application Application Application Channel Memory Interconnect

  28. Load Balancing Dynamic load balancing of client connections Cluster Management Applications are installed once for entire cluster Configuration changes made once for the cluster Users are authorized once for all cluster nodes Installation and Configuration Rolling upgrade of o/s Single system image Cluster-wide file system Cluster alias Single security domain Cluster-wide naming of storage devices Single event manager/error log TruCluster Advantages Over Other UNIX Clusters

  29. Hardware specifics Interconnect speed (6-12x faster) Maximum number of nodes: 8 Largest Node supported: ~125,000 tpm-C Smallest node: AlphaServer 800 (<$7,000) Support for Switched Fibre Channel Support for simultaneous direct access to database tables Available API for parallel resource locking Available up-time guarantee 99.99% TruCluster Advantages Over Other UNIX Clusters

  30. Bottom Line - Management Traditional UNIX Clusters Single Systems $ TruCluster Server V5.0 Number of Nodes Tru64 UNIX TruClusters cost less to manage

  31. Bottom Line - Reliability Uptime Guarantees99.99% Plus joint effort by COMPAQ & the customer Business Critical Custom For Eligible Alpha systems Availability Review On-site Spares Installation Priority Executive Package Intimacy of Partnership Priority PremierPackage Customer need for high availability Tru64 UNIX TruClusters are more reliable

  32. Bottom Line - Size Tru64 UNIX TruClusters are more scaleable

  33. Bottom Line - Industry Opinion TruCluster V5.0 "Nines are necessary, but not sufficient. Simple, straightforward use is also vital... Here Compaq has excelled, going the distance in building multi-system scalability, reliability, and manageability into the heart of UNIX." Jonathan Eunice, Illuminata, Inc., 4/99

  34. Thank You John Zimmerman johnz@digitask.com (212) 682-6652

More Related