1 / 31

VCS Building Blocks

VCS Building Blocks. Topic 1: Cluster Terminology. After completing this topic, you will be able to define clustering terminology. A Nonclustered Computing Environment. Definition of a Cluster.

freya-house
Download Presentation

VCS Building Blocks

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. VCS Building Blocks

  2. Topic 1: Cluster Terminology After completing this topic, you will be able to define clustering terminology.

  3. A Nonclustered Computing Environment

  4. Definition of a Cluster A cluster is a collection of multiple independent systems working together under a management framework for increased service availability. Application Node Storage Cluster Interconnect

  5. Definition of VERITAS Cluster Server and Failover VCS detects faults and performs automated failover. Application Node Failed Node Storage Cluster Interconnect

  6. Definition of an Application Service • An application service is a collection • of all the hardware and software • components required to provide a • service. • If the service must be migrated to another system, all components need to be moved in an orderly fashion. • Examples include Web servers, databases, and applications.

  7. Definition of a Service Group • A service group is a virtual container • that enables VCS to manage an • application service as a unit. • All components required to provide the service, and the relationships between these components, are defined within the service group. • A service group has attributes that define its behavior, such as where it can start and run.

  8. Service Group Types • Failover: • The service group can be online on only one cluster system at a time. • VCS migrates the service group at the administrator’s request and in response to faults. • Parallel • The service group can be online on multiple cluster systems simultaneously. • An example is Oracle Real Application Cluster (RAC). • Hybrid This is a special-purpose type of service group used to manage service groups in replicated data clusters (RDCs). RDCs use replication between systems at different sites instead of shared storage.

  9. Definition of a Resource Resources are VCS objects that correspond to the hardware or software components of an application service. • Each resource must have a unique name throughout the cluster. Choosing names that reflect the service group name makes it easy to identify all resources in that group, for example, WebIP in the WebSG group. • Resources are always contained within service groups. • Resource categories include: • Persistent • None (NIC) • On-only (NFS) • Nonpersistent On-off (Mount)

  10. Resource Dependencies Resources in a service group have a defined dependency relationship, which determines the online and offline order of the resource. • A parent resource dependson a child resource. • There is no limit to the number of parent and child resources. • Persistent resources, such as NIC, cannot be parent resources. • Dependencies cannot be cyclical. Parent Parent/child Child

  11. Resource Attributes • Resource attributes define • an individual resource. • The attribute values are used by VCS to manage the resource. • Resources can have required and optional attributes, as specified by the resource type definition. WebMount resource Solaris mount –F vxfs /dev/vx/dsk/WebDG/WebVol /Web

  12. Resource Types • Resources are classified • by type. • The resource type specifies the attributes needed to define a resource of that type. • For example, a Mount resource has different properties than an IP resource. Solaris mount [-F FSType] [options] block_devicemount_point

  13. 10.1.2.3 WebVol logVol WebDG eri0 Disk Group Volume Mount IP monitor NIC clean online offline Agents: How VCS Controls Resources Each resource type has a corresponding agent process that manages all resources of that type. • Agents have one or more entry points that perform a set of actions on resources. • Each system runs one agent for each active resource type. /web /log

  14. Topic 2: Cluster Communication After completing this topic, you will be able to describe cluster communication mechanisms.

  15. The cluster interconnect serves to: Determine which systems are members of the cluster using a heartbeat mechanism. Maintain a single view of the status of the cluster configuration on all systems in the cluster membership. Cluster Communication A cluster interconnect provides a communication channel between cluster nodes.

  16. LLT Low-Latency Transport (LLT) LLT: • Is responsible for sending heartbeat messages • Transports cluster communication traffic to every active system • Balances traffic load across multiple network links • Maintains the communication link state • Is a nonroutable protocol • Runs on an Ethernet network LLT

  17. GAB: Performs two functions: Manages cluster membership; referred to as GAB membership Sends and receives atomic broadcasts of configuration information Is a proprietary broadcast protocol Uses LLT as its transport mechanism GAB LLT LLT Group Membership Services/Atomic Broadcast (GAB) GAB LLT

  18. Fencing: Monitors GAB to detect cluster membership changes Ensures a single view of cluster membership Prevents multiple nodes from accessing the same Volume Manager 4.x shared storage devices GAB Fence LLT GAB LLT The Fencing Driver Reboot Fence

  19. The VCS engine, the high availability daemon: Runs on each system in the cluster Maintains configuration and state information for all cluster resources Manages all agents The hashadow daemon monitors HAD. Fence GAB LLT The High Availability Daemon (HAD) HAD hashadow

  20. GAB LLT Comparing VCS Communication Protocols and TCP/IP HAD User Processes iPlanet hashadow TCP Kernel Processes IP NIC Hardware NIC

  21. Topic 3: Maintaining the Cluster Configuration After completing this topic, you will be able to describe how the cluster maintains the configuration.

  22. main.cf HAD HAD hashadow Maintaining the Cluster Configuration • HAD maintains a replica of the cluster configuration in memory on each system. • Changes to the configuration are broadcast to HAD on all systems simultaneously by way of GAB using LLT. • The configuration is preserved on disk in the main.cf file. hashadow

  23. main.cf VCS Configuration Files include "types.cf" cluster vcs ( UserNames = { admin = ElmElgLimHmmKumGlj } Administrators = { admin } CounterInterval = 5 ) system S1 ( ) system S2 ( ) group WebSG ( SystemList = { S1 = 0, S2 = 1 } ) Mount WebMount ( MountPoint = "/web" BlockDevice = "/dev/vx/dsk/WebDG/WebVol" FSType = vxfs FsckOpt = "-y" ) • A simple text file is used to store the cluster configuration on disk. • The file contents are described in detail later in the course.

  24. Topic 4: VCS Architecture After completing this topic, you will be able to describe the VCS architecture.

  25. VCS Architecture • Agents monitor resources on each system and provide status to HAD on the local system. • HAD on each system sends status information to GAB. • GAB broadcasts configuration information to all cluster members. • LLT transports all cluster communications to all cluster nodes. • HAD on each node takes corrective action, such as failover, when necessary.

  26. Topic 5: Supported Failover Configurations After completing this topic, you will be able to describe the failover configurations supported by VCS.

  27. Active/Passive Before Failover After Failover

  28. Active/Passive N-to-1 Before Failover After Failover

  29. Active/Passive N + 1 Standby Server After Failover Before Failover Standby Server After Repair

  30. Active/Active Before Failover After Failover

  31. N-to-N Before Failover After Failover

More Related