1 / 18

OpenSSI A Single System Image Linux Cluster Project Openssi Dr. Bruce J. Walker HP Fellow Keerthi Bhushan K N Senior Sy

OpenSSI A Single System Image Linux Cluster Project Openssi.org Dr. Bruce J. Walker HP Fellow Keerthi Bhushan K N Senior Systems Engineer, HP. Why Clusters, why not SMP ?. Performance Availability Price/performance Scaling. Types of Clusters. Availability. Scalability.

dean
Download Presentation

OpenSSI A Single System Image Linux Cluster Project Openssi Dr. Bruce J. Walker HP Fellow Keerthi Bhushan K N Senior Sy

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. OpenSSI A Single System Image Linux Cluster Project Openssi.org Dr. Bruce J. Walker HP Fellow Keerthi Bhushan K N Senior Systems Engineer, HP

  2. Why Clusters, why not SMP ? • Performance • Availability • Price/performance • Scaling

  3. Types of Clusters

  4. Availability Scalability Manageability Usability Properties of clusters Ideal/Perfect Cluster in all dimensions SMP HA Cluster OpenSSI Linux Cluster Project log scale HUGE ReallyBIG

  5. What is SSI ? • Cluster-wide view of files, ipc objects, processes, users, devices and services - single root • Strong Membership • Single management domain • Single external view of cluster • Consistent kernel on each node

  6. OpenSSI Cluster ProjectAvailability • Automatic failover/restart of services in the event of hardware or software failure • Automatic failover/restart of applications in the event of node failure • Automatic filesystem failover • Automatic “root” node failover being worked on

  7. OpenSSI Cluster Project Price / Performance Scalability • More usable memory, processors, networking, devices • More resources to share • Clones can co-ordinate easily • Load balancing of connections and processes • Can mix hardware • Distributed OS algorithms written to scale to hundreds of nodes (and successfully demonstrated to 133 blades and 27 Itanium SMP nodes)

  8. OpenSSI Linux Cluster -Manageability • Single Installation • Online addition of new nodes • Standard standalone system tools for cluster administration • Cluster-wide resource visibility • New cluster-specific commands • Linux commands extended for cluster

  9. OpenSSI Linux Cluster - Ease of Use • Can run anything anywhere with no setup • Install applications only once • Service failover/restart is trivial • Automatic or manual load balancing • libcluster.a - cluster APIs

  10. Uniprocessor or SMP node Uniprocessor or SMP node Users, applications, and systems management Users, applications, and systems management Standard OS kernel calls Standard OS kernel calls Extensions Extensions Standard Linux 2.4 kernelwith SSI hooks Standard Linux 2.4 kernelwith SSI hooks Modular kernel extensions Modular kernel extensions Devices Devices IP-based interconnect How Does SSI Clustering Work? Other nodes

  11. Component Contributions to OpenSSI Cluster Project Lustre Appl. Avail. CLMS OGFS Mosix Beowulf Vproc DLM LVS OCFS IPC DRBD CFS ICS EVMS OpenSSI Cluster Project Load Leveling HP contributed Open source and integrated To be integrated

  12. OpenSSI components - communication CLMS, ICS • Cluster Management subsystem (CLMS) • "Node-down" detections and processing • Membership management • Internodes Communication Subsystem (ICS) • Kernel-to-kernel transport • Provides service daemons and queues for all subsystems • RPC, request/response and messaging interfaces

  13. OpenSSI components - file systemCFS, OpenGFS, Lustre • CFS layered over physical file system (ext2, ext3, xfs, optionally on volume mgmt.) • Single “/” • Cluster-wide mounts • CFS layered over tmpfs • OpenGFS and Lustre support also available

  14. OpenSSI components - process subsystemVproc • Single pid space but allocate locally • Transparent access to all processes on all nodes • Processes can migrate during execution • /proc/<pid>/goto and migrate system call • rfork, rexec, onnode, onall, fastnode commands • process part of /proc is systemwide (so ps & debuggers “just work” systemwide) • Implemented via a virtual process (Vproc) architecture • Work in progress on providing “view” for a process

  15. OpenSSI components - load levellingOpenMOSIX, LVS • Connection load levelling (LVS) • Process load levelling (OpenMOSIX algorithms) • “migrate” command • /proc/cluster/loadlevellist • Decentralized load-balancing decisions

  16. OpenSSI components - ipc • ipc objects (pipes, fifo's and ptys inclusive) are locally created • Cluster-wide ipc namespace (nameserver) • Cluster-wide ipc object access

  17. Component information • LVS - Linux Virtual Server http://www.LinuxVirtualServer.org • DLM - Distributed Lock Manager http://sourceforge.net/projects/opendlm • Beowulf / MPICH / LAMPI / PBS / MUAI/… http://www.beowulf.org • DRBD - Distributed Replicated Block Device http://drbd.cubit.at • EVMS - Enterprise Volume Management System http://sourceforge.net/projects/evms/

  18. Conclusions • OpenSSI provides a common cluster framework for all forms of clustering • OpenSSI simultaneously addresses availability, scalability, manageability and usability • OpenSSI is usable in production now Come see a demo in the HP booth

More Related