190 likes | 318 Views
Gangalia. Distributed monitoring system. Why Monitor?. Requirements Manage many computers Spot trends in the system Increase Performance Identify problems Applications. Monitoring Grids. Grid consists of Nodes ( A single machine) Clusters (Collection of Nodes)
E N D
Gangalia Distributed monitoring system
Why Monitor? • Requirements • Manage many computers • Spot trends in the system • Increase Performance • Identify problems • Applications
Monitoring Grids • Grid consists of • Nodes ( A single machine) • Clusters (Collection of Nodes) • Grids (Collection of Clusters) • General Objective of Grid: To perform high performance computing. • Solution: Monitor at levels.
Monitoring Nodes • Nodes • A terminal with single/multiple processors. • Factors to monitor • Temperatures • CPU/Memory Usage • Disk space • Network Activity • Jobs • Provide vital statistics of each node.
Monitoring Clusters & Grids • Clusters & Grids • Collection of Nodes • Factors to monitor • Load • Processing power • Uptime • Availability • Provides performance statistics
Ganglia • Gangalia: is distributed monitoring system. • Based on a hierarchical structure • Lightweight :- low overhead and high concurrency. • Prominent Features:- • Visualization using graphs • Selective statistics
Gangalia: Architecture [IBM (2008), 'Perormance Monitoring using Ganglia', IBM Manual - Wiki.]
Gangalia: Gmond • Lightweight service • Records and sends data via XDR • CPU statistics • Memory statistics • Network statistics • Job statistics • Uses XML over TCP
Gangalia: Gmtead • Lightweight service • Receives and sends data obtained from • Gmond • Gmtead • Saves data on disk using RRD (round robin database) • Supports multiple creation of monitoring domains • Reason: Gangalia is very scalable
Gangalia: Web Server & GUI Tools • GUI Tools • PHP scripts which extract data from Gangalia • Generates visualization using graphs. • Web Server • Apache + PHP support to hosts and execute scripts • SSL and XML support is required.
Gangalia: gstat • gstat • Command line tool to extract gmond for information. • Syntax: $gstat --help Usage: gstat [OPTIONS]... -h --help Print help and exit -V --version Print version and exit -a --all List all hosts. Not just hosts running gexec (default=off) -d --dead Print only the hosts which are dead (default=off) -m --mpifile Print a load-balanced mpifile (default=off) -1 --single_line Print host and information all on one line (default=off) -l --list Print ONLY the host list (default=off) -n --numeric Print numeric addresses instead of hostnames (default=off) -iSTRING --gmond_ip=STRING Specify the ip address of the gmond to query (default='127.0.0.1') -pINT --gmond_port=INT Specify the gmond port to query (default=8649)
[IBM (2008), 'Performance Monitoring using Ganglia', IBM Manual - Wiki.]
Prerequisites • Finalize your IP • Finalize you Domain Name • Finalize your time zone • Update the time zone of the machine using NTP • Download following packages • Gangalia [http://sourceforge.net/project/showfiles.php?group_id=43021] • PHP [http://www.php.net/downloads.php] • Apache [http://www.apache.org/dyn/closer.cgi] • rrdtools[http://oss.oetiker.ch/rrdtool/download.en.html]
Steps in Installing Gangalia • Map Monitoring Domains • Choose Central Nodes from Domains • Install gmond on Nodes • Install gmtead on Central Nodes • Install Web Server on Central Nodes