70 likes | 90 Views
SuperB Nagios Monitoring. Alessandro Paolini INFN-CNAF. Monitoring. Sites status is monitored with NAGIOS: https://sb-serv01.cr.cnaf.infn.it/nagios/ Checks performed on lcg-CE , CREAM-CE and SRM tool provided by EGI project. Monitoring.
E N D
SuperBNagiosMonitoring Alessandro Paolini INFN-CNAF
Monitoring • Sites status ismonitoredwith NAGIOS: • https://sb-serv01.cr.cnaf.infn.it/nagios/ • Checksperformed on lcg-CE, CREAM-CE and SRM • tool provided by EGI project
Monitoring • The SuperBnagios server is hosted by INFN-T1 in the Italian NGI (NGI_IT) and the monitored sites are the ones included in this xml feed file: • http://bbr-serv09.cr.cnaf.infn.it:8080/nagiosvo/superb-sites-for-nagios.xml • The EGI Standard checksare executed: • Job submission (directto CREAM and through WMS), replicas test fromWNsto SRM and directly on SRM • Itwillbeaddedspecificsuperbvo.orgchecks • Whatismonitoredis the service, not the host • eachcheckisperfomedusing GRID stuff
Monitoring CREAM direct job submission WMS job submission Replicas test from WN SRM directchecks
Monitoring • The sites status isstored in anexternal DB • Everytime a service status changes, a nagiospluginupdatesthis DB • The information present in this DB are usedbySuperBFrameworktoprevent the production job submissiontoproblematicsites • Wiki: http://mailman.fe.infn.it/superbwiki/index.php/Distributed_Computing/VO_nagios_monitoring
VO Dashboard • ToolprovidedbyOperationsPortal Team • https://operations-portal.egi.eu/voDashboard?vo=superbvo.org • Access onlyforregistered people (severalrolesavailable) • Itreceivenotificationsbynagios • Itisdisplayedalarmsrelatedtofailingchecksfor the monitoredsites • Sites are groupedby NGI • Opening GGUS ticketsfornotworkingsites and problems follow-up