70 likes | 90 Views
SuperBNagios provides comprehensive monitoring of sites through checks on job submission, replicas, and service statuses, aiding in problem prevention and resolution. Utilize the VO Dashboard tool for effective oversight and management.
E N D
SuperBNagiosMonitoring Alessandro Paolini INFN-CNAF
Monitoring • Sites status ismonitoredwith NAGIOS: • https://sb-serv01.cr.cnaf.infn.it/nagios/ • Checksperformed on lcg-CE, CREAM-CE and SRM • tool provided by EGI project
Monitoring • The SuperBnagios server is hosted by INFN-T1 in the Italian NGI (NGI_IT) and the monitored sites are the ones included in this xml feed file: • http://bbr-serv09.cr.cnaf.infn.it:8080/nagiosvo/superb-sites-for-nagios.xml • The EGI Standard checksare executed: • Job submission (directto CREAM and through WMS), replicas test fromWNsto SRM and directly on SRM • Itwillbeaddedspecificsuperbvo.orgchecks • Whatismonitoredis the service, not the host • eachcheckisperfomedusing GRID stuff
Monitoring CREAM direct job submission WMS job submission Replicas test from WN SRM directchecks
Monitoring • The sites status isstored in anexternal DB • Everytime a service status changes, a nagiospluginupdatesthis DB • The information present in this DB are usedbySuperBFrameworktoprevent the production job submissiontoproblematicsites • Wiki: http://mailman.fe.infn.it/superbwiki/index.php/Distributed_Computing/VO_nagios_monitoring
VO Dashboard • ToolprovidedbyOperationsPortal Team • https://operations-portal.egi.eu/voDashboard?vo=superbvo.org • Access onlyforregistered people (severalrolesavailable) • Itreceivenotificationsbynagios • Itisdisplayedalarmsrelatedtofailingchecksfor the monitoredsites • Sites are groupedby NGI • Opening GGUS ticketsfornotworkingsites and problems follow-up