IT-PES Monitoring update


Presentation Transcript


  1. IT-PES Monitoring update Maite Barroso

  2. Summary of wishes
     • Re-use existing code (Nagios probes) and integrate with what we’ve got (Lemon/LAS)
     • Stream the alarms (type, severity)
     • Make it easier to do simple correlations
     • Move from the concept of “clusters” to “services”
     • Support easy verification of automated operations
     • Email is not an acceptable substitute for monitoring
     • Collaborate on a lego-set for service managers needing to do service-specific analytics
        • But not a monster tool
     • Make use of standard open-source tools
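
The wishes to stream alarms with a type and severity, correlate them simply, and think in "services" rather than "clusters" can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the `Alarm` fields, the severity names and their ordering, the JSON-lines encoding, and the sample hosts are all hypothetical and do not describe the actual Lemon/LAS or Nagios data model.

```python
import json
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class Alarm:
    """Hypothetical alarm record; the field names are illustrative only."""
    host: str
    service: str   # the service the host belongs to, not its cluster
    type: str
    severity: str


# Assumed severity ordering, lowest to highest.
SEVERITY_RANK = {"info": 0, "warning": 1, "critical": 2}


def stream_alarms(alarms):
    """Yield each alarm as one JSON line, as a stand-in for an alarm stream."""
    for alarm in alarms:
        yield json.dumps(asdict(alarm), sort_keys=True)


def worst_severity_by_service(lines):
    """Simple correlation: keep the highest severity seen for each service."""
    worst = {}
    for line in lines:
        record = json.loads(line)
        service, severity = record["service"], record["severity"]
        current = worst.get(service)
        if current is None or SEVERITY_RANK[severity] > SEVERITY_RANK[current]:
            worst[service] = severity
    return worst


# Made-up sample data, not taken from any real system.
SAMPLE_ALARMS = [
    Alarm("node01", "batch", "disk_full", "warning"),
    Alarm("node02", "batch", "daemon_down", "critical"),
    Alarm("node03", "myproxy", "cert_expiry", "warning"),
]

if __name__ == "__main__":
    print(worst_severity_by_service(stream_alarms(SAMPLE_ALARMS)))
```

Keeping the consumer as a plain function over a stream of lines is one way to stay close to the "lego-set, not a monster tool" idea: a service manager could swap in a different correlation without touching the producer.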

  3. Progress
     • Nagios to SLS gateway, already used in production (MyProxy service)
     • Batch monitoring:
        • Main need: analytics; OK with existing probing of individual machines
        • Distributed database, Cassandra, to store batch monitoring data, and additional software modules for supporting the special service needs
     • ActiveMQ
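
As a rough illustration of what a Nagios-to-status gateway has to do, the Python sketch below translates one probe result into a service-level record. The exit codes (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN) and the `|` separator between human-readable text and performance data follow the standard Nagios plugin conventions. The availability percentages, the record layout, and the function name `probe_to_status` are assumptions for illustration, not the format used by the production gateway or by SLS.

```python
# Standard Nagios plugin exit codes.
NAGIOS_STATES = {0: "OK", 1: "WARNING", 2: "CRITICAL", 3: "UNKNOWN"}

# Assumed mapping from probe state to an availability percentage.
# These numbers are illustrative only.
ASSUMED_AVAILABILITY = {"OK": 100, "WARNING": 75, "CRITICAL": 0}


def probe_to_status(service, exit_code, plugin_output):
    """Convert one probe result into a hypothetical service status record."""
    state = NAGIOS_STATES.get(exit_code, "UNKNOWN")
    return {
        "service": service,
        "state": state,
        # None when the probe state is UNKNOWN: no availability is claimed.
        "availability": ASSUMED_AVAILABILITY.get(state),
        # Text before "|" is the human-readable part; the rest is perfdata.
        "detail": plugin_output.split("|", 1)[0].strip(),
    }


if __name__ == "__main__":
    print(probe_to_status("myproxy", 0, "OK - service responding | time=0.12s"))
```

Because the function only consumes an exit code and an output string, existing probes could be re-used unchanged, in line with the first wish on the previous slide.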
