
Extend GGUS ALARMs to More Tier0 Services (TrackTools TF)

WLCG Operations Coordination, CERN, indico confId=215003, 2012/11/01. Extend GGUS ALARMs to More Tier0 Services (TrackTools TF). Maria Dimou (with input from Maite Barroso), CERN.





Presentation Transcript

  1. WLCG Operations Coordination, CERN, indico confId=215003, 2012/11/01. Extend GGUS ALARMs to More Tier0 Services (TrackTools TF). Maria Dimou (with input from Maite Barroso), CERN

  2. Which “More Tier0 Services”
     • ATLAS considers the Impact of twiki and email/e-groups service interruption high. Namely:
       • Twiki:
         • An incident produces “damage” in 2 hrs (Urgency: 7).
         • The whole VO is affected. Most ops services stop (Impact: 10).
       • Email/E-groups:
         • An incident produces “damage” in 1 hr (Urgency: 8).
         • Some ops services stop (Impact: 9).
     Source: 2012/03/20 WLCG MB Presentation (slides 8, 9, 10, 15) with values confirmed by ATLAS Deputy Computing Coordinator on 2012/09/17.
     NB! Terms ‘Urgency’ & ‘Impact’ are NOT related to the expected response by the supporters.
     Maria Dimou - CERN / WLCG - TrackTools coordinator

  3. What ATLAS requires
     • The experts (3rd Line Support) must receive instant notification as soon as the ticket is created.
     • The experiment leaves incident investigation Outside Working Hours (OWH) to the experts’ discretion.

  4. A workflow that works
     • GGUS ALARMs are automatically interfaced to SNOW by design. They reach the experts through immediate email/phone notifications from GGUS and an email from SNOW.
     • So for critical services, where immediate attention is needed and justified, GGUS ALARM is the ticket type that the experiment should open.

  5. Background and Conclusions
     • This is an action from the 2012/10/01 ATLAS-IT meeting.
     • This proposal will be presented at the ATLAS-IT meeting of 2012/11/05.
     • Before any technical action, a management decision must be taken on adding new services to the ALARM workflow. Is the WLCG MB the right body?
     • Comments/suggestions?
     Thank you for your attention!
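
The workflow on slide 4 can be pictured as a small routing rule. The Python below is only an illustrative sketch, not GGUS or SNOW code: the names `TicketType`, `Service`, and `notifications_for` are hypothetical, and the only facts taken from the slides are the Urgency/Impact values and the three notification channels for ALARM tickets. In line with the NB on slide 2, Urgency and Impact are stored as data but do not influence routing.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class TicketType(Enum):
    """Hypothetical ticket categories; only ALARM is described in the slides."""
    ALARM = "ALARM"
    OTHER = "OTHER"


@dataclass(frozen=True)
class Service:
    """A candidate Tier0 service with the ATLAS-reported values from slide 2."""
    name: str
    urgency: int
    impact: int


# Values as quoted on slide 2.
TWIKI = Service("Twiki", urgency=7, impact=10)
EGROUPS = Service("Email/E-groups", urgency=8, impact=9)


def notifications_for(ticket_type: TicketType, service: Service) -> List[str]:
    """Return the immediate expert notifications modelled for a new ticket.

    Routing depends on the ticket type only. Urgency and Impact are
    deliberately ignored, since the slides state they are not related
    to the expected response by the supporters.
    """
    if ticket_type is TicketType.ALARM:
        return [
            f"GGUS email to {service.name} experts",
            f"GGUS phone notification to {service.name} experts",
            f"SNOW email to {service.name} experts",
        ]
    # Assumption: other ticket types trigger no immediate expert notification.
    return []


if __name__ == "__main__":
    for line in notifications_for(TicketType.ALARM, TWIKI):
        print(line)
```

Under this sketch, adding a service to the ALARM workflow amounts to allowing ALARM tickets for it, which is the management decision slide 5 asks about.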
