1 / 7

So Now What?

So Now What?. Where to next? WLCG Service Reliability Workshop November 26-30 2007, CERN. Where are we?. We have the lists of prioritized requirements from all LHC experiments Some work is required to consolidate these: Only 1(?) clear mismatch of service level between VOs

Download Presentation

So Now What?

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. So Now What? Where to next? WLCG Service Reliability Workshop November 26-30 2007, CERN

  2. Where are we? • We have the lists of prioritized requirements from all LHC experiments • Some work is required to consolidate these: • Only 1(?) clear mismatch of service level between VOs • But different definition of service level requirements • Critical; Super-critical; Extra-super critical (2h – 30’ max interuption) • DOWN; SERIOUSLY DEGRADED; PERTURBED is probably OK • Need to be realistic about ‘background’ of problems that cannot be avoided • Can only achieve highest level of service with • resources • WORK • “BEST PRACTICES”

  3. Definitions of “Critical” • Quite significant differences in list of services under each heading: • ATLAS: only2 services are in top category (ATONR, DDM central services) • CMS: (long) contains also numerous IT services (incl. phones, kerberos, …) • LHC: CERN LFC, VO boxes, VOMS proxy service • ALICE: CERN VO box, CASTOR + xrootd@T0 (?)

  4. Meanwhile, back on planet Earth… • Measured service / site reliability is WAY BELOW what is requested • There may well be bugs in reporting / measuring, but this giant mis-match has to be addressed! • Target: specific areas, coordinators, actions & timeline • e.g. Service dash for experiment services; • LFC dash status (one of the good ones!) • VOLUNTEERS! • WLCG Collaboration workshop in April 2008 • “Measured improvement in service quality” • CHEP March 2009 • “Solved”

  5. The 2008 International workshop on Resiliency in High Performance Computing (Resilience 2008) http://xcr.cenit.latech.edu/resilience2008/ • In conjunction with the 8th IEEE Intentional Symposium on Cluster Computing and Grid (CCGRID 2008), May 18-22, 2008, Lyon, France. • Important Dates: • Paper Submission Deadline: December 1, 2007

More Related