ADCOS Summary
Wahid Bhimji
Overview
• All senior shifts covered – thanks to shifters
• No trainee shifters
CENTRAL SERVICES:
• Wed-Thu: FTS3 (ggus:106095, closed)
• Saturday ~4:00-11:30: GGUS (elog:49792, ok now)
• Today: Site services (elog:49841, 49836)
Daily issues (a few of importance or interest)
Wednesday June 11th
• elog:49728 BNL “ddm: Too many attempts”
  • ADCSUPPORT-3724 Mover changed – closed
• elog:49739 RAL disk server
  • ggus:106090 server recovered – ggus closed
Thursday June 12th
• elog:49751 UKI-SOUTHGRID-OX-HEP disk server
  • ggus:106114 – fixed
Daily issues cont.
Thursday June 12th
• Taiwan-LCG2 stage-out errors
  • ggus:106153 – permissions on directory – fixed (also ggus:106190 (Friday) – possibly related to SRM load)
Friday June 13th
• BNL transfer issues elog:49778 (namespace server hardware problem, resolved quickly)
Daily issues
Saturday June 14th
• elog:49797 UKI-NORTHGRID-LIV-HEP disk server firewall, ggus:106196 – fixed quickly
Sunday June 15th
• UKI-LT2-IC-HEP – cvmfs failures – died down but ticket open
• NDGF-T1 – out of disk space for data staging – prod job failures: ggus:106027
Daily issues – today
Tuesday June 17th
• Very few transfers in DDM
• Assigned jobs increasing in many clouds (elog:49841), and jobs failing in NL and DE with “Could not add files to DDM:” (elog:49842)
• Various site services not available, e.g.
  • https://sls.cern.ch/sls/service.php?id=atlas-SS07
  • https://sls.cern.ch/sls/service.php?id=atlas-SS09
• Some (like the FZK one) – now green…
GGUS current
Per-ticket annotations (who it's sitting with and, if the site, whether any action is being taken), in slide order:
• Us
• Site – no action
• Site – action
• Site – action
• Site – action
• Site – no action
• Us – help?
• Site – action
• Site or us?
• Site or us?
• Site – no action
Jira: created v. resolved
Comment: Shifters quite often open duplicates (maybe not as obvious as it could be)