260 likes | 404 Views
Metrics and deployment update. GridPP13. 4 th July 2005. Jeremy Coles J.Coles@rl.ac.uk. Overview. Deployment planning . General metrics – status within EGEE. GridPP performance. Deployment issues. Service challenges. Summary. Deployment plans (process). Deployment plans (issues).
E N D
Metrics and deployment update GridPP13 4th July 2005 Jeremy Coles J.Coles@rl.ac.uk
Overview Deployment planning General metrics – status within EGEE GridPP performance Deployment issues Service challenges Summary
Deployment plans (issues) • GridPP security challenges • UK network tests • Pre-production commitments • UKQCD (need to move to dCache servers) • Tier-2s delivering to MoUs • Non-HEP commitments • Update to deployment web-pages • Review of available documentation • Sysadmin training course II
The “gstat metric” Gstat metric = ((#ok sites)*10+(#info sites)*20+(#note sites)*30+(#warn sites)*40+(#error sites)*50+(#crit sites)*60) / (#sites – (#maint+#off))
Metrics – summary 1 • Average number of published job slots for the last quarter (2477) 2: Average number of job slots used for the last quarter (481) 3: Published storage at the end of the last quarter (64TB) 4: Average gstat service metric for the last quarter (19.8) 5: KSI2K nominally available to LCG at the end of the last quarter (1846 KSI2K) 6: Integrated KSI2K hours available to LCG in the last quarter 7: Disk storage space nominally available to LCG at the end of the last quarter (240 TB) 8: Tape storage space nominally available at the end of the last quarter (239 TB) 9: Disk storage usage by LCG at the end of the last quarter (16 TB) 10: Number of sites publishing accounting data at the end of the last quarter (13) 11: KSI2K hours of CPU processing delivered (per VO) over the last quarter 12: Storage used (per VO) over the last quarter
Metrics – summary 2 13: Number of supported VOs (10) 14: Number of users in supported VOs (other than dteam) at the end of the last quarter (812) 15: Average number of active users (of Tier-1) in supported VOs at the end of the last quarter (46) 16: Percentage of Site Functional Tests results that were passes “OK” over the last quarter (38%) 17: Number of trouble tickets raised against GridPP sites over the last quarter (TBC) 18: Number of sites upgrading in requested time period for last release (16) 19: Accumulated days of scheduled downtime for last quarter (418) 20: Average number of sites per quarter available in VO selections 21: Number of GridPP (site) system security incidents in the last quarter (3) 22: Number of EGEE Grid security incidents in the last quarter (0) 23: Average job success rate over the last quarter for LHC experiments (N/a) 24: GridPP contribution to experiment’s overall running for the last quarter (ALICE:ATLAS:CMS:LHCb; x1: x2%: x3%: x4%)
Current deployment issues Main GridPP concerns: • gLite migration • Fabric management & future of YAIM • SRMs and data migration – dCache/DPM • Security (improving practices and dealing with vulnerabilities) • Ganglia deployment (to provide an overall view of GridPP resources) • Use of ticketing system (support services) • Use of UK testzone • Increase usage of resources • Hold training course (advance on Sysadmin training in Oxford) General • Job success rates at sites – (nb. Freedom of Choice is coming!) • Support more EGEE VOs • GOCDB2
gLite and LCG2 components VOMS Catalogue and access control LFC RB gLite WLM FIREMAN myProxy BD-II BD-II APEL dgas Independent IS R-GMA R-GMA R-GMAs can be merged (security ON) UIs gLite-IO LCG gLite LCG CE SITE CEs use same batch system WNs gLite-CE FTS for LCG uses user proxy, gLite uses service cert FTS FTS shared LCG SRM-SE Data from LCG is owned by VO and role, gLite-IO service owns gLite data gLite
Summary Planning – some areas of concern Available data is improving allowing better monitoring of performance Resources at Tier-2s behind schedule but utilisation is not high Deployment process appears to be improving. 2-6-0 out soon Freedom of Choice & gLite migration starting Service Challenge 3 next week. Good progress. Testing setups now