130 likes | 256 Views
ATLAS on Grid3/OSG. R. Gardner December 16, 2004. ATLAS Applications. Pythia Generation Geant4 simulation Pileup Digitization Reconstruction. ATLAS Users. DC2 production team Managed production High priority 7 users User production Opportunistic production and reconstruction
E N D
ATLAS on Grid3/OSG R. Gardner December 16, 2004
ATLAS Applications • Pythia Generation • Geant4 simulation • Pileup • Digitization • Reconstruction
ATLAS Users • DC2 production team • Managed production • High priority • 7 users • User production • Opportunistic production and reconstruction • 3 users • growing
Production statistics on Grid3 (End of November 2004) Overall “success” rate: 74% Through September: 66% During last 2 months: finished: 53163 failed:14353 success rate: 78%. We improved our results since (September) Only 2-3 submit-clients now (10-20 in September ) ATLAS DC2 on Grid3
Job Success Rate on GRID3 • Key factors in improved success rate: • Experienced team using common submit hosts • Quicker response to large scale site/network/hardware failures • Can we improve more? • Some shifts >95% success, others <50% • Automatic throttle for failures? But still lose all running jobs • Do we care? K. De + improvements to Capone/GCE
ATLAS ProdDB
Status of GRID3 Jobs To Do – extra A9 simulation, some digitization and some B1 pile-up Note – also waiting for some B3 and B4 input evgen files from LCG K. De
ATLAS historical use ACDC archive
ATLAS Jobs by site ACDC archive
Grid3OSG Resource Availability • ATLAS expects to be running continuous production starting now throughout 2005 • This activity consists of: • Completion of DC2 • Production for the Rome physics workshop in June • User production via Capone clients • Distributed analysis via ADA • Expect trend towards resource saturation to continue as more users are equipped with job submission tools
Some OSG Issues • Managed storage is now the biggest problem facing continued DC2 production • for both access and space management • Authorization • role based, access rights, queue priorities • policy infrastructure, publication • Accounting service • user-level what resources have been used • cpu, storage over an arbitrary time period • Operations – extend operations protocol between BNL Tier1 and iGOC/OSG operations activity