250 likes | 262 Views
Le point de vue des Utilisateurs. LP Questionnaire Utilisateurs Base: DAST ATLAS & CAF2010 questionnaires, et inputs du CAF Circulé d’abord à CAF+PAF pour commentaires, puis à ATLAS-France-L 39+1 réponses reçues MERCI ! Réponses non-nominatives. Structure du questionnaire. Users profile
E N D
Le point de vue des Utilisateurs LP Questionnaire Utilisateurs Base: DAST ATLAS & CAF2010 questionnaires, et inputs du CAF Circulé d’abord à CAF+PAF pour commentaires, puis à ATLAS-France-L 39+1 réponses reçues MERCI ! Réponses non-nominatives LP
Structure du questionnaire • Users profile • Analysis strategy • Grid • Input, submiqqion, running, output • Final stage analysis • Support • Summary LP
Participants profile (Type, Lab, Group) • Labs & Activities well represented • Mostly standard Athena users LP
Interaction with Computing Interaction with CAF SC shifts Too WEAK interaction with ATLAs computing S&C weeks reports ICB reports CAF reports LP
Analysis type Analysis strategy (1) Input Batch-Grid Input Batch-NonGrid • Grid, non-Grid, Interactive • Input: DPD& Private > AOD, ESD • Batch : Lyon & Grid Grid Jobs submission Batch Analysis submission LP
Analysis strategy (2) Physics: Specific packages? Performance: Specific packages? Packages: PhotonD3PDMaker, MuonAlignTrk/ASAP, SUSYTools DA tool Own framework ? Yes (28%) For input DS acccess (dq2, prun, pbook) TAG & ELSSI not used:not known, not needed, cumbersome ELSSI) • Specific packages used • Tools: pAthena, prun, pbook (func. missing • TAGS& ELSSI not used LP
Analysis strategy (3) Slimming Thinning Skimming Filtering Custom • All input filtered • Mostly skimming, slimming LP
Grid: Retrieving Input AMI dq2-ls Twiki Colleagues Satisfaction • dq2-ls, AMI, twiki • Satisfactory LP
Grid: Jobs testing No test With a few files Tests done 1st locally, interactive,with few files Locally before In batch before LP
Grid: Jobs submission (1) # files in DS used Ouput file size Splitting strategy Exclude sites Specify sites • Rarely sites excluded (lcg-infosites) • Rarely sited favored (Lyon, GRIF, BNL): Grid reliable LP
Grid: Jobs submission (2) Waiting time before running Jobs fraction to resubmit Resubmission frequency • Waiting time: 1-24h • Resubmission for 10-20% jobs LP
Grid: Jobs submission (3) Submission options Ease of use Jobs Monitoring Jobs splitting Time to submit jobs Jobs submission: Satisfactory LP
Grid: Jobs Running Grid responsiveness Monitoring Jobs bookkeeping Jobs running: Satisfactory Grid reliability LP
Grid: Output handling (1) Output format Output handling Create DS to ease up transfers Output storage • Mostly D3PD, NTUP • Retrieve via dq2-get, DATRI • Storage: /sps, local, LOCALGROUPDISK LP
Grid: Output handling (2) Problems: Slow, unstable, heavy, to be tried many times dq2-get • Problems with dq2-get & DATRI • For dq2-get, ATLAS wide issues Problems: Not known, slow, transfer monitoring knowledge DATRI LP
Final stage analysis (1) Framework Intermediate step? No-Athena : Too heavy for the need, slow, weak backward compatibility between releases Other groups D3PD use Interactive analysis • Intermediate step between 1st & final • Mostly ROOT framework • GROUP D3PD well re-used • Interactive analysis at Lyon, Labo, PC • (no PROOF) LP
Final stage analysis (2): Storage With /sps group space With /sps user space With GROUPDISKs location With GROUPDISKs size Overall : Satisfactory With LOCALGROUPDISK size LP
Final stage analysis: PROOF & CHIRP PROOF usage at LYON PROOF Regional Regional: CPPM, LPNHE (8 CPUs) CHIRP knowledge CHIRP usage • PROOF not used • CHIRP not known CHIRP benefits CHIRP tutorial LP
Support & Overall Satisfaction (1) Actions when problems Learning how to use Grid • Support from twiki, colleagues, DAST • Support at Lyon & Sites: OK Satisfaction with LYON Satisfaction with your SITE LP
Support & Overall Satisfaction (2) Support from colleagues Support from online tutos & Doc Response time from Grid mailing lists Solution from Grid mailing lists Over all support from docs, colleagues, Grid mailing lists is OK Mailing lists content & searchability LP
Summary: Grid analysis criticality Locating input Processing time Retrieving output • To be improved: • Processing time • Ability to retrieve outputs LP
Summary: Final analysis criticality Accessing data Processing power • Most Critical: Access to data • Interactive: not an issue Storage Interactive analysis possibility LP
Summary: User satisfaction (1) Ability to find data Jobs submission on Grid Globally, Users are satisfied Grid reliability Grid time response LP
Summary: User satisfaction (2) Jobs monitoring Retrieving outputs Globally, Users are satisfied Ability to get the work done User support LP
Finally • Major Limitations: • Evolving Athena versions • Lyon Downtimes • CPU & storage on local batch (CC & labs) • Skim/slim too long on Gris on big DS • Config/SW non-uniform among sites Need for more info/tutorials • To be improved: • Athena stability & Doc • Retrieval data/job outputs from grid • Failed grid jobs monitoring and possibility to re-launch them • Tutorials & Twikis • Libraries available for C++ coding • Extra: • Positive feedback on user satisfaction • Big improvement in Lyon stability • Panda monitor slow, History not long enough LP