60 likes | 184 Views
Version 1.0 (meeting edition) 28 May 2009 Rob Kennedy and Adam Lyon Attending: RDK, …. D0 Grid Data Production Initiative: Coordination Mtg. Overview. News and Summary Close-out Prep Meetings D0 CAB: 5/22. This was somewhat brief, but very positive.
E N D
Version 1.0 (meeting edition) 28 May 2009 Rob Kennedy and Adam Lyon Attending: RDK, … D0 Grid Data Production Initiative:Coordination Mtg D0 Grid Data Production
D0 Grid Data Production Overview • News and Summary • Close-out Prep Meetings • D0 CAB: 5/22. This was somewhat brief, but very positive. • Initiative: 6/04. Close-out with Lessons Learned, etc. • Coordination Meetings • Re-located in WH9SE Libra • Remaining meetings: 5/24 (today), 6/04 • Agenda • News • SAMGrid and Condor Upgrades • AOB, Action Items
D0 Grid Data Production Topics Remaining to Cover • 5/14: CAB Configuration First Priority = resource string passing (M Mengel working on it) • Optimize use of CAB resources, beyond just d0farm and CAB2. (providers request) • Retain Turn-around/Response Time for Analysis (customers/users request) • Simplify Production Coordination, Improve Processing Flexibility (all request) • How to Proceed from Here? First, resolve resource string passing, then decide. • 5/21: Monitoring Organize request(s) to fill out and maintain the monitoring plots. • Assess what we all have now, where our gaps are, what would be most cost-effective to address • See Gabriele’s white paper on D0 grid job tracking (includes monitoring, focus on OSG). (CD DocDB 3129) • May also reference Ruth’s look into Monitoring which produced an inventory (CD DocDB 3106) • How to Proceed from Here? Action Item list created. • 5/28: Condor Releases, SAMGrid Upgrade THIS WEEK • Deferred Task (due to All-CAB2 Processing) from Initiative: Release new SAMGrid with added state feature • Upgrade production release of Condor with fixes – modify Condor/VDT upgrade procedure? • How to Proceed from Here? • AOB: Transition of samgrid.fnal.gov support from FGS to FEF; Action Items • 6/04: Close-Out • Lessons Learned, Close-out Festivities Plan
D0 Grid Data Production SAMGrid, Condor Upgrades • New SAMGrid version with added state feature – can proceed to release now • Reminder of what the added state feature entails. Status? • Upgrade of Condor – modify Condor/VDT upgrade procedure to allow Condor-only upgrades? • Now: New Condor released in VDT packaging after SAMGrid dev testing. • Critique: SAMGrid Dev testing tasks become bottleneck, significant effort and risk to VDT upgrades leads to very infrequent VDT (Condor) upgrades. • Some extra effort now to enable for less effort later, and more frequent Condor upgrades • 1. Condor layered on top of VDT – change to deployment configurations. • 2. SAMGrid sanity check procedure runnable by REX to test new SAMGrid/Condor combination to reduce SAMGrid developer involvement. Effort to formalize this. • 3. Practice “new release” with old software to validate procedure at some level. • How to Proceed from Here? • How to agree to the configuration/procedure? • Approximate time table for deployment?
D0 Grid Data Production AOB, Action Items • Transition of samgrid.fnal.gov support from FGS to FEF • Agreement to proceed by Jason and Glenn. • FEF willing to take on machine without doing a full re-install, but would like to poke around before signing off on the transition. Machine is 2.5 years old now, so may not be an issue anyway in 6 months. • Next Step: FGS to contact Jason,Glenn when ready to arrange root access so they can assess config. • Monitoring • Rob to talk to Jason about formalizing a change request for monitoring of PBS/CAB per meeting discussion • Jason and Glenn agree that collecting monitoring requests together into a somewhat formal change request makes sense. They will entertain a prioritized request list, estimate the effort required for the requests, and then “we” can meet to go over what can/will be done. They also ask for a brief use-case for each plot to help them consider equivalent means of supporting the same usage. • Margaret to talk to Keith about possibility of using existing raw data to characterize job idle time etc. • News? • Mike to talk with Robert about data transfer related plots: distinguish external (Enstore) vs internal (Data Production) issues • News? • Other Topics? • … • Close-out Party Suggestions • Rob to arrange, looking for input. Lunch on Friday 6/5?
D0 Grid Data Production Discussion Summary • Topic… • …