On-Time Product Delivery
COPC - HPCC Best Practices, 14-15 March 2011
Allan Darling, Deputy Director, NCEP Central Operations
Where America’s Climate, Weather, Ocean and Space Weather Services Begin
On-Time Product Delivery COPC HPCC Best Practices - 14-15 March 2011
NCEP Mission
NCEP delivers science-based environmental predictions to the Nation and the global community. We collaborate with partners and customers to produce reliable, timely, and accurate analyses, guidance, forecasts and warnings for the protection of life and property and the enhancement of the national economy.
NCEP Goals and Strategies
• Information Systems
  • Enhance the real-time, on-time, all the time access, display and delivery of NCEP products and services.
On-Time Product Delivery
The principal performance metric for NCEP Operational Supercomputing, measured since 1999.
Underlying Philosophy: Product delivery is the last event in the whole modeling process. To deliver on time, the entire chain of events must work as intended.
One Measurement of Operational Success
Incentives for Capability
On-Time Product Delivery Dual System CM & Ops Practice Refinement
Enabling the Capability On-Time Product Delivery Metrics High Availability Operations Practices Configuration Management System Architecture Technical Practice
Technical Practice: Measurement
• Products are “on time” if they are released within 15 minutes of their assigned target delivery time
• Target delivery times are based on 30-day average availability times of products
• Target delivery times are adjusted as needed
  • Model changes
  • System changes
  • New products added as part of the model implementation process
• Timeliness measured for ~720,000 products today
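The measurement rule above can be sketched in a few lines of Python. This is only an illustration, not NCEP's implementation: the function names, the use of "minutes after cycle start" as the time unit, and the reading of "within 15 minutes" as "no later than 15 minutes after the target" (so early releases count as on time) are all assumptions.

```python
ON_TIME_WINDOW_MIN = 15.0   # from the slide: released within 15 minutes of target
HISTORY_DAYS = 30           # from the slide: 30-day average availability time


def target_offset_minutes(availability_history):
    """Target delivery time as the mean of the most recent 30 availability times.

    Times are expressed as minutes after cycle start (an assumed unit).
    """
    recent = availability_history[-HISTORY_DAYS:]
    return sum(recent) / len(recent)


def is_on_time(release_offset, target_offset):
    """Assumed reading: on time if released no later than target + 15 minutes."""
    return release_offset <= target_offset + ON_TIME_WINDOW_MIN


def on_time_percentage(release_offsets, target_offset):
    """Share of releases that met the on-time rule, as a percentage."""
    on_time = sum(1 for r in release_offsets if is_on_time(r, target_offset))
    return 100.0 * on_time / len(release_offsets)


# Hypothetical product: 29 days available at minute 200, one day at minute 230.
history = [200.0] * 29 + [230.0]
target = target_offset_minutes(history)
score = on_time_percentage([200.0, 216.0, 217.0, 190.0], target)
```

A real system would also need to recompute targets after model or system changes, as the slide notes; the sketch leaves that out.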
Technical Practice: Measurement
• Some products are excluded from measurement
  • Inconsistent delivery times (e.g. on-demand dispersion models)
  • Not delivered through operational dissemination services
• Measurement performed daily at 1200Z
  • Entire previous day
  • First half of current day
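As a rough sketch of the daily run described above, the window evaluated at 1200Z could be computed as follows. It assumes that "first half of current day" means 0000Z to 1200Z, giving a 36-hour window. The function names and the two boolean flags are hypothetical and do not come from the slides.

```python
from datetime import date, datetime, time, timedelta, timezone


def measurement_window(run_date):
    """Return (start, end) in UTC for the measurement run at 1200Z on run_date.

    Assumed to cover the entire previous day plus 0000Z-1200Z of the current day.
    """
    start = datetime.combine(run_date - timedelta(days=1), time(0, 0), tzinfo=timezone.utc)
    end = datetime.combine(run_date, time(12, 0), tzinfo=timezone.utc)
    return start, end


def is_measured(has_consistent_delivery_time, uses_operational_dissemination):
    """Apply the two exclusion rules from the slide (flag names are hypothetical)."""
    return has_consistent_delivery_time and uses_operational_dissemination


start, end = measurement_window(date(2011, 3, 15))
```

Under this reading, each run re-measures the previous day's first half, which the preceding run had already covered.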
Operations Practice
• Daily meeting to review:
  • Operations log
  • Status of open issues
  • On-time delivery metrics
  • Calendar of planned events
• Weekly meeting with HPC vendors to review:
  • Facility and system status
  • System utilization
  • Vendor open issues
On-Time Product Delivery Dual System CM & Ops Practice Refinement
CM Incentive
• Backup supercomputer implemented, with associated IT infrastructure and requirements
  • Network between systems
  • System configuration synchronization
  • Coordinated model implementations
  • Failover capability
Expectation: Better Performance
Reality: Greater Complexity
Configuration Management
• Ensure system integrity
• Weekly meeting to review executed and proposed changes
• Before change occurs…
  • Validate and test
  • Schedule appropriately
  • Review and approve
  • Communicate with customers
• After change occurs…
  • Identify and communicate outcomes
Configuration Management
• Covers all NCO IT practice, not just supercomputers
• Includes NWS and other partners
• Full-time staff (primary and backup)
• Weekly tempo with daily tie-in to operations
On-Time Product Delivery Dual System CM & Ops Practice Refinement
CM Evolution w/ On-time Feedback CM Refinement First CM Attempt CM Process Focus
Change Metrics Last 12 months – 15 changes withdrawn out of 1004
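As a quick worked check of what these counts imply (the percentages are derived here, not stated on the slide):

```latex
\frac{15}{1004} \approx 0.0149 \approx 1.5\% \text{ of changes withdrawn},
\qquad
\frac{1004 - 15}{1004} = \frac{989}{1004} \approx 98.5\% \text{ not withdrawn}
```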
Ancillary Benefits
• Daily review
  • Identifies performance problems before customers are affected
  • Reveals silent failures
• Weekly & Monthly Reviews
  • Identify system management gaps
  • Identify model instability
On-Time Product Delivery: Yearly Average
• 2006: 99.42%
• 2007: 99.70%
• 2008: 99.82%
• 2009: 99.85%
• 2010: 99.83%