100 likes | 209 Views
Software WG Update TeraGrid Roundtable. October 21, 2010. Lee Liming, JP Navarro GIG Software Integration. Resource Changes. Added/upgraded/removed Longhorn @ TACC started 1/3/2010 Mercury @ NCSA ended 3/31/2010 BigBen @ PSC ended 3/31/2010 (Dash @ SDSC started 4/1/2010)
E N D
Software WG UpdateTeraGrid Roundtable October 21, 2010 Lee Liming, JP Navarro GIG Software Integration
Resource Changes Added/upgraded/removed • Longhorn @ TACC started 1/3/2010 • Mercury @ NCSA ended 3/31/2010 • BigBen @ PSC ended 3/31/2010 • (Dash @ SDSC started 4/1/2010) • Athena @ NICS started 10/1/2010 • Ember @ NCSA started 10/1/2010 • Nautilus @ NICS started 10/1/2010 • Cobalt @ NCSA ending 11/15/2010 • Lonestar 3 @ TACC ending 12/1/2010 • Black Light @ PSC starts 1/1/2011 • Trestles @ SDSC starts 1/1/2011 • Lonestar 4 @ TACC starts 2/1/2011 http://teragridforum.org/mediawiki/index.php?title=TeraGrid_Resource_Decommission TODAY TeraGrid Roundtable Software WG Update
Local Compute Capability 4.2.1 (Jan 2010) TeraGrid allocated computation support RDR Compute information publishing GLUE2 publishing for meta-scheduling Distributed Programming Systems 4.2.0 (Jan 2010) SAGA pre-release Remote Login Capability 4.0.3 (Feb 2010) Tgusage upgrade Local Compute Capability 4.2.2 (July 2010) GLUE2 publishing update Meta-scheduling Capability 4.2.1 (July 2010) GLUE2 publishing update Remote Compute Support Capability 5.0.1 (July & Oct 2010) GRAM5 Science Gateways Support Capability 5.0.1 (July & Oct 2010) GRAM5 Spruce Improvements to support Computational Clouds Recent Capability Changes TeraGrid Roundtable Software WG Update
Upcoming Capability Changes Data Movement Servers Capability • GridFTP 5.0.2 upgrade Application Dev & RT Capability • Globus 5 client Science Workflow and other Capabilities • Condor/Condor-G update for GRAM5 compatibility Several capability kits • CUE implementation Science Gateway Publishing Capability • Advertise software and service offered by Science Gateways Remote Login Capability • GSI OpenSSH with Logging Data capabilities (Chris) Scheduling capabilities (Warren) TeraGrid Roundtable Software WG Update
TeraGrid outages publishing Comprehensive software search publishing Resource Description Repository publishing Gateway Application Web-services registry publishing John McGee, Jason Reilly at RENCI Recent Information Services Changes TeraGrid Roundtable Software WG Update
GRAM5 – What is it? • Where did it come from? • Based on GT4 GRAM2 code. • Removes some less used features and alters some behaviors, though it remains protocol-compatible with existing GRAM2 deployments. • Significant scalability re-design from GRAM2. • File streaming has been replaced by end-of-job file staging (transparently to the user), and MPICH-G2 multijob coordination is removed from the service. • Includes these TeraGrid’s Job Managers: Condor, PBS, LSF, and SGE. • How GRAM2 compatible? • Same job description language. • Compatibility confirmed with existing GRAM2 clients: globusrun, COG-jglobus, and Condor-G clients submitting and monitoring jobs. • Summary • Based on the most used and reliable GRAM (GRAM2) implementation, • With significant performance and scalability improvements • http://dev.globus.org/wiki/GRAM5_Scalability_Results TeraGrid Roundtable Software WG Update
GRAM 5 Release History • Alpha • Alpha 2 & 3 released Summer 2009 • Tested by LONI-LSU and NCSA July thru September 2009 • Beta • Beta 1 released November 2009 • Production • Version 5.0.0 released January 20, 2010 • Version 5.0.1 released March 27, 2010 • First deployed by TACC, April 2010 • Tested by LONI-LSU, NCSA, and on FutureGrid • Tested by Gateways (NanoHUB) • Version 5.0.2 released July 19, 2010 • TeraGrid capability kits released • Significant QA-WG testing activity • Test runs: 5800/August, 2200/September TeraGrid Roundtable Software WG Update
Important to Science Gateways • Nancy: • “The big push for GRAM5 for gateways is the support for attribute-based authentication. Both ssh-only and pre-WS GRAM support is only available through TG’s GRAM5 packaging. Our initial goal for attribute-based authentication was September 2009 and was delayed by the GRAM5 announcement and the decision to add attribute support only there. We are happy with GRAM5 and feel it’s a move in the right direction for gateways, but still it introduced an unanticipated delay and we’re anxious to move forward.” TeraGrid Roundtable Software WG Update
Deployment Approach • New Science Gateways Support Kit Version 5 • For RPs supporting Science Gateways. • All Gateways using GRAM2 are switching to GRAM5. • New Remote Compute Kit Version 5 • For RPs supporting non-Gateway remote computation from the command line or thru Condor-G. • All users using GRAM2 will switch to GRAM5, except those using mpich-g2. • Transition Plan • RPs without mpich-g2 users can decommission PreWS GRAM2 when GRAM5 goes production. • RPs should keep WS-GRAM (GRAM4) until users have had a chance to port to GRAM5 (we can monitor usage statistics). TeraGrid Roundtable Software WG Update
GRAM5 RP Deployments • Completed Deployments • TACC Lonestar, Ranger • LONI-LSU QueenBee • Purdue Condor, Steele • In-progress/planned Deployment • IU BigRed • NCSA Ember, and other resources • NICS Kraken, Nautilus • PSC Pople, Black Light • SDSC Tresles, Gordon • Deployment plans under discussion • NCAR • ORNL TeraGrid Roundtable Software WG Update