At the TeraGrid Roundtable on October 21, 2010, the Software Working Group presented an update on resource changes across TeraGrid sites, including the addition of new systems such as Longhorn and Athena and the decommissioning of older systems such as Cobalt and Lonestar 3. The update also introduced GRAM5, which is based on the GRAM2 code and improves on its performance and scalability. Other topics were recent and upcoming capability kit changes, attribute-based authentication for science gateways, and the plan and status for deploying GRAM5 across TeraGrid resources.
Software WG Update
TeraGrid Roundtable, October 21, 2010
Lee Liming, JP Navarro
GIG Software Integration

Resource Changes
Added/upgraded/removed
• Longhorn @ TACC started 1/3/2010
• Mercury @ NCSA ended 3/31/2010
• BigBen @ PSC ended 3/31/2010
• (Dash @ SDSC started 4/1/2010)
• Athena @ NICS started 10/1/2010
• Ember @ NCSA started 10/1/2010
• Nautilus @ NICS started 10/1/2010
• TODAY (10/21/2010)
• Cobalt @ NCSA ending 11/15/2010
• Lonestar 3 @ TACC ending 12/1/2010
• Black Light @ PSC starts 1/1/2011
• Trestles @ SDSC starts 1/1/2011
• Lonestar 4 @ TACC starts 2/1/2011
http://teragridforum.org/mediawiki/index.php?title=TeraGrid_Resource_Decommission

Recent Capability Changes
• Local Compute Capability 4.2.1 (Jan 2010)
  • TeraGrid allocated computation support
  • RDR Compute information publishing
  • GLUE2 publishing for meta-scheduling
• Distributed Programming Systems 4.2.0 (Jan 2010)
  • SAGA pre-release
• Remote Login Capability 4.0.3 (Feb 2010)
  • Tgusage upgrade
• Local Compute Capability 4.2.2 (July 2010)
  • GLUE2 publishing update
• Meta-scheduling Capability 4.2.1 (July 2010)
  • GLUE2 publishing update
• Remote Compute Support Capability 5.0.1 (July & Oct 2010)
  • GRAM5
• Science Gateways Support Capability 5.0.1 (July & Oct 2010)
  • GRAM5
  • Spruce
  • Improvements to support Computational Clouds

Upcoming Capability Changes
• Data Movement Servers Capability
  • GridFTP 5.0.2 upgrade
• Application Dev & RT Capability
  • Globus 5 client
• Science Workflow and other Capabilities
  • Condor/Condor-G update for GRAM5 compatibility
• Several capability kits
  • CUE implementation
• Science Gateway Publishing Capability
  • Advertise software and services offered by Science Gateways
• Remote Login Capability
  • GSI OpenSSH with Logging
• Data capabilities (Chris)
• Scheduling capabilities (Warren)

Recent Information Services Changes
• TeraGrid outages publishing
• Comprehensive software search publishing
• Resource Description Repository publishing
• Gateway Application Web-services registry publishing
  • John McGee, Jason Reilly at RENCI

GRAM5 – What is it?
• Where did it come from?
  • Based on GT4 GRAM2 code.
  • Removes some less-used features and alters some behaviors, though it remains protocol-compatible with existing GRAM2 deployments.
  • Significant scalability redesign from GRAM2.
  • File streaming has been replaced by end-of-job file staging (transparently to the user), and MPICH-G2 multijob coordination is removed from the service.
  • Includes the job managers used on TeraGrid: Condor, PBS, LSF, and SGE.
• How GRAM2-compatible is it?
  • Same job description language.
  • Compatibility confirmed with existing GRAM2 clients: globusrun, COG-jglobus, and Condor-G clients submitting and monitoring jobs.
• Summary
  • Based on the most used and reliable GRAM implementation (GRAM2),
  • with significant performance and scalability improvements.
  • http://dev.globus.org/wiki/GRAM5_Scalability_Results
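
Because GRAM5 keeps the GRAM2 job description language (RSL), a job description written for GRAM2 should not need to change. As a rough illustration, the sketch below renders a small job description as an RSL string. The helper names `rsl_value` and `to_rsl` are hypothetical and not part of any Globus tool; `executable`, `count`, and `arguments` are standard RSL attributes, and the sketch covers only simple quoted values.

```python
def rsl_value(value):
    """Render one RSL value: integers bare, everything else double-quoted."""
    if isinstance(value, int):
        return str(value)
    text = str(value)
    if '"' in text:
        # Quote escaping is deliberately left out of this sketch.
        raise ValueError("embedded double quotes are not handled here")
    return '"' + text + '"'


def to_rsl(attributes):
    """Render a dict of job attributes as a single RSL conjunction.

    A list value becomes a space-separated sequence, as used for
    attributes such as `arguments`.
    """
    relations = []
    for name, value in attributes.items():
        if isinstance(value, (list, tuple)):
            rendered = " ".join(rsl_value(v) for v in value)
        else:
            rendered = rsl_value(value)
        relations.append("(" + name + "=" + rendered + ")")
    return "&" + "".join(relations)


if __name__ == "__main__":
    job = {"executable": "/bin/hostname", "count": 4, "arguments": ["-a", "-b"]}
    print(to_rsl(job))
```

The resulting string is the kind of description a GRAM2 client such as globusrun or Condor-G hands to the service; per the compatibility notes above, the same description would be submitted to a GRAM5 endpoint.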

GRAM5 Release History
• Alpha
  • Alpha 2 & 3 released Summer 2009
  • Tested by LONI-LSU and NCSA July thru September 2009
• Beta
  • Beta 1 released November 2009
• Production
  • Version 5.0.0 released January 20, 2010
  • Version 5.0.1 released March 27, 2010
    • First deployed by TACC, April 2010
    • Tested by LONI-LSU, NCSA, and on FutureGrid
    • Tested by Gateways (NanoHUB)
  • Version 5.0.2 released July 19, 2010
    • TeraGrid capability kits released
    • Significant QA-WG testing activity
    • Test runs: 5800/August, 2200/September

Important to Science Gateways
• Nancy:
  • “The big push for GRAM5 for gateways is the support for attribute-based authentication. Both ssh-only and pre-WS GRAM support is only available through TG’s GRAM5 packaging. Our initial goal for attribute-based authentication was September 2009 and was delayed by the GRAM5 announcement and the decision to add attribute support only there. We are happy with GRAM5 and feel it’s a move in the right direction for gateways, but still it introduced an unanticipated delay and we’re anxious to move forward.”

Deployment Approach
• New Science Gateways Support Kit Version 5
  • For RPs supporting Science Gateways.
  • All Gateways using GRAM2 are switching to GRAM5.
• New Remote Compute Kit Version 5
  • For RPs supporting non-Gateway remote computation from the command line or through Condor-G.
  • All users of GRAM2 will switch to GRAM5, except those using mpich-g2.
• Transition Plan
  • RPs without mpich-g2 users can decommission PreWS GRAM2 when GRAM5 goes into production.
  • RPs should keep WS-GRAM (GRAM4) until users have had a chance to port to GRAM5 (we can monitor usage statistics).
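
The transition plan above can be read as a small decision rule per resource provider. The sketch below is one reading of the slide, not an official policy tool: the function name, parameter names, and returned labels are hypothetical, and `gram4_users_ported` stands in for the slide's "users have had a chance to port", which the slide does not define precisely.

```python
def services_to_run(has_mpich_g2_users, gram5_in_production, gram4_users_ported):
    """Return the GRAM services an RP would keep running under the plan."""
    services = []
    if gram5_in_production:
        services.append("GRAM5")
    # PreWS GRAM2 stays until GRAM5 is in production, and stays beyond
    # that only if the RP still has mpich-g2 users.
    if has_mpich_g2_users or not gram5_in_production:
        services.append("PreWS GRAM2")
    # WS-GRAM (GRAM4) stays until its users have ported to GRAM5.
    if not gram4_users_ported:
        services.append("WS-GRAM (GRAM4)")
    return services


if __name__ == "__main__":
    print(services_to_run(has_mpich_g2_users=False,
                          gram5_in_production=True,
                          gram4_users_ported=False))
```

In practice the third input would come from the usage statistics mentioned in the plan rather than from a simple flag.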

GRAM5 RP Deployments
• Completed Deployments
  • TACC Lonestar, Ranger
  • LONI-LSU QueenBee
  • Purdue Condor, Steele
• In-progress/planned Deployments
  • IU BigRed
  • NCSA Ember, and other resources
  • NICS Kraken, Nautilus
  • PSC Pople, Black Light
  • SDSC Trestles, Gordon
• Deployment plans under discussion
  • NCAR
  • ORNL