ERCOT Project Update
ERCOT Outage Evaluation Phase 2 (SCR745)
TDTWG May 7, 2008
PR60006_01 Phase 2 ERCOT Update - Overview
Background:
SCR 745: To achieve improved Market performance and reliability through a reduction of ERCOT Retail Systems unplanned outages. This effort was planned to be implemented in two subprojects:
PR60006_01: ERCOT Outage Evaluation Phase I and Phase II
• Phase I, NAESB and Proxy Clustered (Delivered 02/2007, Goal Achieved)
• Phase II, PaperFree Clustered environment with File Server Redundancy and High Availability
PR60006_02: Phase III, Database Clustered environment (Cancelled per recommendations at 04/02/2008 TDTWG)
Phase II Status:
• 02/10/2007 – Implementation of the Veritas clustered solution was rolled back due to unsuccessful failover.
• 03/08/2008 – Implementation of the Polyserve clustered solution was rolled back due to performance and stability issues (this solution would have delivered Redundancy and Failover).
• 05/07/2008 – Seeking recommendations from TDTWG for Next Steps
PR60006_01 Phase 2 ERCOT Update – Next Steps
Recommendations from HP for Performance improvement will require Architectural changes, server rebuilds, and testing.
ERCOT Recommends pursuing one of the following Options:
1) Place project “On Hold” (preferred), for the following reasons:
• Stabilization of SAN Switch Replacement Project (Polyserve known issue with loss of connectivity to SAN)
• Test Environment Lock down until December 2008 due to Ts and Cs, MarkeTrak, and Nodal
• Resource constraints due to Ts and Cs, MarkeTrak, and Nodal
• Eliminates additional Finance charges by placing project on Hold
• Allows the project to move forward in 2009 with an implementation that will deliver Failover capabilities (High Availability and Redundancy Goal of SCR)
2) Close project and complete effort as O & M:
• Additional funding will be required for remaining efforts
• Total Project estimated at $1M, approved by Board in 2005
• Committed approximately $885K; will require Board approval for additional funding
PR60006_01 Phase 2 ERCOT Update – Outages Based on IT Incident Report and SCR Metrics
PR60006_01 Phase 2 ERCOT Update – PF Outage Details (3yrs)
PaperFree Availability Metrics Prior to March 2008 as a result of 2007 Intermediate Resolutions
• Previous logged incident for PaperFree file server – 02/2007.
• Until March 2008, the PaperFree application was 100% available due to intermediate solutions (meeting SCR Goal for reliability).
PR60006_01 Phase 2 ERCOT Update – TDTWG Recommendations
Discussion