260 likes | 345 Vues
HP NonStop – R P M R eal-time P rocess M onitor Product Overview Release 1.2 update. HP NonStop Enterprise Division.
E N D
HP NonStop – RPMReal-time Process Monitor Product OverviewRelease 1.2 update HP NonStop Enterprise Division
Process Cpu,Pin RcvQ% Name RPM Busy Programs -------- ------- ----- ------ ------------------\LONDON 9,281 77.46 $SRV9 $DATA2.APP.DBSERVER11:30:00 1,237 63.12 $SRV7 $DATA2.APP.DBSERVER 2,405 51.23 $SRV5 $DATA2.APP.DBSERVER 3,112 19.37 $SRV3 $DATA2.APP.DBSERVER 0,489 11.84 $SRV2 $DATA2.APP.DBSERVER 2,961 8.09 $SRV4 $DATA2.APP.DBSERVER 3,290 5.12 $SRV8 $DATA2.APP.DBSERVER 0,506 4.11 $SRV6 $DATA2.APP.DBSERVER 1,871 3.12 $SRV0 $DATA2.APP.DBSERVER 0,315 0.50 $SRV1 $DATA2.APP.DBSERVER Color coded alert thresholds:Red- CriticalYellow- Warning Blue- Informative HP NonStop Real-time Process Monitor – Overview • RPM – Real-time Process Monitor • Low-cost point-product providesultra-fast, real-time, discovery and display of busiest Cpus, Processes, and network Nodes • Can monitor a Single Cpu, orSelected nodes, or an entireNetwork of over 4000 Cpus • Update 2 provides many new sorting and discovery options... by Cpu, Busy, Inputs, I/Os, Outputs, Memory, Node, Page Faults, Process File Segment, Queue Length, ... • Supports many device-types including TTY, ANSI, VT100, T6530, ... as well as data streaming to disk and process • Fast low-overhead display of core Cpu and Process metrics
RPM – Benefit Overview • Continuously finds busiest Cpus & Processes • Simple and easy to use: TACL 1 > RPM • Ultra-fast startup and display in under 1 second • Ultra-low overhead statistics data-stream more efficient than XML, Java, HTML constructs • Supports wide-range of different device types • User defined thresholds and color-encoded statistics • Critical - Red • Warning – Yellow • Informative – Blue
RPM 1.2 Overview – supported systems • RPM update #2 ( also known as RPM 1.2 )Runs on D-, G-, H-, J-series operating systems • RPM 1.2 - Product ID, OS, T-number, SPR-id • QRPM01V1– RPM for J-series T0877H01^AAC • HRPM01V1– RPM for H-series T0877H01^AAC • RPM01V1 – RPM for D-,G-series T0877V01^AAD • Update #2 new features include: • New By item metric selection and sorting options • New short term and long term display options • Extreme performance enhancements • New command history options
RPM update 2 – New Feature summary • Update 2 – Driven by customer Requests For Enhancement • RFE – Provide 10 new BY item features- Result 10 new By item sorting and selection criteria • RFE – Provide 4 new display options- Result simultaneous viewing of short term, long term, processor and process life-time statistics. • RFE – Provide Command enhancements- Result New History, Execute !, and Fix commands • RFE – Provide Extreme Performance Enhancements- Result ultra-light real-time monitoring of over 4000+ processors
RPM update 2 – Performance enhancements • Update 2 performance enhancements: • Message overhead reduced 100 – 1000x • Cpu overhead reduced 10 – 20x • Response time 10 – 20x faster • Result ultra-light, scalable monitoring of up to 40,000,000 processes • RPM 1.2 performance enhancements were result of using RPM to analyze and tune its own performance.
RPM update 2 – BY item enhancements • Many new selection and sorting options added: • By Busy – process selection and sorting by Cpu busy • By Cpu – processes grouped by Cpu, by node, and by item • By Inputs – process selection and sorting by most inputs • By IOs– process selection and sorting by most IOs • By Outputs – process selection and sorting by Outputs • By Memory– process selection and sorting by memory • By Node– processes grouped by Node • By PFS– process selection and sorting by PFS usage • By Q– process selection and sorting by receive queue • By Swaps– process selection and sorting by page faults
By Memory Process By PFS By Inputs By Busy By Outputs By IOs By Q By Swaps Swap Space RPM update 2 – Process Metric Anatomy By Inputs– process select and sort by most Inputs By Memory– process select and sort by most Memory By Busy – process select and sort by most Busy By Outputs – process select and sort by most Outputs By IOs – process select and sort by most I/Os By PFS– process select and sort by most PFS usage By Q– process select and sort by most receive Queue By Swaps– process select and sort by page Faults Process Cpu,Pin RcvQ% Name RPM Busy Programs -------- ------- ----- ------ ------------------\LONDON 9,281 77.46 $SRV9 $DATA2.APP.DBSERVER11:30:00 1,237 63.12 $SRV7 $DATA2.APP.DBSERVER 2,405 51.23 $SRV5 $DATA2.APP.DBSERVER 3,112 19.37 $SRV3 $DATA2.APP.DBSERVER 0,489 11.84 $SRV2 $DATA2.APP.DBSERVER 2,961 8.09 $SRV4 $DATA2.APP.DBSERVER 3,290 5.12 $SRV8 $DATA2.APP.DBSERVER 0,506 4.11 $SRV6 $DATA2.APP.DBSERVER 1,871 3.12 $SRV0 $DATA2.APP.DBSERVER 0,315 0.50 $SRV1 $DATA2.APP.DBSERVER
RPM update 2 – Displayenhancements • New Display options: • Simultaneous views of short-term and long-term statistics • Short-term views down to 1 second intervals • Long-term views of life-cycle statistics over Elapsed Time since Cpu Load and Elapsed Time since Process launch • EToption shows process short term stats (seconds) and long term stats (across Elapsed Time of process) • ETPCT shows short and long term stats as a percentage • ETALLshows short and long term stats in extreme detailfor both Cpus and Processes (wide zoom view) • DATE shows year/month/day hh:mm started/loaded
RPM update 2 – Command enhancements • New HISTORY, FC, ! command options added • HISTORY - Displays history of commands • HI- Same as HISTORY command • HI 30 – Show 30 most recent commands • FC– Fix of most recent command • FC <number> - Fix command <number> • FC <target> - Fix command starting with <target> • ! – execute most recent command • ! <number> - execute command <number> • ! <target> - execute command <target>
RPM Applicability • RPM was developed, evolved, and refined over multi-year period in HP Labs • Applicable regardless of what products or applications currently used • Runs on multiple nodes, multiple operating-systems, and multiple device-types • Supports • J-, H-, G-, D-series - Operating Systems • Runs in mixed-version, multi-node, mixed-hardware networks • Monitors many Cpus and/or Nodes at once • Built based on long-term development requirement to understand real-time dynamics of software operating in large clusters of disparate NonStop servers • Provides ultra-fast, low-overhead, real-time, super-cluster monitoring capabilities • Customizable • RPM supports small, medium, and super-scalar display types: • From small 80x24 x16-Cpu displays • To medium 132x54 x64-Cpu displays • To giant 200x300 x4000-Cpu displays • Device Types • TTY, T6530, VT100, ANSI • Disk file output • Process file output
RPM General Features • Understands both NSK and OSS file/path names • Install Wizard provides fast setup and configuration eliminates startup learning curve • Supports many different device types including: ANSI, VT100, T6530, TTY, Disk, and Process • Understands T6530 devices for support of legacy displays including 24x80 and 54x132 displays • Understands ANSI/VT100 Super-Sized200x300 character displays which are native (and free) on all Windows and Linux boxes • Supports generic TTY, Disk, Process I/O allowing smart client front-ends and historical stats archival
RPM Analysis & Display Features • Real-time - analysis-and-display of Cpu and Process activity by metric, by Cpu, by node, by selected nodes, by cluster • ADD node - commands allow multiple node analysis-display • By Items– provide metric selection and sorting (new in 1.2) • ByCpu– displays busiest processes in one or more Cpus • ByNode - displays busiest processes in one or more nodes • Results– are sorted, filtered, and color-encoded in real-time • Statistics- synchronized across Cpus, Nodes, and Users (requires neither Measure nor Super) • Example: show busiest processes by Inputs, by Nodes • TACL 1 > RPM P \*, ByInputs, ByNode • Options provide wide-range of analysis and displays • Examples follow...
TACL 1 > RPM PB 3, ENTRIES 10, RATE 5 Process Cpu,Pin Busy% Name RPM T0877 Programs ET= 5.0 Top Pri User -------- ------- ----- -------- -------------------------- --- --- ------- \A 3,73 87.82 $STEP $DATA.DATABASE.QUERY 1 1 66,1 11:32:10 3,320 10.13 $QAZ08 $DATA.DATABASE.QADRV1T 2 168 255,34 3,263 1.20 $HIT $SYSTEM.SYS03.TSYSDP2 3 220 255,255 3,319 .15 $QAZ07 $RPM.RPMQA.QADRV1T 4 168 255,34 3,314 .12 $QAZ06 $RPM.RPMQA.QADRV1T 5 168 255,34 3,175 .09 $ZOOH3 $DATA2.R0403V02.ASAPXMON 6 189 255,255 3,0 .06 $MON $SYSTEM.SYS03.OSIMAGE 7 201 255,255 3,192 .04 $X11W $SYSTEM.SYSTEM.RPMVT 8 168 66,1 3,43 .02 $ZOOM3 $SYSTEM.SYSTEM.ASAPMON 9 160 255,255 3,312 .02 $QAZ04 $RPM.RPMQA.QADRV1T 10 168 255,34 RPM Example #1 – Monitor all processes in one Cpu • PB 3, ENTRIES 10, RATE 5 • PB 3 – displays busiest processes in Cpu 3 • ENTRIES 10 – displays the 10 busiest processes • RATE 5 – updates every 5 seconds ( can sample down to 1 sec ) • Commands can be abbreviated, commas optional, for example:P3 E10 R5
TACL 1 > RPM PB *, ENTRIES 10, RATE 5, ByNode Process Cpu,Pin Busy% Name RPM T0877 Programs ET= 5.0 Top Pri User -------- ------- ----- -------- -------------------------- --- --- ------- \A 3,73 66.24 $STEP $DATA.DATABASE.DRIVER 1 1 66,1 11:59:05 0,41 11.25 $ZEXP $DATA.DATABASE.QUERY 2 170 255,255 1,159 1.82 $Y7Q6 $SYSTEM.SYS03.NSKCOM 3 159 255,255 1,69 1.70 $ZOO $DATA.DATABASE.LISTREC 4 160 255,255 2,271 .58 $HIT $SYSTEM.SYS03.TSYSDP2 5 220 255,255 2,279 .52 $HIT $SYSTEM.SYS03.TSYSDP2 6 220 255,255 2,28 .18 $X90N $SYSTEM.SYSTEM.RPM 7 168 66,1 1,153 .31 $DATA1 $SYSTEM.SYS03.TSYSDP2 8 167 66,1 3,263 .31 $HIT $SYSTEM.SYS03.TSYSDP2 9 220 255,255 0,0 .30 $MON $SYSTEM.SYS03.OSIMAGE 10 201 255,255 RPM Example #2 – All processes in a node ByNode • PB *, ENTRIES 10, RATE 5, ByNode • PB * – displays Process Busy for all Cpus in a node • ENTRIES 10 – displays the 10 busiest processes • RATE 5 – updates every 5 seconds ( can sample down to 1 sec ) • ByNode – displays busiest processes for all Cpus in node & sorts by node
Process Cpu,Pin Busy% Name RPM T0877(01MAR10) ET= 5.0 Top Pri User -------- ------- ----- -------- -------------------------- --- --- ------- \SANFRAN 0,177 71.17 $Y567 $DATA.DATABASE.DRIVER 1 200 255,255 11:30:20 0,1021 14.24 $Y568 $DATA.DATABASE.QUERY 2 201 255,255 0,256 5.12 $T569 $DATA.DATABASE.LISTREC 3 205 255,255 0,257 .10 $TST1 $DATA.DATABASE.QUERY 4 220 255,255 0,15 .08 $NCP $SYSTEM.SYS03.NCPOBJ 5 199 255,255 \SANFRAN 1,182 11.32 $Z121 $DATA1.ANALYSIS.TOOL 1 168 255,255 11:30:20 1,47 5.31 $Y3AL $DATA1.ANALYSIS.QUERY 2 168 66,1 1,14 .23 $NCP $SYSTEM.SYS03.NCPOBJ 3 199 255,255 1,0 .16 $MON $SYSTEM.SYS03.OSIMAGE 4 201 255,255 1,87 .13 $ZOOS $SYSTEM.SYSTEM.RPM 5 160 255,255 \SANFRAN 2,271 1.08 $HIT $SYSTEM.SYS03.TSYSDP2 1 220 255,255 11:30:20 2,178 .31 $Z147 $SYSTEM.SYSTEM.TEDIT 2 160 255,255 2,0 .04 $MON $SYSTEM.SYS03.OSIMAGE 3 201 255,255 2,303 .02 $TT54 $SYSTEM.SAPP.DRV 4 80 255,255 2,275 .01 $TT62 $SYSTEM.SAPP.DRV 5 80 255,255 \SANFRAN 3,324 3.37 $QAZ08 $DATA1.DATABASE.QADRV1T 1 168 255,34 11:30:20 3,263 .28 $HIT $SYSTEM.SYS03.TSYSDP2 2 220 255,255 3,323 .18 $QAZ07 $ASAP.ASAPXQA.QADRV1T 3 168 255,34 3,322 .12 $QAZ06 $ASAP.ASAPXQA.QADRV1T 4 168 255,34 3,0 .08 $MON $SYSTEM.SYS03.OSIMAGE 5 201 255,255 RPM Example #3 – All processes in a node ByCpu • ByCpu • Used for observing load across Cpus • Visual persistence allows viewing of changing load • PB *, Entries 5, Rate 5, ByCpu • PB * – displays Process Busy for all processes in all Cpus in this node • ENTRIES 5 – displays the 5 busiest processes in each Cpu • RATE 5 – updates every 5 seconds – integrate stats over that interval • ByCpu – displays busy processes grouped by Cpu
TACL 1 > RPM PB \*, ENTRIES 5, RATE 5 Realtime Process Monitor (RPM) - T0877V01.00 - (15APR08) +ADD \A +ADD \B +ADD \C Process Cpu,Pin Busy% Name RPM T0877 Programs ET= 5.0 Top Pri User -------- ------- ----- -------- -------------------------- --- --- ------- \A 3,35 74.63 $SPI1 $DATA.DATABASE.DRIVER 1 1 66,1 12:07:05 1,134 51.82 $ZYOB $DATA.DATABASE.QUERY 2 160 255,255 1,175 19.19 $ZERX $DATA.DATBASE.LISTREC 3 167 66,1 1,42 .18 $Z23V $SYSTEM.SYSTEM.DRIVE 4 166 66,1 1,249 .17 $Z244 $SYSTEM.SYSTEM.TEST 5 166 66,1 \B 2,271 4.83 $HIT $SYSTEM.SYS03.TSYSDP2 1 220 255,255 12:07:05 1,254 2.44 $Z08N $SYSTEM.SYSTEM.TEST 2 167 66,1 1,153 .43 $X11Z $SYSTEM.SYSTEM.DRIVE 3 167 66,1 1,74 .18 $RPMX $SYSTEM.SYSTEM.SEEVIEW 4 168 66,1 3,320 .28 $QAZ08 $ASAP.ASAPXQA.QADRV1T 5 168 255,34 \C 0,290 3.76 $SQL $SYSTEM.SYS00.TSYSDP2 1 220 255,255 12:07:05 1,195 2.48 $ZOOL $SYSTEM.SYSTEM.ASAPFIL 2 160 255,255 1,0 1.82 $MON $SYSTEM.SYS00.OSIMAGE 3 201 255,255 0,0 .55 $MON $SYSTEM.SYS00.OSIMAGE 4 201 255,255 0,85 .07 $RPMX $SYSTEM.SYSTEM.SEEVIEW 5 168 66,1 RPM Example #4 – All processes on all nodes • ADD \Node – Adds as many \nodes as you want in RPM analysis and display • PB \*, Entries 5, Rate 5 • PB \* – \* indicates analysis and display is for all ADD-edNodes • ENTRIES 5 – displays the 5 busiest processes in each Node • RATE 5 – updates every 5 seconds – RPM can sample down to 1 second
RPM Example #5 – Composite Cpu + Process reports TACL 1 > RPM ZOOM \*, ENTRIES 5, RATE 5 Realtime Process Monitor (RPM) - T0877V01.00 - (15APR08) +ADD \A +ADD \B +ADD \C Cpus Cp hh:mm ss Busy Secs QLen Disp Disk Chit Swap MLock% Pcb PcbX -------- -- -------- ---- ---- ---- ----- ----- ----- ---- ------ --- ---- \A 0 11:30:45 78 5 0 406 0 4 0 8.00 81 107 1 11:30:45 15 5 0 346 0 0 0 4.29 84 106 2 11:30:45 0 5 0 176 0 40 0 6.73 28 59 3 11:30:45 0 5 0 28 0 0 1 3.54 30 66 \B 0 11:30:45 9 5 0 145 0 15 4 12.51 37 96 1 11:30:45 0 5 0 114 0 0 0 12.32 48 92 2 11:30:45 0 5 0 85 0 0 0 8.32 18 41 3 11:30:45 0 5 0 84 0 0 0 8.33 21 39 \C 0 11:30:45 4 5 0 131 0 4 0 8.01 33 97 1 11:30:45 0 5 0 29 0 0 1 7.65 50 84 Process Cpu,Pin Busy% Name RPM T0877 Programs ET= 5.0 Top Pri User -------- ------- ----- -------- -------------------------- --- --- ------- \A 0,35 74.63 $SPI1 $DATA.DATABASE.DRIVER 1 1 66,1 11:30:45 1,134 11.82 $ZYOB $DATA.DATABASE.QUERY 2 160 255,255 1,175 .19 $ZERX $DATA.DATBASE.LISTREC 3 167 66,1 1,42 .18 $Z23V $SYSTEM.SYSTEM.DRIVE 4 166 66,1 1,249 .17 $Z244 $SYSTEM.SYSTEM.TEST 5 166 66,1 \B 0,271 4.83 $HIT $SYSTEM.SYS03.TSYSDP2 1 220 255,255 11:30:45 1,254 .34 $Z08N $SYSTEM.SYSTEM.TEST 2 167 66,1 1,153 .13 $X11Z $SYSTEM.SYSTEM.DRIVE 3 167 66,1 1,74 .08 $RPMX $SYSTEM.SYSTEM.SEEVIEW 4 168 66,1 3,320 .28 $QAZ08 $ASAP.ASAPXQA.QADRV1T 5 168 255,34 \C 0,290 2.79 $SQL $SYSTEM.SYS00.TSYSDP2 1 220 255,255 11:30:45 1,195 .18 $ZOOL $SYSTEM.SYSTEM.ASAPFIL 2 160 255,255 1,0 .62 $MON $SYSTEM.SYS00.OSIMAGE 3 201 255,255 0,0 .15 $MON $SYSTEM.SYS00.OSIMAGE 4 201 255,255 0,85 .07 $RPMX $SYSTEM.SYSTEM.SEEVIEW 5 168 66,1 • ZOOM \* – Displays both Cpu and Process stats in same display
RPM Sample - RATE discussion • RPM - Easily and efficiently supports update rates down to 1 second • Fast 1 Second rate possible due to low-overhead linear-scalable design • Default is 10 seconds because rapid time varying data difficult for some people to follow • When RPM RATE is 10 seconds the digits to right of %Busy decimal point %0.NN represent Cpu milliseconds used. No other RATE provides this numerical convenience: • Milliseconds SET RATE Percent Cpu Busy • 100ms 10 seconds 1.00% Cpu Busy • 90ms 10 seconds 0.90% Cpu Busy • 80ms 10 seconds 0.80% Cpu Busy • 70ms 10 seconds 0.70% Cpu Busy • 60ms 10 seconds 0.60% Cpu Busy • ... ... ...
RPM Installation – Made easy using RPM Wizard • RPM includes a configuration/install wizard: TACL 1 > RUN RPMWIZ • Wizard interviews user and builds configuration • Enter TYPE of device VT100 | T6530 | TTY ? • Enter RATE interval in seconds 2, 3, 6, [10], .. ? • Enter ENTRIES to display per Cpu or Node [3], 5, 10, ... ? • Enter report Type Process | Cpu | Zoom ? • Enter whether you want to sort by Cpu or [Node] ? • Enter whether you want USEC time displayed Y/[N] ? • Enter %BUSY threshold value [0] ? • Enter INFO Busy% threshold value for INFO [1] ? • Enter WARN Busy% threshold value for WARN [10] ? • Enter CRIT Busy% threshold value for CRIT [50] ? • ADD \sysname ? \A • ADD \sysname ? \B • ADD \sysname ? \C • ADD \sysname ? This slide shows only partial wizard interview,complete example of a Wizard session is at:http://NonstopRPM.com/Example-Wizard.htm
RPM Installation – Use RPM Wizard • At End of Interview, the wizard creates RPMCNF configuration file !====================================================== !== RPM Config created by RPMWIZ - 08/02/22 07:20:08 !====================================================== SET TERM VT100 SET RATE 10 SET ENTRIES 10 SET SORT BYNODE SET USEC OFF SET CRIT 50 SET WARN 10 SET INFO 1 ADD \CENTDIV ADD \SOLAR ADD \WINE !====================================================== SAVE to $DATA.RPMV1.RPMCNF Y/N ? Y $DATA.RPMV1.RPMCNF save started. $DATA.RPMV1.RPMCNF saved. Alert Thresholds
RPM – Interfaces • RPM Supports wide-variety of device types • TTY • ANSI • VT100 (super sized) • T6530 (super sized) • File output • Process output
RPM – T6530 Device Interface - Example T6530 – ExampleP \*, ByNode, RATE 10, Entries 7 • RPM supports T6530 Devices and emulators • Example • RPM 6530 video high lights >> • P\*R10 E7Analyzes 3 nodes, shows 7 busiest processes per node • RATE 10 aka R10 means update every 10 seconds
RPM – Super-Size ANSI displays • RPM ANSI/VT100 device support: • ANSI represents important class of display with very significant benefits: • ANSI/VT100 provides ultra-high performance Super-Sized displays. • No need for emulator on Windows or Linux. VT100 is natively supported on all Windows or Linux computers. • ANSI/VT100 natively supported on virtually ANY Windows PC. From Windows just enter TELNET <ip-addr> • Example • C:\> TELNET <IP-address/DNS-name> • Supports 200 line x 300 char displays • Allows viewing busiest processes on 1000s of Cpus or 100s of Nodes in single screen • RPM example to right shows busiest Cpus and Processes across many Cpus/Nodes • How to create Windows TELNET shortcuts for NonStop servers see:http://NonstopRPM.com/Faqs.htm
RPM – TTY mode Interface - Example RPM TTY Mode - ExampleP \* TTY, ByNode, RATE 10, Entries 15, USEC • RPM supports TTY mode allowing output to generic devices, processes, and files • RPM USEC option displays Time-of-Dayin Micro-Seconds • Nodes in TimeSync • Wine in different Zone • 100ms delta is I/O time • RPM supports both NSK and OSSfile and path names
RPM Web Resources • HP NonStop RPM - Technical Portal http://NonstopRPM.com/ • RPM - Examples http://NonstopRPM.com/Examples.htm • RPM FAQS - Frequently Asked Questions http://NonstopRPM.com/Faqs.htm • RPM Documents http://NonstopRPM.com/Document.htm • RPM Downloads http://NonstopRPM.com/Download.htm