1 / 26

HP NonStop – R P M R eal-time P rocess M onitor Product Overview Release 1.2 update

HP NonStop – R P M R eal-time P rocess M onitor Product Overview Release 1.2 update. HP NonStop Enterprise Division.

erling
Télécharger la présentation

HP NonStop – R P M R eal-time P rocess M onitor Product Overview Release 1.2 update

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. HP NonStop – RPMReal-time Process Monitor Product OverviewRelease 1.2 update HP NonStop Enterprise Division

  2. Process Cpu,Pin RcvQ% Name RPM Busy Programs -------- ------- ----- ------ ------------------\LONDON 9,281  77.46 $SRV9 $DATA2.APP.DBSERVER11:30:00 1,237 63.12 $SRV7 $DATA2.APP.DBSERVER 2,405 51.23 $SRV5 $DATA2.APP.DBSERVER 3,112 19.37 $SRV3 $DATA2.APP.DBSERVER 0,489 11.84 $SRV2 $DATA2.APP.DBSERVER 2,961 8.09 $SRV4 $DATA2.APP.DBSERVER 3,290 5.12 $SRV8 $DATA2.APP.DBSERVER 0,506 4.11 $SRV6 $DATA2.APP.DBSERVER 1,871 3.12 $SRV0 $DATA2.APP.DBSERVER 0,315 0.50 $SRV1 $DATA2.APP.DBSERVER Color coded alert thresholds:Red- CriticalYellow- Warning Blue- Informative HP NonStop Real-time Process Monitor – Overview • RPM – Real-time Process Monitor • Low-cost point-product providesultra-fast, real-time, discovery and display of busiest Cpus, Processes, and network Nodes • Can monitor a Single Cpu, orSelected nodes, or an entireNetwork of over 4000 Cpus • Update 2 provides many new sorting and discovery options... by Cpu, Busy, Inputs, I/Os, Outputs, Memory, Node, Page Faults, Process File Segment, Queue Length, ... • Supports many device-types including TTY, ANSI, VT100, T6530, ... as well as data streaming to disk and process • Fast low-overhead display of core Cpu and Process metrics

  3. RPM – Benefit Overview • Continuously finds busiest Cpus & Processes • Simple and easy to use: TACL 1 > RPM • Ultra-fast startup and display in under 1 second • Ultra-low overhead statistics data-stream more efficient than XML, Java, HTML constructs • Supports wide-range of different device types • User defined thresholds and color-encoded statistics • Critical - Red • Warning – Yellow • Informative – Blue

  4. RPM 1.2 Overview – supported systems • RPM update #2 ( also known as RPM 1.2 )Runs on D-, G-, H-, J-series operating systems • RPM 1.2 - Product ID, OS, T-number, SPR-id • QRPM01V1– RPM for J-series T0877H01^AAC • HRPM01V1– RPM for H-series T0877H01^AAC • RPM01V1 – RPM for D-,G-series T0877V01^AAD • Update #2 new features include: • New By item metric selection and sorting options • New short term and long term display options • Extreme performance enhancements • New command history options

  5. RPM update 2 – New Feature summary • Update 2 – Driven by customer Requests For Enhancement • RFE – Provide 10 new BY item features- Result 10 new By item sorting and selection criteria • RFE – Provide 4 new display options- Result simultaneous viewing of short term, long term, processor and process life-time statistics. • RFE – Provide Command enhancements- Result New History, Execute !, and Fix commands • RFE – Provide Extreme Performance Enhancements- Result ultra-light real-time monitoring of over 4000+ processors

  6. RPM update 2 – Performance enhancements • Update 2 performance enhancements: • Message overhead reduced 100 – 1000x • Cpu overhead reduced 10 – 20x • Response time 10 – 20x faster • Result ultra-light, scalable monitoring of up to 40,000,000 processes • RPM 1.2 performance enhancements were result of using RPM to analyze and tune its own performance.

  7. RPM update 2 – BY item enhancements • Many new selection and sorting options added: • By Busy – process selection and sorting by Cpu busy • By Cpu – processes grouped by Cpu, by node, and by item • By Inputs – process selection and sorting by most inputs • By IOs– process selection and sorting by most IOs • By Outputs – process selection and sorting by Outputs • By Memory– process selection and sorting by memory • By Node– processes grouped by Node • By PFS– process selection and sorting by PFS usage • By Q– process selection and sorting by receive queue • By Swaps– process selection and sorting by page faults

  8. By Memory Process By PFS By Inputs By Busy By Outputs By IOs By Q By Swaps Swap Space RPM update 2 – Process Metric Anatomy By Inputs– process select and sort by most Inputs By Memory– process select and sort by most Memory By Busy – process select and sort by most Busy By Outputs – process select and sort by most Outputs By IOs – process select and sort by most I/Os By PFS– process select and sort by most PFS usage By Q– process select and sort by most receive Queue By Swaps– process select and sort by page Faults Process Cpu,Pin RcvQ% Name RPM Busy Programs -------- ------- ----- ------ ------------------\LONDON 9,281  77.46 $SRV9 $DATA2.APP.DBSERVER11:30:00 1,237 63.12 $SRV7 $DATA2.APP.DBSERVER 2,405 51.23 $SRV5 $DATA2.APP.DBSERVER 3,112 19.37 $SRV3 $DATA2.APP.DBSERVER 0,489 11.84 $SRV2 $DATA2.APP.DBSERVER 2,961 8.09 $SRV4 $DATA2.APP.DBSERVER 3,290 5.12 $SRV8 $DATA2.APP.DBSERVER 0,506 4.11 $SRV6 $DATA2.APP.DBSERVER 1,871 3.12 $SRV0 $DATA2.APP.DBSERVER 0,315 0.50 $SRV1 $DATA2.APP.DBSERVER

  9. RPM update 2 – Displayenhancements • New Display options: • Simultaneous views of short-term and long-term statistics • Short-term views down to 1 second intervals • Long-term views of life-cycle statistics over Elapsed Time since Cpu Load and Elapsed Time since Process launch • EToption shows process short term stats (seconds) and long term stats (across Elapsed Time of process) • ETPCT shows short and long term stats as a percentage • ETALLshows short and long term stats in extreme detailfor both Cpus and Processes (wide zoom view) • DATE shows year/month/day hh:mm started/loaded

  10. RPM update 2 – Command enhancements • New HISTORY, FC, ! command options added • HISTORY - Displays history of commands • HI- Same as HISTORY command • HI 30 – Show 30 most recent commands • FC– Fix of most recent command • FC <number> - Fix command <number> • FC <target> - Fix command starting with <target> • ! – execute most recent command • ! <number> - execute command <number> • ! <target> - execute command <target>

  11. RPM Applicability • RPM was developed, evolved, and refined over multi-year period in HP Labs • Applicable regardless of what products or applications currently used • Runs on multiple nodes, multiple operating-systems, and multiple device-types • Supports • J-, H-, G-, D-series - Operating Systems • Runs in mixed-version, multi-node, mixed-hardware networks • Monitors many Cpus and/or Nodes at once • Built based on long-term development requirement to understand real-time dynamics of software operating in large clusters of disparate NonStop servers • Provides ultra-fast, low-overhead, real-time, super-cluster monitoring capabilities • Customizable • RPM supports small, medium, and super-scalar display types: • From small 80x24 x16-Cpu displays • To medium 132x54 x64-Cpu displays • To giant 200x300 x4000-Cpu displays • Device Types • TTY, T6530, VT100, ANSI • Disk file output • Process file output

  12. RPM General Features • Understands both NSK and OSS file/path names • Install Wizard provides fast setup and configuration eliminates startup learning curve • Supports many different device types including: ANSI, VT100, T6530, TTY, Disk, and Process • Understands T6530 devices for support of legacy displays including 24x80 and 54x132 displays • Understands ANSI/VT100 Super-Sized200x300 character displays which are native (and free) on all Windows and Linux boxes • Supports generic TTY, Disk, Process I/O allowing smart client front-ends and historical stats archival

  13. RPM Analysis & Display Features • Real-time - analysis-and-display of Cpu and Process activity by metric, by Cpu, by node, by selected nodes, by cluster • ADD node - commands allow multiple node analysis-display • By Items– provide metric selection and sorting (new in 1.2) • ByCpu– displays busiest processes in one or more Cpus • ByNode - displays busiest processes in one or more nodes • Results– are sorted, filtered, and color-encoded in real-time • Statistics- synchronized across Cpus, Nodes, and Users (requires neither Measure nor Super) • Example: show busiest processes by Inputs, by Nodes • TACL 1 > RPM P \*, ByInputs, ByNode • Options provide wide-range of analysis and displays • Examples follow...

  14. TACL 1 > RPM PB 3, ENTRIES 10, RATE 5 Process Cpu,Pin Busy% Name RPM T0877 Programs ET= 5.0 Top Pri User -------- ------- ----- -------- -------------------------- --- --- ------- \A 3,73 87.82 $STEP $DATA.DATABASE.QUERY 1 1 66,1 11:32:10 3,320 10.13 $QAZ08 $DATA.DATABASE.QADRV1T 2 168 255,34 3,263 1.20 $HIT $SYSTEM.SYS03.TSYSDP2 3 220 255,255 3,319 .15 $QAZ07 $RPM.RPMQA.QADRV1T 4 168 255,34 3,314 .12 $QAZ06 $RPM.RPMQA.QADRV1T 5 168 255,34 3,175 .09 $ZOOH3 $DATA2.R0403V02.ASAPXMON 6 189 255,255 3,0 .06 $MON $SYSTEM.SYS03.OSIMAGE 7 201 255,255 3,192 .04 $X11W $SYSTEM.SYSTEM.RPMVT 8 168 66,1 3,43 .02 $ZOOM3 $SYSTEM.SYSTEM.ASAPMON 9 160 255,255 3,312 .02 $QAZ04 $RPM.RPMQA.QADRV1T 10 168 255,34 RPM Example #1 – Monitor all processes in one Cpu • PB 3, ENTRIES 10, RATE 5 • PB 3 – displays busiest processes in Cpu 3 • ENTRIES 10 – displays the 10 busiest processes • RATE 5 – updates every 5 seconds ( can sample down to 1 sec ) • Commands can be abbreviated, commas optional, for example:P3 E10 R5

  15. TACL 1 > RPM PB *, ENTRIES 10, RATE 5, ByNode Process Cpu,Pin Busy% Name RPM T0877 Programs ET= 5.0 Top Pri User -------- ------- ----- -------- -------------------------- --- --- ------- \A 3,73 66.24 $STEP $DATA.DATABASE.DRIVER 1 1 66,1 11:59:05 0,41 11.25 $ZEXP $DATA.DATABASE.QUERY 2 170 255,255 1,159 1.82 $Y7Q6 $SYSTEM.SYS03.NSKCOM 3 159 255,255 1,69 1.70 $ZOO $DATA.DATABASE.LISTREC 4 160 255,255 2,271 .58 $HIT $SYSTEM.SYS03.TSYSDP2 5 220 255,255 2,279 .52 $HIT $SYSTEM.SYS03.TSYSDP2 6 220 255,255 2,28 .18 $X90N $SYSTEM.SYSTEM.RPM 7 168 66,1 1,153 .31 $DATA1 $SYSTEM.SYS03.TSYSDP2 8 167 66,1 3,263 .31 $HIT $SYSTEM.SYS03.TSYSDP2 9 220 255,255 0,0 .30 $MON $SYSTEM.SYS03.OSIMAGE 10 201 255,255 RPM Example #2 – All processes in a node ByNode • PB *, ENTRIES 10, RATE 5, ByNode • PB * – displays Process Busy for all Cpus in a node • ENTRIES 10 – displays the 10 busiest processes • RATE 5 – updates every 5 seconds ( can sample down to 1 sec ) • ByNode – displays busiest processes for all Cpus in node & sorts by node

  16. Process Cpu,Pin Busy% Name RPM T0877(01MAR10) ET= 5.0 Top Pri User -------- ------- ----- -------- -------------------------- --- --- ------- \SANFRAN 0,177 71.17 $Y567 $DATA.DATABASE.DRIVER 1 200 255,255 11:30:20 0,1021 14.24 $Y568 $DATA.DATABASE.QUERY 2 201 255,255 0,256 5.12 $T569 $DATA.DATABASE.LISTREC 3 205 255,255 0,257 .10 $TST1 $DATA.DATABASE.QUERY 4 220 255,255 0,15 .08 $NCP $SYSTEM.SYS03.NCPOBJ 5 199 255,255 \SANFRAN 1,182 11.32 $Z121 $DATA1.ANALYSIS.TOOL 1 168 255,255 11:30:20 1,47 5.31 $Y3AL $DATA1.ANALYSIS.QUERY 2 168 66,1 1,14 .23 $NCP $SYSTEM.SYS03.NCPOBJ 3 199 255,255 1,0 .16 $MON $SYSTEM.SYS03.OSIMAGE 4 201 255,255 1,87 .13 $ZOOS $SYSTEM.SYSTEM.RPM 5 160 255,255 \SANFRAN 2,271 1.08 $HIT $SYSTEM.SYS03.TSYSDP2 1 220 255,255 11:30:20 2,178 .31 $Z147 $SYSTEM.SYSTEM.TEDIT 2 160 255,255 2,0 .04 $MON $SYSTEM.SYS03.OSIMAGE 3 201 255,255 2,303 .02 $TT54 $SYSTEM.SAPP.DRV 4 80 255,255 2,275 .01 $TT62 $SYSTEM.SAPP.DRV 5 80 255,255 \SANFRAN 3,324 3.37 $QAZ08 $DATA1.DATABASE.QADRV1T 1 168 255,34 11:30:20 3,263 .28 $HIT $SYSTEM.SYS03.TSYSDP2 2 220 255,255 3,323 .18 $QAZ07 $ASAP.ASAPXQA.QADRV1T 3 168 255,34 3,322 .12 $QAZ06 $ASAP.ASAPXQA.QADRV1T 4 168 255,34 3,0 .08 $MON $SYSTEM.SYS03.OSIMAGE 5 201 255,255 RPM Example #3 – All processes in a node ByCpu • ByCpu • Used for observing load across Cpus • Visual persistence allows viewing of changing load • PB *, Entries 5, Rate 5, ByCpu • PB * – displays Process Busy for all processes in all Cpus in this node • ENTRIES 5 – displays the 5 busiest processes in each Cpu • RATE 5 – updates every 5 seconds – integrate stats over that interval • ByCpu – displays busy processes grouped by Cpu

  17. TACL 1 > RPM PB \*, ENTRIES 5, RATE 5 Realtime Process Monitor (RPM) - T0877V01.00 - (15APR08) +ADD \A +ADD \B +ADD \C Process Cpu,Pin Busy% Name RPM T0877 Programs ET= 5.0 Top Pri User -------- ------- ----- -------- -------------------------- --- --- ------- \A 3,35 74.63 $SPI1 $DATA.DATABASE.DRIVER 1 1 66,1 12:07:05 1,134 51.82 $ZYOB $DATA.DATABASE.QUERY 2 160 255,255 1,175 19.19 $ZERX $DATA.DATBASE.LISTREC 3 167 66,1 1,42 .18 $Z23V $SYSTEM.SYSTEM.DRIVE 4 166 66,1 1,249 .17 $Z244 $SYSTEM.SYSTEM.TEST 5 166 66,1 \B 2,271 4.83 $HIT $SYSTEM.SYS03.TSYSDP2 1 220 255,255 12:07:05 1,254 2.44 $Z08N $SYSTEM.SYSTEM.TEST 2 167 66,1 1,153 .43 $X11Z $SYSTEM.SYSTEM.DRIVE 3 167 66,1 1,74 .18 $RPMX $SYSTEM.SYSTEM.SEEVIEW 4 168 66,1 3,320 .28 $QAZ08 $ASAP.ASAPXQA.QADRV1T 5 168 255,34 \C 0,290 3.76 $SQL $SYSTEM.SYS00.TSYSDP2 1 220 255,255 12:07:05 1,195 2.48 $ZOOL $SYSTEM.SYSTEM.ASAPFIL 2 160 255,255 1,0 1.82 $MON $SYSTEM.SYS00.OSIMAGE 3 201 255,255 0,0 .55 $MON $SYSTEM.SYS00.OSIMAGE 4 201 255,255 0,85 .07 $RPMX $SYSTEM.SYSTEM.SEEVIEW 5 168 66,1 RPM Example #4 – All processes on all nodes • ADD \Node – Adds as many \nodes as you want in RPM analysis and display • PB \*, Entries 5, Rate 5 • PB \* – \* indicates analysis and display is for all ADD-edNodes • ENTRIES 5 – displays the 5 busiest processes in each Node • RATE 5 – updates every 5 seconds – RPM can sample down to 1 second

  18. RPM Example #5 – Composite Cpu + Process reports TACL 1 > RPM ZOOM \*, ENTRIES 5, RATE 5 Realtime Process Monitor (RPM) - T0877V01.00 - (15APR08) +ADD \A +ADD \B +ADD \C Cpus Cp hh:mm ss Busy Secs QLen Disp Disk Chit Swap MLock% Pcb PcbX -------- -- -------- ---- ---- ---- ----- ----- ----- ---- ------ --- ---- \A 0 11:30:45 78 5 0 406 0 4 0 8.00 81 107 1 11:30:45 15 5 0 346 0 0 0 4.29 84 106 2 11:30:45 0 5 0 176 0 40 0 6.73 28 59 3 11:30:45 0 5 0 28 0 0 1 3.54 30 66 \B 0 11:30:45 9 5 0 145 0 15 4 12.51 37 96 1 11:30:45 0 5 0 114 0 0 0 12.32 48 92 2 11:30:45 0 5 0 85 0 0 0 8.32 18 41 3 11:30:45 0 5 0 84 0 0 0 8.33 21 39 \C 0 11:30:45 4 5 0 131 0 4 0 8.01 33 97 1 11:30:45 0 5 0 29 0 0 1 7.65 50 84 Process Cpu,Pin Busy% Name RPM T0877 Programs ET= 5.0 Top Pri User -------- ------- ----- -------- -------------------------- --- --- ------- \A 0,35 74.63 $SPI1 $DATA.DATABASE.DRIVER 1 1 66,1 11:30:45 1,134 11.82 $ZYOB $DATA.DATABASE.QUERY 2 160 255,255 1,175 .19 $ZERX $DATA.DATBASE.LISTREC 3 167 66,1 1,42 .18 $Z23V $SYSTEM.SYSTEM.DRIVE 4 166 66,1 1,249 .17 $Z244 $SYSTEM.SYSTEM.TEST 5 166 66,1 \B 0,271 4.83 $HIT $SYSTEM.SYS03.TSYSDP2 1 220 255,255 11:30:45 1,254 .34 $Z08N $SYSTEM.SYSTEM.TEST 2 167 66,1 1,153 .13 $X11Z $SYSTEM.SYSTEM.DRIVE 3 167 66,1 1,74 .08 $RPMX $SYSTEM.SYSTEM.SEEVIEW 4 168 66,1 3,320 .28 $QAZ08 $ASAP.ASAPXQA.QADRV1T 5 168 255,34 \C 0,290 2.79 $SQL $SYSTEM.SYS00.TSYSDP2 1 220 255,255 11:30:45 1,195 .18 $ZOOL $SYSTEM.SYSTEM.ASAPFIL 2 160 255,255 1,0 .62 $MON $SYSTEM.SYS00.OSIMAGE 3 201 255,255 0,0 .15 $MON $SYSTEM.SYS00.OSIMAGE 4 201 255,255 0,85 .07 $RPMX $SYSTEM.SYSTEM.SEEVIEW 5 168 66,1 • ZOOM \* – Displays both Cpu and Process stats in same display

  19. RPM Sample - RATE discussion • RPM - Easily and efficiently supports update rates down to 1 second • Fast 1 Second rate possible due to low-overhead linear-scalable design • Default is 10 seconds because rapid time varying data difficult for some people to follow • When RPM RATE is 10 seconds the digits to right of %Busy decimal point %0.NN represent Cpu milliseconds used. No other RATE provides this numerical convenience: • Milliseconds SET RATE Percent Cpu Busy • 100ms 10 seconds 1.00% Cpu Busy • 90ms 10 seconds 0.90% Cpu Busy • 80ms 10 seconds 0.80% Cpu Busy • 70ms 10 seconds 0.70% Cpu Busy • 60ms 10 seconds 0.60% Cpu Busy • ... ... ...

  20. RPM Installation – Made easy using RPM Wizard • RPM includes a configuration/install wizard: TACL 1 > RUN RPMWIZ • Wizard interviews user and builds configuration • Enter TYPE of device VT100 | T6530 | TTY ? • Enter RATE interval in seconds 2, 3, 6, [10], .. ? • Enter ENTRIES to display per Cpu or Node [3], 5, 10, ... ? • Enter report Type Process | Cpu | Zoom ? • Enter whether you want to sort by Cpu or [Node] ? • Enter whether you want USEC time displayed Y/[N] ? • Enter %BUSY threshold value [0] ? • Enter INFO Busy% threshold value for INFO [1]  ? • Enter WARN Busy% threshold value for WARN [10] ? • Enter CRIT Busy% threshold value for CRIT [50] ? • ADD \sysname ? \A • ADD \sysname ? \B • ADD \sysname ? \C • ADD \sysname ? This slide shows only partial wizard interview,complete example of a Wizard session is at:http://NonstopRPM.com/Example-Wizard.htm

  21. RPM Installation – Use RPM Wizard • At End of Interview, the wizard creates RPMCNF configuration file !====================================================== !== RPM Config created by RPMWIZ - 08/02/22 07:20:08 !====================================================== SET TERM VT100 SET RATE 10 SET ENTRIES 10 SET SORT BYNODE SET USEC OFF SET CRIT 50 SET WARN 10 SET INFO 1 ADD \CENTDIV ADD \SOLAR ADD \WINE !====================================================== SAVE to $DATA.RPMV1.RPMCNF Y/N ? Y $DATA.RPMV1.RPMCNF save started. $DATA.RPMV1.RPMCNF saved. Alert Thresholds

  22. RPM – Interfaces • RPM Supports wide-variety of device types • TTY • ANSI • VT100 (super sized) • T6530 (super sized) • File output • Process output

  23. RPM – T6530 Device Interface - Example T6530 – ExampleP \*, ByNode, RATE 10, Entries 7 • RPM supports T6530 Devices and emulators • Example • RPM 6530 video high lights >> • P\*R10 E7Analyzes 3 nodes, shows 7 busiest processes per node • RATE 10 aka R10 means update every 10 seconds

  24. RPM – Super-Size ANSI displays • RPM ANSI/VT100 device support: • ANSI represents important class of display with very significant benefits: • ANSI/VT100 provides ultra-high performance Super-Sized displays. • No need for emulator on Windows or Linux. VT100 is natively supported on all Windows or Linux computers. • ANSI/VT100 natively supported on virtually ANY Windows PC. From Windows just enter TELNET <ip-addr> • Example • C:\> TELNET <IP-address/DNS-name> • Supports 200 line x 300 char displays • Allows viewing busiest processes on 1000s of Cpus or 100s of Nodes in single screen • RPM example to right shows busiest Cpus and Processes across many Cpus/Nodes • How to create Windows TELNET shortcuts for NonStop servers see:http://NonstopRPM.com/Faqs.htm

  25. RPM – TTY mode Interface - Example RPM TTY Mode - ExampleP \* TTY, ByNode, RATE 10, Entries 15, USEC • RPM supports TTY mode allowing output to generic devices, processes, and files • RPM USEC option displays Time-of-Dayin Micro-Seconds • Nodes in TimeSync • Wine in different Zone • 100ms delta is I/O time • RPM supports both NSK and OSSfile and path names

  26. RPM Web Resources • HP NonStop RPM - Technical Portal http://NonstopRPM.com/ • RPM - Examples http://NonstopRPM.com/Examples.htm • RPM FAQS - Frequently Asked Questions http://NonstopRPM.com/Faqs.htm • RPM Documents http://NonstopRPM.com/Document.htm • RPM Downloads http://NonstopRPM.com/Download.htm

More Related