An update on the NPACI HotPage from the National Partnership for Advanced Computational Infrastructure (NPACI), a collaboration funded by the US National Science Foundation among research institutions including the San Diego Supercomputer Center (SDSC), the Texas Advanced Computing Center (TACC), and the Center for Advanced Computing (CAC) at the University of Michigan. The HotPage has moved to Apache on Linux, its underlying GridPort has been updated to 2.3, and status data is now stored in and queried from the GridPort Information Repository (GP-IR). The partner resources described here total 1854 CPUs and 7.7 petabytes of tape storage.
NPACI HotPage Updates • Kurt Mueller • San Diego Supercomputer Center • kurt@sdsc.edu
NPACI: National Partnership for Advanced Computational Infrastructure • collaboration among research institutions, funded by the US National Science Foundation: • San Diego Supercomputer Center (SDSC) - University of California, San Diego • Texas Advanced Computing Center (TACC) - University of Texas, Austin • Center for Advanced Computing (CAC) - University of Michigan • 46 other institutions...
NPACI resources • compute • SDSC Blue Horizon - 1152 CPU IBM SP • TACC Longhorn - 224 CPU IBM Power4 • TACC Aurora - 16 CPU Cray SV1 • CAC Morpheus, Hypnos - 256, 134 CPU AMD clusters • CAC 64 and 24 CPU IBM SP systems
NPACI resources • storage • SDSC HPSS - 6 petabytes • TACC DMF - 1.4 petabytes • CAC ADSM - 41 terabytes • Caltech HPSS - 323 terabytes
aggregate resources • 463 nodes • 1854 CPUs • 4345 peak GFlops • 1714 GB RAM • 7.7 petabytes tape storage
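As a quick cross-check, the aggregate tape figure can be reproduced from the four archive sizes on the storage slide. This is only a sketch: it assumes decimal units (1 petabyte = 1000 terabytes), which the slides do not state, and the dictionary keys are labels chosen here for illustration.

```python
# Archive capacities from the storage slide, in terabytes.
# Assumption: 1 PB = 1000 TB (the slides do not say which convention is used).
ARCHIVES_TB = {
    "SDSC HPSS": 6000,     # 6 petabytes
    "TACC DMF": 1400,      # 1.4 petabytes
    "CAC ADSM": 41,        # 41 terabytes
    "Caltech HPSS": 323,   # 323 terabytes
}

total_tb = sum(ARCHIVES_TB.values())
total_pb = total_tb / 1000

print(f"{total_tb} TB = {total_pb:.3f} PB")
```

Under that assumption the sum is 7764 TB, in line with the 7.7 petabytes quoted on the aggregate slide.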
HotPage updates • moved to Apache on Linux • underlying GridPort updated to 2.3 • object-oriented Perl • module packaging • simplified installation • new features: NWS (Network Weather Service), etc.
data collection • old methods of data collection: • cron jobs on webserver, triggering ssh commands to execute remote scripts on resources and store data on webserver • Globus job run to execute remote scripts • GIS queries for status data
data collection • problems with old methods: • some scripts generated LDIF, some HTML • no separation of content and presentation • GIS support inconsistent across partner sites • no persistent storage of data, so no historical record
GP-IR data repository • GridPort Information Repository • developed at TACC, with help from SDSC • XML-centric database to store performance and other data • web services data ingester and query interface
GP-IR data example
<?xml version="1.0" encoding="UTF-8" ?>
<loadInfo xmlns="http://grids.tacc.utexas.edu/schemas/infoservices/load"
          xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
          xsi:schemaLocation="http://grids.tacc.utexas.edu/schemas/infoservices/load/load.xsd"
          timestamp="2001-12-17T09:30:47-05:00"
          hostname="horizon.sdsc.edu">
  <load>
    <type>queue</type>
    <Percent>87</Percent>
  </load>
</loadInfo>
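A client consuming a document like the one above has to account for the default XML namespace on `loadInfo`. The following is a minimal sketch using Python's standard `xml.etree.ElementTree`; it is not HotPage or GridPort code (those are written in Perl), and the function name `parse_load_info` and the shape of the returned dictionary are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Namespace URI taken from the example document; the "l" prefix is arbitrary.
LOAD_NS = {"l": "http://grids.tacc.utexas.edu/schemas/infoservices/load"}

# The example document from the slide, as bytes so the encoding
# declaration is honoured by the parser.
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8" ?>
<loadInfo xmlns="http://grids.tacc.utexas.edu/schemas/infoservices/load"
          xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
          xsi:schemaLocation="http://grids.tacc.utexas.edu/schemas/infoservices/load/load.xsd"
          timestamp="2001-12-17T09:30:47-05:00"
          hostname="horizon.sdsc.edu">
  <load>
    <type>queue</type>
    <Percent>87</Percent>
  </load>
</loadInfo>
"""


def parse_load_info(xml_bytes):
    """Extract hostname, timestamp and load entries (hypothetical helper)."""
    root = ET.fromstring(xml_bytes)
    loads = []
    # Child elements inherit the default namespace, so they must be
    # looked up with a namespace prefix.
    for load in root.findall("l:load", LOAD_NS):
        loads.append({
            "type": load.findtext("l:type", namespaces=LOAD_NS),
            "percent": int(load.findtext("l:Percent", namespaces=LOAD_NS)),
        })
    # Unprefixed attributes are not in any namespace.
    return {
        "hostname": root.get("hostname"),
        "timestamp": root.get("timestamp"),
        "loads": loads,
    }


info = parse_load_info(SAMPLE)
print(info["hostname"], info["loads"])
```

Note that the element names are case-sensitive: the sample uses `type` in lower case but `Percent` capitalized, and the lookups have to match exactly.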
GP-IR data caching • cron jobs run on HotPage server to query GP-IR every five minutes and write out XML to cache directories • HotPage page requests generate live queries to GP-IR • if live queries fail, HotPage scripts load cached XML
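The live-query-with-cached-fallback pattern described above can be sketched as follows. This is an illustration in Python, not the HotPage implementation: the function names, the one-file-per-resource cache layout, and the `live_query` callable (standing in for whatever client actually talks to GP-IR) are all assumptions.

```python
from pathlib import Path


def refresh_cache(resources, live_query, cache_dir):
    """Periodic job (e.g. run from cron): snapshot each resource's XML."""
    cache_dir = Path(cache_dir)
    cache_dir.mkdir(parents=True, exist_ok=True)
    for resource in resources:
        try:
            xml_text = live_query(resource)
        except OSError:
            continue  # query failed: keep the previous snapshot
        # Write to a temporary file and rename it, so a page request
        # never reads a half-written cache file.
        tmp = cache_dir / f"{resource}.xml.tmp"
        tmp.write_text(xml_text, encoding="utf-8")
        tmp.replace(cache_dir / f"{resource}.xml")


def load_status(resource, live_query, cache_dir):
    """Page request: try a live query first, fall back to cached XML.

    Returns (xml_text, source), where source is "live" or "cache".
    """
    try:
        return live_query(resource), "live"
    except OSError:
        cached = Path(cache_dir) / f"{resource}.xml"
        return cached.read_text(encoding="utf-8"), "cache"
```

Only `OSError` and its subclasses (connection failures, timeouts) are treated as a failed query here; a real client would need to decide which of its own error types should trigger the fallback. Returning the source alongside the XML lets the page indicate when it is showing cached data.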
grid view • summary table • grid monitor • NWS
to do • historical performance queries • portlets - presentation this week • customization, portability • GridPort 3.0 - ditto • OGSA • HotPage source code distribution - this week? • rollout technologies to PACI HotPage • NPACI, NCSA and PSC resources
links • https://hotpage.npaci.edu/ • http://www.npaci.edu • http://www.sdsc.edu • http://www.tacc.utexas.edu • GridPort http://gridport.npaci.edu • GP-IR http://www.tacc.utexas.edu/grid/gpir/