180 likes | 195 Vues
HP-MPI. Changqing Tang Senior Engineer Apr. 8, 2008. Agenda. HP-MPI in general OFED support and features Wish list for OFED. Molpro University of Cardiff. ISVs Distributing HP-MPI on Industry-Standard Linux Clusters. Supported Operating Systems. Supported Architectures.
E N D
HP-MPI Changqing Tang Senior Engineer Apr. 8, 2008
Agenda • HP-MPI in general • OFED support and features • Wish list for OFED. HP Confidential
Molpro University of Cardiff ISVs Distributing HP-MPI on Industry-Standard Linux Clusters Supported Operating Systems Supported Architectures Supported Switches HP-MPI: Industry’s Standard Commercial High Performance MPI • Portable, Robust, Fast, Supported, Available • vendor independent MPI optimized for all leading architectures, OSs, switches, applications (see below) • Ideal for Customers • freely distributed by >25 ISVs for industry-standard Linux clusters; for WindowCCS in 2007 • for developers on in-house applications, HP-MPI licenses are included in XC operating environment • Ideal for Independent Software Vendors (ISVs) • reduces platform test matrix; reduces customer complaints; supported by HP experts HP Confidential
Interconnect Hardware Supported • Quadrics QsNet • Myrinet 2G/10G • Infiniband • 10 Gigabit Ethernet HP Confidential
Interconnect Protocol Supported • SHM – Shared-Memory Communication Within Node • TCP – TCP/IP On Every System, IPoIB, etc. • ELAN – Quadrics elan3 & elan4 • GM – legacy message-passing system for Myrinet networks • MX – Myrinet Express • PSM – Pathscale Message on Qlogic IB hardware • VAPI – Mellanox Infiniband Verbs API • IBV – OFED Infiniband Verbs • UDAPL – User Direct Access Programming Library • ITAPI – Interconnect Transport API (HP-UX only) • IBAL – Infiniband Access Layer (Windows only) HP Confidential
HP-MPI Infiniband History • 2.0 -- IT-API Infiniband on HP-UX • 2.1 -- VAPI/uDAPL on Linux Infiniband • 2.2.5 – OFED 1.0 and OFED 1.1 • 2.2.5.1 – OFED 1.2 plus 1.1 and 1.0, work with 1.2.5 and 1.3 • 2.2.7 – Come out this Apr., Support all OFED releases and all major features. • 2.3 – sync with OFED 1.4 HP Confidential
Message Transfer Protocols • RDMA Protocol • Send/Receive Protocol • Shared-Receiving-Queue (SRQ) Protocol HP Confidential
Misc. Features • Connection Fail[over] • Automatic Path Migration (APM) • Message Striping • IB Partitioning • Message Coalescing in 2.2.7 • SHM/NIC/MIX mode within node HP Confidential
Misc. Features (cont.) • CPU Binding – Rank, Cyclic, … • XRC support in 2.2.7 • uDAPL 2.0 support in 2.2.7 (1.1 and 1.2 as well) • iWARP support via uDAPL protocol • One-side operation using RDMA write/read • Distributed lock/unlock using IB atomic operations HP Confidential
Misc. Features (cont.) • Dynamic processes over OFED • Singleton MPI Process • HA HP-MPI, where no teardown if partial ranks exit HP Confidential
Features Available Soon in 2.3 Release • Anyway, OFED 1.4 • iWARP support via rdma_cm + verbs • On-demand connection • IPV6 support • Multi-path IB HP Confidential
Binary Compatible • HP-MPI is not open source, and always dlopen() libibverbs.so • We already have 1.0 and 1.1 code compiled against each version of header file • We detect the version on system and use the appropriate code version HP Confidential
Reliable Datagram (RD) • UD with complicated logic to implement on-demand connection. • RD will simplify MPI code • RDS can benefit from it • Sure there are other wide-range usage. HP Confidential
Reliable Multi-cast • Simplify MPI collective implementation • Improve MPI collective performance HP Confidential
SA/OpenSM feature • A rank can query the port lid on remote node, with knowing the GUID of the remote HCA • Communication path query • Path Recovery Notification • Use to implement network recovery in MPI HP Confidential
Multi-fabrics support • Used in non-stop system • HP-MPI needs different subnets • IP-bonding not work on two subnets ? SW2 SW1 n7 n6 n3 n4 n2 n5 n1 n0 HP Confidential
Thank You ! • http://www.hp.com/go/mpi • Question ? HP Confidential