Enhancing MPI Intra-node Communication for Improved Performance in Multicore and Manycore Systems

Smart MPI Intra-node Communication among Multicore and Manycore Machines
Teng Ma, George Bosilca, Aurelien Bouteiller and Jack J. Dongarra

• MPI intra-node communication faces a major challenge from increasingly complex multicore/manycore architectures: more cores, deeper memory hierarchies and more complex interconnects.
• Because each node has a large number of cores, and MPI users favor the one-process-per-core approach, improving intra-node communication has a significant effect on application performance.

Multi-tuning framework
• Hwloc: detects the communication pattern at runtime (http://www.open-mpi.org/projects/hwloc/)
• Rule table: finds the best set of communication parameters (OTPO or models)
• Runtime parameter setting

Kernel assisted collective: KNEM coll
• KNEM: a kernel copy module (http://runtime.bordeaux.inria.fr/knem/)
• No intermediate shared memory buffer
• Single copy between processes
• Memory copies are offloaded to the non-root processes to avoid sequential copies at the root process

Fig 1. Bandwidth of ping-pong test for vanilla MPICH2, vanilla OpenMPI and multi-tuning OpenMPI. Panels: (a) Tigerton inter-socket, (b) Nehalem inter-socket, (c) Tigerton intra-socket, (d) Nehalem intra-socket.

Fig 2. Performance comparison of Broadcast Operations between shared memory based modules (Basic, SM and Tuned) and KNEM coll, normalized to the Basic module runtime (lower is better). Panels: (a) Tigerton, (b) Nehalem EP, (c) Nehalem EX, (d) Istanbul.
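The rule-table step of the multi-tuning framework can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Params` fields, the `RULES` entries, the byte thresholds and the `lookup` function are hypothetical and are not taken from Open MPI, OTPO or the poster. Only the two locality labels (intra-socket, inter-socket) come from the figure panels. The idea shown is that a locality detected at runtime, plus the message size, selects a parameter set from a table filled in ahead of time.

```python
from bisect import bisect_right
from typing import NamedTuple


class Params(NamedTuple):
    """Hypothetical communication parameter set (names are illustrative only)."""
    copy_method: str     # "shared-buffer" (double copy) or "kernel-copy" (single copy)
    fragment_bytes: int  # assumed pipeline fragment size


# Hypothetical rule table. For each locality, rows are sorted by the minimum
# message size (in bytes) from which the parameter set applies. The numbers
# are placeholders, not measured values.
RULES = {
    "intra-socket": [
        (0, Params("shared-buffer", 4096)),
        (16384, Params("kernel-copy", 65536)),
    ],
    "inter-socket": [
        (0, Params("shared-buffer", 4096)),
        (4096, Params("kernel-copy", 32768)),
    ],
}


def lookup(locality: str, message_bytes: int) -> Params:
    """Return the parameter set whose size threshold covers message_bytes."""
    if message_bytes < 0:
        raise ValueError("message_bytes must be non-negative")
    rows = RULES[locality]
    thresholds = [threshold for threshold, _ in rows]
    # Index of the last row whose threshold is <= message_bytes.
    index = bisect_right(thresholds, message_bytes) - 1
    return rows[index][1]


if __name__ == "__main__":
    for locality in ("intra-socket", "inter-socket"):
        for size in (1024, 8192, 1 << 20):
            print(locality, size, lookup(locality, size))
```

In a real setting the locality would come from a topology library such as hwloc and the table entries from a tuning tool or a performance model, as the bullets describe; here both are hard-coded so the sketch stays self-contained.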
