Evaluation of Agent Teamwork A High Performance Distributed Computing Middleware
This project focused on evaluating the performance of Agent Teamwork as middleware in high-performance distributed computing environments. Key objectives included comparing job dispatch and termination performance against contemporary alternatives, building a reference platform, and developing benchmark programs. The methodology involved executing tasks across multiple nodes using the MPICH-G2 framework while addressing challenges like machine access, configuration issues, and debugging. The project yielded significant insights into the framework's functionality and highlighted skills developed in working with globus, openPBS, and MPI programming.
Evaluation of Agent Teamwork A High Performance Distributed Computing Middleware
E N D
Presentation Transcript
Evaluation of Agent TeamworkA High Performance Distributed Computing Middleware Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007
What is Agent Teamwork? • HPDC Middleware • Job Dispatch & Termination • Programming Framework • Under Ongoing Development
Project Objectives • Evaluate Agent Teamwork’s performance against a contemporary alternative • Job Dispatch & Termination Performance • Framework Performance • Build a Reference Platform • Write 3 benchmark programs that exercise the framework
Job Dispatch & Termination Performance Evaluation • Globus Based Reference Platform • Globus Toolkit • OpenPBS scheduler • MPICH-G2
Reference Platform Challenges • Administrator Access to Machines • Host Config & Cryptic Error Messages • DNS vs hosts files • Inconsistent hosts files • Inconsistent ptr records • Inconsistent port acls • : globus_init: failed • GTK Authentication
Debugging • Strace • TcpDump • GDB
Job Dispatching and Termination Function Evaluation • Not evaluating the job execution performance • Methodology • Ported available test program to the MPICH-G2 framework • measure how long it takes a job submission to be deployed, executed and cleaned up • Run with 2-64 nodes across the two clusters in a depth-first node distribution series and a breadth-first node distribution series
Framework Function Evaluation • Framework Issues • Agent Teamwork MPI implementation • MPICH-G2 C++ • MPIJava • MPI Framework • Communication functions • Initialization, Barrier, Broadcast, Gather, Scatter, etc. • Goal to write 3 benchmark programs that have communication intensive algorithms.
Benchmark Programs • MD - a molecular dynamics simulation • Wave2D - a wave dissemination simulation • Mandelbrot - a Mandelbrot generator • Code each program twice
Agent Teamwork Programming • Snapshots • Programming model • func_n int func_0 (String[] Args){ … return 1; } int func_1 () { … } • Code Maturity
Future Work • Framework debugging • Develop a pre-processor to convert conventionally programmed code into the snapshot-able func_n model
Skills Developed During Project • Significant experience with globus, openPBS and the mpi • Extensive debugging with tcpdump, strace, and gdb • experience with performance analysis and writing MPI programs • new insights and understanding of HPDC