
PVM : Parallel Virtual Machine “ The poor man ’ s super-computer ”


Presentation Transcript


  1. PVM : Parallel Virtual Machine “The poor man’s super-computer” Yvon Kermarrec. Based on http://www.pcc.qub.ac.uk/tec/courses/pvm/

  2. PVM • “PVM is a software package that permits a heterogeneous collection of serial, parallel and vector computers connected to a network to appear as one large computing resource.” • PVM combines the power of a number of computers • PVM may be used to create a virtual computer from multiple supercomputers, enabling the solution of previously unsolvable "grand challenge" problems • It is a poor man’s supercomputer, allowing one to exploit the aggregate power of unused workstations during off-hours • PVM may be used as an educational tool to create a parallel computer from low cost workstations

  3. PVM Key features • Easily obtainable public domain package • Easy to install and to configure • Many different virtual machines may co-exist on the same hardware • Program development using a widely adopted message passing library • Supports C / C++ and Fortran • Installation only requires a few Mb of disk space • Simple migration path to MPI

  4. PVM history • PVM project began in 1989 at Oak Ridge National Laboratory - prototype PVM 1.0 – US department of Energy • Version 2.0 was written at the University of Tennessee and released in March 1991 • PVM 2.1 -> 2.4 evolved as a response to user feedback • Version 3 was completed in February 1993 • The current version is 3.3 and subsequent releases are expected

  5. Parallel processing • Parallel processing is the method of using many small tasks to solve one large problem. • Two major developments have assisted the acceptance of parallel processing: • massively parallel processors (MPPs) • the widespread use of distributed computing - the process whereby a set of computers connected by a network is used collectively to solve a single large problem • Common to both is the notion of message-passing - in all parallel processing, data must be exchanged between co-operating tasks.

  6. PVM • Parallel Virtual Machine (PVM) uses the message passing model to allow programmers to exploit distributed computing across a wide variety of computer types, including MPPs. • When a programmer wishes to exploit a collection of networked computers, they may have to contend with several different types of heterogeneity: • architecture • data format • computational speed • machine load and • network load.

  7. PVM benefits (1/2) • Using existing hardware keeps costs low • Performance can be optimised by assigning each individual task to the most appropriate architecture • Exploitation of the heterogeneous nature of a computation ie provides access to different types of processors for those parts of an application that can only run on a certain platform • Virtual computer resources can grow in stages and take advantage of the latest computational and network technologies • Program development can be enhanced by using a familiar environment ie editors, compilers, debuggers that are available on individual machines

  8. PVM benefits (2/2) • Individual computers and workstations are usually stable and substantial expertise in their use should be available • User-level or program-level fault tolerance can be implemented with little effort either in the application or in the underlying operating system • Facilitates collaborative work

  9. PVM overview • PVM provides a unified framework within which parallel programs can be developed efficiently using existing hardware. • PVM transparently handles all message routing, data conversion, and task scheduling across a network of incompatible computer architectures. • The programming interface is straightforward allowing simple program structures to be implemented in an intuitive manner. • The user writes his application as a collection of co-operating tasks which access PVM resources through a library of standard interface routines.

  10. Underlying principles (1/2) • User-configured host pool: • tasks execute on a set of machines selected by the user for a given run of the PVM program • the host pool may be altered by adding/deleting machines during operation • Translucent access to hardware: • programs may view the hardware environment as an attributeless collection of virtual processing elements or • may choose to exploit the capabilities of specific machines by placing certain tasks on certain machines.

  11. Underlying principles (2/2) • Process-based computation: • A task is the unit of parallelism in PVM - it is an independent sequential thread of control that alternates between communication and computation. • Explicit message-passing model: • Tasks co-operate by explicitly sending and receiving messages to and from one another. • Message size is limited only by the amount of available memory. • Heterogeneity support: • PVM supports heterogeneity in terms of machines, networks and applications. • Messages can also contain one or more data types to be exchanged between machines having different data representations.

  12. PVM supported approaches • Functional Parallelism - an application can be parallelised along its functions so that each task performs a different function eg input, problem setup, solution, output and display. • Data Parallelism - all the tasks are the same but each one only knows and solves a part of the data. This is also known as SPMD (single program, multiple data).

  13. PVM programming paradigm • A user writes one or more sequential programs in C, C++ or Fortran 77 containing embedded calls to the PVM library. • Each program corresponds to a task making up the application. • The programs are compiled for each architecture in the host pool and the resulting object files are placed at a location accessible from machines in the host pool • An application is executed when the user starts one copy of the application ie the `master' or `initiating' task, by hand from a machine within the host pool.

  14. PVM programming paradigm • The `master' process subsequently starts other PVM tasks; eventually there are a number of active tasks that compute and communicate to solve the problem. • Tasks may interact through explicit message-passing, using the system-assigned opaque TID to identify each other. • Finally, once the tasks are finished, they and the `master' task disassociate themselves from PVM by exiting from the PVM system. A minimal sketch of this structure follows.
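
A minimal sketch in C of the master/worker structure described above. The executable name "worker", the message tag 1 and the task count of 4 are illustrative assumptions, not part of the original slides.

/* master.c - spawns workers and collects one result from each.
   "worker" and tag 1 are illustrative assumptions. */
#include <stdio.h>
#include "pvm3.h"

int main(void)
{
    int tids[4], i, n, numt;

    pvm_mytid();                           /* enrol this task in PVM       */
    numt = pvm_spawn("worker", NULL, PvmTaskDefault, "", 4, tids);

    for (i = 0; i < numt; i++) {           /* collect one result each      */
        pvm_recv(-1, 1);                   /* blocking receive, any sender */
        pvm_upkint(&n, 1, 1);
        printf("received %d\n", n);
    }
    pvm_exit();                            /* disassociate from PVM        */
    return 0;
}

/* worker.c - computes a value and reports it back to the master */
#include "pvm3.h"

int main(void)
{
    int result = pvm_mytid() % 100;        /* stand-in for real work       */

    pvm_initsend(PvmDataDefault);          /* XDR encoding, safe default   */
    pvm_pkint(&result, 1, 1);
    pvm_send(pvm_parent(), 1);             /* tag 1 matches the master     */
    pvm_exit();
    return 0;
}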

  15. Parallel models (1/3) • Crowd computation • Tree computation • Hybrid model

  16. Parallel models (2/3) • The "crowd" computing model - consists of a collection of closely related processes, typically executing the same code, performing calculations on different portions of the workload and usually involving the periodic exchange of intermediate results, eg • the master-slave (or host-node) model has a separate `control' program, ie the master, which is responsible for spawning, initialization, collection and display of results. The slave programs perform the actual computations on the workload allocated either by the master or by themselves • the node-to-node model, where multiple instances of a single program execute, with one process (typically the one initiated manually) taking over the non-computational responsibilities as well as contributing to the calculation itself.

  17. Parallel models (3/3) • A `tree' computation model - processes are spawned (usually dynamically as the computation grows) in a tree-like manner establishing a tree-like parent-child relationship eg branch-and-bound algorithms, alpha-beta search, and recursive `divide-and-conquer' algorithms. • A `hybrid' or combination model- possesses an arbitrary spawning structure in that at any point in the application execution the process relationship may resemble an arbitrary and changing graph.

  18. Workload allocation (1/3) • For problems which are too complex or too big to be solved on a single computer • Data decomposition • Function decomposition

  19. Workload allocation (2/3) • Data decomposition • assumes that the overall problem involves applying computational operations or transformations on one or more data structures and that these data structures may be divided and operated upon. • Eg if a vector of N elements and P processors are available, N/P elements are assigned to each process. • Partitioning can be done either `statically', where each processor knows its workload at the beginning, or `dynamically', where a control or master process assigns portions of the workload to processors as and when they become free. A sketch of static partitioning follows.
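
A sketch in C of the static data decomposition described above. The array size N, task count P, message tag 10 and the helper name distribute are illustrative assumptions.

/* The master deals out N/P contiguous elements to each worker.
   Assumes P divides N evenly. */
#include "pvm3.h"
#define N 1000
#define P 4

void distribute(double data[N], int tids[P])
{
    int i, chunk = N / P;
    for (i = 0; i < P; i++) {
        pvm_initsend(PvmDataDefault);
        pvm_pkdouble(&data[i * chunk], chunk, 1);   /* worker i's slice */
        pvm_send(tids[i], 10);
    }
}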

  20. Workload allocation (3/3) • Function decomposition • divides the work based on different operations or functions. • Eg the three stages of a typical program execution - input, processing and output of results - could take the form of three separate and distinct programs, each one being dedicated to one of the stages. Parallelism is obtained by concurrently executing the three programs and establishing a `pipeline' between them. • A more realistic form of function decomposition is to partition the workload according to functions within the computational phase, eg the different sub-algorithms which typically exist in application computations.

  21. Heterogeneity • Supported at 3 levels • Application: tasks may be placed on the processors to which they are most suited • Machine: computers with different data formats, different architectures (serial or parallel), and different operating systems • Network: a virtual machine may span different types of networks such as FDDI, Ethernet, ATM

  22. Portability • PVM is available across a wide range of Unix based computers • PVM version 3 may be ported to non-Unix machines • PVM will soon be ported to VMS • Versions of PVM are available on machines such as the CM-5, CS-2, iPSC/860 and Cray T3D

  23. Software components • pvmd3 daemon • Runs on each host and executes processes • Provides the inter-host point of contact • Provides authentication of tasks • Provides fault tolerance • Message routing and source / sink for messages • libpvm programming library • Contains the message passing routines and is linked to each application component • application components • User's programs, containing message passing calls, which are executed as PVM tasks

  24. PVM terminology • Host • A physical machine such as a workstation • Virtual Machine • A combination of hosts running as a single concurrent resource • Process • A program, data, stack, etc such as a Unix process or node program • Task • A PVM process - the smallest unit of computation • TID • A unique identifier, within the virtual machine, which is associated with each task • Message • An ordered list of data to be sent between tasks

  25. PVM message passing • Sending a message is a 3 step process • Initialize a buffer using the pvm_initsend function • Pack the data into the buffer using the pvm_pk* routines • Send the contents of the buffer to another process using pvm_send, or to a number of processes (multi-cast) using pvm_mcast • Receiving a message is a 2 step process • Call the blocking routine pvm_recv or the non-blocking routine pvm_nrecv • Unpack the message using the pvm_upk* routines • Both sides are illustrated in the sketch below
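
A hedged sketch in C of the two sides of this exchange; the message tag 42 and the helper names are illustrative.

/* Three-step send: initialise, pack, send */
#include "pvm3.h"

void send_values(int dest_tid, int *vals, int n)
{
    pvm_initsend(PvmDataDefault);      /* 1. initialise the send buffer */
    pvm_pkint(vals, n, 1);             /* 2. pack n integers            */
    pvm_send(dest_tid, 42);            /* 3. send to one task           */
}

/* Two-step receive: block, then unpack in the same order */
void recv_values(int *vals, int n)
{
    pvm_recv(-1, 42);                  /* 1. block for any sender       */
    pvm_upkint(vals, n, 1);            /* 2. unpack what was packed     */
}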

  26. Message buffers (1/2) • PVM permits a user to manage multiple buffers but • There is only one active send and one active receive buffer per process at any given moment • the packing, sending, receiving and unpacking routines only affect active buffers • the developer may switch between multiple buffers for message passing

  27. Message buffers (2/2) • In C, PVM provides a separate packing routine for each type of data to be packed • Eg pvm_pkint, pvm_pkdouble
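
A sketch of switching between user-created buffers with pvm_mkbuf and pvm_setsbuf, as allowed by the rules above; the tags and helper name are illustrative.

/* Two send buffers built up independently, then sent separately.
   Only the buffer installed as active is affected by pack/send. */
#include "pvm3.h"

void two_buffers(int tid_a, int tid_b, int x, int y)
{
    int buf1 = pvm_mkbuf(PvmDataDefault);
    int buf2 = pvm_mkbuf(PvmDataDefault);

    pvm_setsbuf(buf1);                 /* buf1 active: pack x          */
    pvm_pkint(&x, 1, 1);
    pvm_setsbuf(buf2);                 /* switch: pack y into buf2     */
    pvm_pkint(&y, 1, 1);

    pvm_setsbuf(buf1); pvm_send(tid_a, 1);
    pvm_setsbuf(buf2); pvm_send(tid_b, 2);

    pvm_freebuf(buf1);                 /* user-created buffers must be */
    pvm_freebuf(buf2);                 /* freed explicitly             */
}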

  28. PVM console (1/2) • The PVM console is analogous to a console on any multi-tasking computer. It is used to • configure the virtual machine • start pvm tasks (including the daemon) • stop pvm tasks • display information and error messages

  29. PVM console (2/2) • Starting the console with a Unix command: pvm • The console may be started and stopped multiple times on any of the hosts on which PVM is running • The console responds with the prompt: pvm> • The console accepts commands from standard input

  30. Configuration of PVM (1/6) • conf • add • delete • spawn • quit and halt

  31. Configuration of PVM (2/6) • conf • The configuration of the virtual machine may be listed using the conf command:
pvm> conf
1 host, 1 data format
HOST DTID ARCH SPEED
navaho-atm 40000 SGI5 1000
• This gives a configuration of 1 host called navaho-atm, with its pvmd task id, architecture and relative speed rating

  32. Configuration of PVM (3/6) • add • Additional hosts may be added using the add command:
pvm> add sioux-atm mohican-atm
2 successful
HOST DTID
sioux-atm 100000
mohican-atm 140000
pvm> conf
3 hosts, 1 data format
HOST DTID ARCH SPEED
navaho-atm 40000 SGI5 1000
sioux-atm 100000 SGI5 1000
mohican-atm 140000 SGI5 1000

  33. Configuration of PVM (4/6) • delete • Hosts may be removed from the virtual machine using the delete command:
pvm> delete sioux-atm
1 successful
HOST STATUS
sioux-atm deleted

  34. Configuration of PVM (5/6) • spawn • The spawn command may be used to execute a program from the pvm console:
pvm> spawn -> program-name
• The -> option directs the output from the program to the console screen

  35. Configuration of PVM (6/6) • quit, halt • To exit from the console session while leaving all daemons and jobs running use the quit command • pvm> quit • To kill all pvm processes including the console, and then shut down PVM • pvm> halt

  36. Error handling • The completion status of a pvm routine may be checked: if the status is >= 0 the routine completed successfully, while a negative value indicates that an error has occurred • When an error occurs, a message is automatically printed that shows the task ID, the function and the error • Error reports may be generated manually by calling pvm_perror, as in the sketch below
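
A small sketch of this convention; the wrapper name checked_send is an illustrative assumption.

/* Check the completion status and report errors via pvm_perror */
#include "pvm3.h"

int checked_send(int tid, int msgtag)
{
    int info = pvm_send(tid, msgtag);
    if (info < 0)                      /* negative status means error  */
        pvm_perror("pvm_send failed"); /* prints the PVM error text    */
    return info;
}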

  37. Debugging • A task run by hand can be started under a serial debugger. Add the PvmTaskDebug flag to pvm_spawn • The following steps should be followed when debugging • Run the program as a single task and debug as any other serial program • Run the program on a single host to eliminate network errors and errors in message passing • Run the program across a few hosts to locate deadlock and synchronization problems • Use XPVM

  38. Fault detection • The pvmds recover automatically; applications must perform their own error recovery • A dead host will remain dead, but may be reconfigured later • The pvmd-to-pvmd message passing system drives all fault detection • Fault detection is triggered by a retry time-out

  39. pvm_initsend • Clears the default send buffer and specifies the message encoding • int bufid = pvm_initsend(int encoding) • encoding - specifies the next message's encoding scheme, where the options are: • PvmDataDefault ie XDR encoding, used by default as PVM cannot know if the user is going to add a heterogeneous machine before this message is sent • PvmDataRaw ie no encoding, used when it is known that the receiving machine understands the native format • PvmDataInPlace ie specifies that data be left in place during packing • bufid - integer returned containing the message buffer identifier. Values less than zero indicate an error.

  40. pvm_pk* • Each data type is packed by a different function: • int info = pvm_pkint(int *ip, int nitem, int stride) • Arguments are: • ip - a pointer to the first item • nitem - the number of items • stride - the stride/step through the data • pvm_pkstr - packs a null terminated string (no nitem or stride) • pkbyte, pkcplx, pkdcplx, pkdouble, pkfloat, pklong, pkshort, pkstr

  41. pvm_pk* • Can be called multiple times in any combination • C structures must be packed by element, as in the sketch below • The complexity of a packed message is unlimited • Unpacking should occur in the same order (and size) as packing (not strictly necessary, but good programming practice) • Similar pvm_upk* functions exist for unpacking
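
A sketch of element-by-element packing of a C structure; the struct layout is an illustrative assumption.

/* A struct is packed field by field, and unpacked in the same order */
#include "pvm3.h"

struct particle { int id; double pos[3]; };

void pack_particle(struct particle *p)
{
    pvm_pkint(&p->id, 1, 1);           /* one int, stride 1            */
    pvm_pkdouble(p->pos, 3, 1);        /* three contiguous doubles     */
}

void unpack_particle(struct particle *p)
{
    pvm_upkint(&p->id, 1, 1);          /* same order and sizes         */
    pvm_upkdouble(p->pos, 3, 1);
}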

  42. pvm_send • Sends the data in the active message buffer • int info = pvm_send(int tid, int msgtag) • tid - integer task identifier of the destination process • msgtag - integer message tag supplied by the user, should be >= 0 • info - integer status code returned by the routine; values less than zero indicate an error.

  43. pvm_recv • Receives a message: a blocking operation • int bufid = pvm_recv(int tid, int msgtag) • tid - integer identifier of the sending process supplied by the user; a -1 in this argument matches any tid ie a wildcard. • msgtag - integer message tag supplied by the user, should be >= 0; allows the user's program to distinguish between different kinds of messages. A -1 in this argument matches any msgtag ie a wildcard. • bufid - integer returning the value of the new active receive buffer identifier. Values less than zero indicate an error. • Eg bufid = pvm_recv(-1, 4)

  44. pvm_bufinfo • Returns information about the message in the nominated buffer. Useful when the message was received with wildcards for tid and msgtag, as in the sketch below • int info = pvm_bufinfo(int bufid, int *bytes, int *msgtag, int *tid) • bufid - integer buffer id returned by pvm_recv • bytes - length of the last message • msgtag - integer message tag • tid - TID of the sending process • info - return status
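
A sketch combining a fully wildcarded receive with pvm_bufinfo; the helper name recv_any is illustrative.

/* Discover the sender, tag and size after a wildcard receive */
#include <stdio.h>
#include "pvm3.h"

void recv_any(void)
{
    int bytes, msgtag, tid;
    int bufid = pvm_recv(-1, -1);      /* any sender, any tag          */

    pvm_bufinfo(bufid, &bytes, &msgtag, &tid);
    printf("%d bytes, tag %d, from task t%x\n", bytes, msgtag, tid);
}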

  45. pvm_spawn • Starts new PVM processes • int numt = pvm_spawn(char *task, char **argv, int flag, char *where, int ntask, int *tids) • task - character string containing the executable file name of the PVM process to be started • argv - pointer to a null-terminated array of arguments for the executable (NULL if there are none) • flag - integer specifying spawn options, eg PvmTaskDefault, which means PVM can choose any machine to start the task • where - character string specifying where to start the PVM process • ntask - integer specifying the number of copies of the executable to start up • tids - array of length at least ntask which contains the tids of the PVM processes started by this pvm_spawn call • numt - integer returning the actual number of tasks started; values less than 0 indicate a system error. A positive value less than ntask also indicates a partial failure • Eg pvm_spawn("nodeprog", NULL, PvmTaskHost, "sioux-atm", 1, &tids[0])
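
A sketch of the example call above with its return value checked; the program and host names come from the slide, while the wrapper name start_node is illustrative.

/* Spawn one copy of nodeprog on the named host; numt < ntask
   signals partial failure, numt < 0 a system error. */
#include "pvm3.h"

int start_node(int *tid)
{
    int numt = pvm_spawn("nodeprog", NULL, PvmTaskHost,
                         "sioux-atm", 1, tid);
    if (numt < 1)
        pvm_perror("pvm_spawn");       /* report what went wrong       */
    return numt;
}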

  46. pvm_mcast • Multicasts the data in the active message buffer to a set of tasks • int info = pvm_mcast(int *tids, int ntask, int msgtag) • tids - array of length at least ntask which contains the tids of the tasks to be sent the message • ntask - specifies the number of tasks to be sent the message • msgtag - integer message tag supplied by the user, should be >= 0 • info - integer status code returned by the routine; values less than zero indicate an error.
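
A sketch of multicasting one packed value to a task list; the tag 5 and the helper name are illustrative.

/* Pack a value once, then multicast it to ntask tasks */
#include "pvm3.h"

void broadcast_step(int *tids, int ntask, int step)
{
    pvm_initsend(PvmDataDefault);
    pvm_pkint(&step, 1, 1);
    pvm_mcast(tids, ntask, 5);         /* one send reaches all tasks   */
}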

  47. Compilation • gcc compiler + reference to the PVM libraries • libpvm3 • libgpvm3 • Usage of aimk: an architecture-aware make wrapper supplied with PVM

  48. Execution • The PVM console command spawn searches for the executable programs in 2 places: • in the user's directory ~/pvm3/bin/$PVM_ARCH • in $PVM_ROOT/bin/$PVM_ARCH • Therefore compiled programs should be moved to ~/pvm3/bin/$PVM_ARCH

  49. Dynamic process group • Group functions are built on top of the core PVM routines • libgpvm3.a must be linked with user programs using any of the group functions • pvmd does not perform the group functions - they are performed by a group server which is automatically started when the first group function is invoked.

  50. Dynamic process group • Any PVM task can join or leave a group at any time without having to inform any other task in the affected groups (a sketch follows). • Tasks can broadcast messages to groups of which they are not a member • Any PVM task may call any of the group functions, except for pvm_lvgroup(), pvm_barrier() and pvm_reduce(), which require the calling task to be a member of the group
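
A sketch of the group calls mentioned above; the group name "workers" and the member count of 4 are illustrative assumptions.

/* Join a named group, synchronise at a barrier, then leave */
#include "pvm3.h"

void group_sync(void)
{
    pvm_joingroup("workers");          /* returns this task's instance */
    /* ... compute ... */
    pvm_barrier("workers", 4);         /* block until 4 members arrive */
    pvm_lvgroup("workers");            /* caller must be a member      */
}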
