1. Message Passing Programming with MPI Some concepts in message passing programming
Introduction to MPI
Basic MPI functions
Most of the MPI materials are obtained from William Gropp and Rusty Lusk’s MPI tutorial at http://www.mcs.anl.gov/mpi/tutorial/
2. Message passing programming Separate (collaborative) processes are running.
Whether the processes run on the same processor or on different processors makes no difference in terms of correctness.
Notice the difference from the OpenMP model.
These processes interact by exchanging information.
In the message passing model, exchanging information is done explicitly by calling communication routines.
3. Message passing program execution model
The MIMD (multiple instruction multiple data) model allows each process to run a different executable.
a0.out for process 0, a1.out for process 1, … ai.out for process i.
We probably want this flexibility in many cases; however, it causes a lot of problems:
A run with 1000 processes would need 1000 executables, which is not scalable.
One must specify which process gets which executable; the execution interface can be painful.
……
SPMD (single program multiple data) model uses one executable
All processes run a.out, which more closely matches the execution of a sequential program.
Most systems follow this model.
4. SPMD (single program multiple data) model uses one executable
Most platforms follow this model.
The MIMD model can be realized within the SPMD model.
In a.out:
if (myid == 0) { /* code of a0.out */ }
if (myid == 1) { /* code of a1.out */ }
……
One can usually do better since most processes in a message passing program do similar things.
We will only discuss SPMD message passing programs.
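As a concrete illustration of the rank-based branching above, here is a minimal sketch (the function names do_a0_work and do_a1_work are hypothetical placeholders for whatever a0.out and a1.out would do):

#include "mpi.h"

void do_a0_work(void) { /* what a0.out would do */ }
void do_a1_work(void) { /* what the other ai.out would do */ }

int main(int argc, char *argv[])
{
    int myid;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &myid);   /* which process am I? (see later slides) */
    if (myid == 0)
        do_a0_work();
    else
        do_a1_work();
    MPI_Finalize();
    return 0;
}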
5. Issues in running a message passing program
On which processor is each process executed?
For the SPMD model, the pool of processors (machines) must be specified.
In the ideal case when all machines are the same (Same memory, same CPU, same network connectivity), specifying the pool is sufficient.
In a typical parallel environment, the processors (machines) are the same, but the network connectivity is usually not the same.
7. Issues in running a message passing program:
The mapping between processes and processors can be very important to the performance.
The best mapping is usually platform-specific.
Summary:
To run a message passing program, one must specify:
The processes (SPMD)
The processors (platform)
The mapping.
Performance issues come up even at this level.
The average programmer does not understand this issue.
8. Even at this very high level (e.g., process mapping), the message passing paradigm has a lot more subtlety than the sequential or shared-memory programming paradigms.
The message passing paradigm targets efficient and scalable programs for scalable platforms (> 10000 processing elements).
In many cases, anything that affects performance must be considered explicitly.
9. Message passing mechanism Two types of communications
Cooperative communications
All parties agree to the communication
One sided communications
Only one process is involved in the communication.
10. Cooperative communications Data must be explicitly sent and received.
Communications in TCP/IP socket programming belong to this type (Sender must write, receiver must read).
11. One sided communications Can either be remote read or remote write.
Only one side is involved in the operations.
Data is accessed without waiting for another process.
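For illustration only, here is a minimal sketch of a one-sided remote write using the MPI-2 RMA routines MPI_Win_create, MPI_Win_fence, and MPI_Put (these belong to the full MPI standard, not to the six basic routines discussed later); it assumes at least two processes:

#include "mpi.h"

int main(int argc, char *argv[])
{
    int myid, value = 42, exposed = 0;
    MPI_Win win;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &myid);

    /* every process exposes one int as a remotely accessible window */
    MPI_Win_create(&exposed, sizeof(int), sizeof(int),
                   MPI_INFO_NULL, MPI_COMM_WORLD, &win);

    MPI_Win_fence(0, win);                  /* open the access epoch */
    if (myid == 0)
        MPI_Put(&value, 1, MPI_INT, 1, 0, 1, MPI_INT, win);  /* write into rank 1 */
    MPI_Win_fence(0, win);                  /* complete the epoch */

    MPI_Win_free(&win);
    MPI_Finalize();
    return 0;
}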
12. Message Passing Interface (MPI) MPI is an industry standard that specifies the library routines needed for writing message passing programs.
Mainly communication routines
It also includes other features such as process topologies.
MPI allows the development of portable message passing programs.
It is a standard supported by pretty much everybody in the field.
13. MPI is both simple and complex.
Almost all MPI programs can be realized with six MPI routines.
MPI has a total of more than 100 functions and a lot of concepts.
We will mainly discuss the simple MPI, but we will also give a glimpse of the complex MPI.
MPI is about just the right size.
One has the flexibility when it is required.
One can start using it after learning the six routines.
14. The hello world MPI program #include "mpi.h"
#include <stdio.h>
int main( int argc, char *argv[] )
{
MPI_Init( &argc, &argv );   /* initialize the MPI environment */
printf( "Hello world\n" );
MPI_Finalize();             /* shut down MPI */
return 0;
}
15. Compiling, linking and running MPI programs MPICH2 is installed on linprog
To run an MPI program, do the following:
Create a file called .mpd.conf in your home directory with the content ‘secretword=clusterxxx’, where xxx is whatever number you choose.
Create a file ‘hosts’ specifying the machines to be used to run MPI programs (see linprog_hosts).
Boot the system: ‘mpdboot -n 3 -f hosts’
Check whether the system is correctly set up: ‘mpdtrace’
Compile the program: ‘mpicc hello.c’
Run the program: ‘mpiexec -machinefile hostmap -n 4 a.out’
hostmap specifies the process-to-machine mapping.
-n 4 runs the program with 4 processes.
Exit MPI: ‘mpdallexit’
16. More on MPI
MPICH uses the SPMD model.
In order for an SPMD program to emulate MIMD code, the program must know its execution environment
How many processes are working on this problem?
MPI_Comm_size
What is myid?
MPI_Comm_rank
Rank is defined with respect to a communicator (the context of the communication). MPI_COMM_WORLD is a predefined communicator that includes all processes (already mapped to processors).
See example2.c.
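A minimal sketch along the lines of example2.c (its exact contents may differ):

#include "mpi.h"
#include <stdio.h>

int main(int argc, char *argv[])
{
    int numprocs, myid;
    MPI_Init(&argc, &argv);
    MPI_Comm_size(MPI_COMM_WORLD, &numprocs);  /* how many processes are working on this problem? */
    MPI_Comm_rank(MPI_COMM_WORLD, &myid);      /* what is myid? */
    printf("Process %d of %d\n", myid, numprocs);
    MPI_Finalize();
    return 0;
}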
17. Sending and receiving messages Questions to be answered:
To whom is the data sent?
What is sent?
How does the receiver identify the message?
18. Socket programming:
To whom is the data sent? A socket id.
What is sent? A buffer pointer; a continuous byte stream.
How does the receiver identify the message? Socket id + message order.
What does a programmer typically want?
Identify the peer by a peer id, not a socket id.
Send/receive data with flexible types, user defined types
Identify messages by sender id, message order, or tag
19. Send and receive routines in MPI
MPI_Send and MPI_Recv (blocking send/recv)
Identify peer: Peer rank (peer id)
Specify data: Starting address, datatype, and count.
An MPI datatype is recursively defined as:
predefined, corresponding to a data type from the language (e.g., MPI_INT, MPI_DOUBLE)
a contiguous array of MPI datatypes
a strided block of datatypes
an indexed array of blocks of datatypes
an arbitrary structure of datatypes
There are MPI functions to construct custom datatypes, in particular ones for subarrays (a sketch follows this slide).
Identifying message: peer + tag
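As an example of the datatype constructors mentioned above, here is a minimal sketch using MPI_Type_vector to describe a strided block (the 4x8 array and the column layout are arbitrary choices for illustration):

#include "mpi.h"

int main(int argc, char *argv[])
{
    double a[4][8];
    MPI_Datatype column;

    MPI_Init(&argc, &argv);

    /* one column of a 4x8 row-major array: 4 blocks of 1 double, stride 8 */
    MPI_Type_vector(4, 1, 8, MPI_DOUBLE, &column);
    MPI_Type_commit(&column);

    /* the derived type can now be used wherever a predefined one is, e.g.
       MPI_Send(&a[0][0], 1, column, dest, tag, MPI_COMM_WORLD); */

    MPI_Type_free(&column);
    MPI_Finalize();
    return 0;
}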
20. MPI blocking send MPI_Send(start, count, datatype, dest, tag, comm)
The message buffer is described by (start, count, datatype).
The target process is specified by dest, which is the rank of the target process in the communicator comm.
When this function returns, the data has been delivered to the system and the buffer can be reused. The message may not have been received by the target process.
21. MPI blocking receive MPI_Recv(start, count, datatype, source, tag, comm, status)
Waits until a matching (both source and tag) message is received from the system, and the buffer can be used
source is rank in communicator specified by comm, or MPI_ANY_SOURCE (a message from anyone)
tag is a tag to be matched on or MPI_ANY_TAG
receiving fewer than count occurrences of datatype is OK, but receiving more is an error (result undefined)
status contains further information (e.g. size of message, rank of the source)
See example3a.c for the use of MPI_Send and MPI_Recv.
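A minimal sketch in the spirit of example3a.c (its exact contents may differ): process 0 sends one integer to process 1; run with at least 2 processes.

#include "mpi.h"
#include <stdio.h>

int main(int argc, char *argv[])
{
    int myid, data;
    MPI_Status status;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &myid);

    if (myid == 0) {
        data = 100;
        MPI_Send(&data, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);          /* dest = 1, tag = 0 */
    } else if (myid == 1) {
        MPI_Recv(&data, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, &status); /* source = 0, tag = 0 */
        printf("Process 1 received %d\n", data);
    }

    MPI_Finalize();
    return 0;
}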
22. The PI program The summation that approximates π can be performed in a parallel fashion.
See example3a.c
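A minimal sketch of the idea, using only the six basic routines: each process sums a strided share of the midpoint rule for the integral of 4/(1+x^2) over [0,1], and the partial sums are collected at process 0 with MPI_Send/MPI_Recv (n = 1000000 is an arbitrary choice; the actual example program may differ).

#include "mpi.h"
#include <stdio.h>

int main(int argc, char *argv[])
{
    int myid, numprocs, i, n = 1000000;
    double h, x, sum = 0.0, pi, partial;
    MPI_Status status;

    MPI_Init(&argc, &argv);
    MPI_Comm_size(MPI_COMM_WORLD, &numprocs);
    MPI_Comm_rank(MPI_COMM_WORLD, &myid);

    h = 1.0 / n;
    for (i = myid; i < n; i += numprocs) {       /* each process takes a strided share */
        x = (i + 0.5) * h;
        sum += 4.0 / (1.0 + x * x);
    }
    sum *= h;

    if (myid != 0) {                             /* workers send partial sums to process 0 */
        MPI_Send(&sum, 1, MPI_DOUBLE, 0, 0, MPI_COMM_WORLD);
    } else {                                     /* process 0 collects and adds them up */
        pi = sum;
        for (i = 1; i < numprocs; i++) {
            MPI_Recv(&partial, 1, MPI_DOUBLE, MPI_ANY_SOURCE, 0,
                     MPI_COMM_WORLD, &status);
            pi += partial;
        }
        printf("pi is approximately %.16f\n", pi);
    }

    MPI_Finalize();
    return 0;
}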
23. The Simple MPI (six functions that make most programs work):
MPI_INIT
MPI_FINALIZE
MPI_COMM_SIZE
MPI_COMM_RANK
MPI_SEND
MPI_RECV
Only MPI_Send and MPI_Recv are non-trivial.
Everything that can be done with TCP/IP socket programming or UNIX pipes can be realized with these six functions in a more concise manner.