Learn about the multithreaded programming model using the Cilk language, with emphasis on load balancing and synchronization for parallel applications. Dive into dynamic behavior, Fibonacci computations, matrix multiplication, and critical-path analysis.
Load Balancing and Multithreaded Programming Nir Shavit Multiprocessor Synchronization Spring 2003
How to write Parallel Apps? • Multithreaded Programming • Programming model • Programming language (Cilk) • Well-developed theory • Successful practice M. Herlihy & N. Shavit (c) 2003
Why We Care • Interesting in its own right • The scheduler is an ideal application for lock-free data structures M. Herlihy & N. Shavit (c) 2003
Multithreaded Fibonacci

int fib(int n) {
  if (n < 2) {
    return n;
  } else {
    int x = spawn fib(n-1);   // parallel method call
    int y = spawn fib(n-2);   // parallel method call
    sync();                   // wait for children to complete
    return x + y;             // safe to use children's values
  }
}

*Cilk Code (Java Code in Notes) M. Herlihy & N. Shavit (c) 2003
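The slide points to a Java version in the course notes; that code is not reproduced here. As a rough sketch of what a Java equivalent might look like today, using the java.util.concurrent fork/join framework (my choice of API, not the notes' code; fork() plays the role of spawn and join() the role of sync):

import java.util.concurrent.ForkJoinPool;
import java.util.concurrent.RecursiveTask;

// Fork/join analogue of the Cilk fib.
class Fib extends RecursiveTask<Integer> {
  private final int n;
  Fib(int n) { this.n = n; }

  @Override protected Integer compute() {
    if (n < 2) return n;
    Fib x = new Fib(n - 1);
    Fib y = new Fib(n - 2);
    x.fork();                  // "spawn" fib(n-1)
    int yVal = y.compute();    // compute fib(n-2) in the current worker
    int xVal = x.join();       // "sync": wait for the spawned child
    return xVal + yVal;        // safe to use children's values
  }

  public static void main(String[] args) {
    System.out.println(new ForkJoinPool().invoke(new Fib(10)));  // prints 55
  }
}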
Note • Spawn & sync operators • Like Israeli traffic signs • Are purely advisory in nature • The scheduler • Like the Israeli driver • Has complete freedom to decide M. Herlihy & N. Shavit (c) 2003
Dynamic Behavior • Multithreaded program is • A directed acyclic graph (DAG) • That unfolds dynamically • A thread is • Maximal sequence of instructions • Without spawn, sync, or return M. Herlihy & N. Shavit (c) 2003
Fib DAG (figure): the fib(4) computation unfolds into a DAG of fib(3), fib(2), and fib(1) nodes connected by spawn and sync edges; the arrows reflect dependencies. M. Herlihy & N. Shavit (c) 2003
How Parallel is That? • Define work: • Total time on one processor • Define critical-path length: • Longest dependency path • Can’t beat that! M. Herlihy & N. Shavit (c) 2003
Fib Work (figure): numbering the nodes of the fib(4) DAG from 1 to 17 shows that the work is 17. M. Herlihy & N. Shavit (c) 2003
Fib Critical Path (figure): the longest dependency path through the fib(4) DAG passes through 8 nodes, so the critical-path length is 8. M. Herlihy & N. Shavit (c) 2003
Notation Watch • TP = time on P processors • T1 = work (time on 1 processor) • T∞ = critical path length (time on ∞ processors) M. Herlihy & N. Shavit (c) 2003
Simple Bounds • TP ≥ T1/P • In one step, can’t do more than P work • TP ≥ T∞ • Can’t beat infinite resources M. Herlihy & N. Shavit (c) 2003
More Notation Watch • Speedup on P processors • Ratio T1/TP • How much faster with P processors • Linear speedup • T1/TP = Θ(P) • Max speedup (average parallelism) • T1/T∞ M. Herlihy & N. Shavit (c) 2003
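For example, for the fib(4) DAG above: T1 = 17 and T∞ = 8, so the simple bounds give TP ≥ 17/P and TP ≥ 8, and the maximum speedup (average parallelism) is T1/T∞ = 17/8 ≈ 2.1.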
Remarks • Graph nodes have out-degree ≤ 2 • Unique • Starting node • Ending node M. Herlihy & N. Shavit (c) 2003
Matrix Multiplication M. Herlihy & N. Shavit (c) 2003
Matrix Multiplication • Each n-by-n matrix multiplication • 8 multiplications • 4 additions • Of n/2-by-n/2 submatrices M. Herlihy & N. Shavit (c) 2003
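Concretely, the block decomposition that the code below implements is:

C11 = A11·B11 + A12·B21    C12 = A11·B12 + A12·B22
C21 = A21·B11 + A22·B21    C22 = A21·B12 + A22·B22

The code computes the first product of each entry into C, the second into a temporary matrix T, and then adds T into C.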
Addition

int add(Matrix C, Matrix T, int n) {
  if (n == 1) {
    C[1,1] = C[1,1] + T[1,1];
  } else {
    partition C, T into half-size submatrices;
    spawn add(C11,T11,n/2);
    spawn add(C12,T12,n/2);
    spawn add(C21,T21,n/2);
    spawn add(C22,T22,n/2);
    sync();
  }
}

M. Herlihy & N. Shavit (c) 2003
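A hedged sketch of how this recursion might look as runnable Java with the fork/join framework; the quadrant bookkeeping via row/column offsets is my own choice, not from the slides:

import java.util.concurrent.ForkJoinPool;
import java.util.concurrent.RecursiveAction;

// Adds T into C over the n-by-n block whose top-left corner is (row, col).
class MatrixAdd extends RecursiveAction {
  private final double[][] c, t;
  private final int row, col, n;

  MatrixAdd(double[][] c, double[][] t, int row, int col, int n) {
    this.c = c; this.t = t; this.row = row; this.col = col; this.n = n;
  }

  @Override protected void compute() {
    if (n == 1) {
      c[row][col] += t[row][col];
    } else {
      int h = n / 2;             // half-size submatrices
      invokeAll(                 // "spawn" the four quadrant additions, then "sync"
          new MatrixAdd(c, t, row,     col,     h),
          new MatrixAdd(c, t, row,     col + h, h),
          new MatrixAdd(c, t, row + h, col,     h),
          new MatrixAdd(c, t, row + h, col + h, h));
    }
  }
}

Usage: new ForkJoinPool().invoke(new MatrixAdd(C, T, 0, 0, n)) adds T into C in place, assuming n is a power of two (as the recurrence does).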
Addition • Let AP(n) be running time • For n x n matrix • on P processors • For example • A1(n) is work • A∞(n) is critical path length M. Herlihy & N. Shavit (c) 2003
Addition • Work is A1(n) = 4 A1(n/2) + Θ(1): 4 A1(n/2) for the 4 spawned additions, Θ(1) for partition, sync, etc. M. Herlihy & N. Shavit (c) 2003
Addition • Work is A1(n) = 4 A1(n/2) + Θ(1) = Θ(n²) Same as double-loop summation M. Herlihy & N. Shavit (c) 2003
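To see where Θ(n²) comes from (a standard recursion-tree count, added for completeness): level k of the recursion has 4^k subproblems, each costing Θ(1), and the recursion bottoms out after log₂ n levels, so the total is Θ(4^(log₂ n)) = Θ(n²), i.e. one constant-time update per matrix entry.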
Addition • Critical Path length is A∞(n) = A∞(n/2) + Θ(1): A∞(n/2) for the spawned additions (which run in parallel), Θ(1) for partition, sync, etc. M. Herlihy & N. Shavit (c) 2003
Addition • Critical Path length is A∞(n) = A∞(n/2) + Θ(1) = Θ(log n), since n is halved Θ(log n) times M. Herlihy & N. Shavit (c) 2003
Multiplication

int mult(Matrix C, Matrix A, Matrix B, int n) {
  if (n == 1) {
    C[1,1] = A[1,1]·B[1,1];
  } else {
    allocate temporary n·n matrix T;
    partition A,B,C,T into half-size submatrices;
    …

M. Herlihy & N. Shavit (c) 2003
Multiplication (cont'd)

    spawn mult(C11,A11,B11,n/2);
    spawn mult(C12,A11,B12,n/2);
    spawn mult(C21,A21,B11,n/2);
    spawn mult(C22,A21,B12,n/2);
    spawn mult(T11,A12,B21,n/2);
    spawn mult(T12,A12,B22,n/2);
    spawn mult(T21,A22,B21,n/2);
    spawn mult(T22,A22,B22,n/2);
    sync();
    spawn add(C,T,n);
  }
}

M. Herlihy & N. Shavit (c) 2003
Multiplication • Work is M1(n) = 8 M1(n/2) + A1(n): 8 spawned multiplications plus the final addition M. Herlihy & N. Shavit (c) 2003
Multiplication • Work is M1(n) = 8 M1(n/2) + Θ(n²) = Θ(n³) Same as serial triple-nested loop M. Herlihy & N. Shavit (c) 2003
Multiplication • Critical path length is M∞(n) = M∞(n/2) + A∞(n): M∞(n/2) for the half-size multiplications (all in parallel), plus A∞(n) for the final addition M. Herlihy & N. Shavit (c) 2003
Multiplication • Critical path length is M∞(n) = M∞(n/2) + A∞(n) = M∞(n/2) + Θ(log n) = Θ(log² n), a Θ(log n) addition at each of the Θ(log n) levels of halving M. Herlihy & N. Shavit (c) 2003
Parallelism • M1(n)/M∞(n) = Θ(n³/log² n) • To multiply two 1000 x 1000 matrices • 1000³/10² = 10⁷ • Much more than number of processors on any real machine M. Herlihy & N. Shavit (c) 2003
Shared-Memory Multiprocessors • Parallel applications • Java • Cilk, etc. • Mix of other jobs • All run together • Come & go dynamically M. Herlihy & N. Shavit (c) 2003
Scheduling • Ideally, • User-level scheduler • Maps threads to dedicated processors • In real life, • User-level scheduler • Maps threads to fixed number of processes • Kernel-level scheduler • Maps processes to dynamic pool of processors M. Herlihy & N. Shavit (c) 2003
For Example • Initially, • All P processors available for application • Serial computation • Takes over one processor • Leaving P-1 for us • Waits for I/O • We get that processor back …. M. Herlihy & N. Shavit (c) 2003
Speedup • Map threads onto P processes • Cannot get P-fold speedup • What if the kernel doesn’t cooperate? • Can try for PA-fold speedup • PA is time-averaged number of processors the kernel gives us M. Herlihy & N. Shavit (c) 2003
Static Load Balancing (figure): speedup vs. number of processes (1 to 32) on an 8-processor Sun Ultra Enterprise 5000, for mm(1024), lu(2048), barnes(16K,10), and heat(4K,512,100), compared against ideal speedup. M. Herlihy & N. Shavit (c) 2003
Dynamic Load Balancing (figure): speedup vs. number of processes (1 to 32) on the same 8-processor Sun Ultra Enterprise 5000, for mm(1024), lu(2048), barnes(16K,10), heat(4K,512,100), msort(32M), and ray(), compared against ideal speedup. M. Herlihy & N. Shavit (c) 2003
Scheduling Hierarchy • User-level scheduler • Tells kernel which processes are ready • Kernel-level scheduler • Synchronous (for analysis, not correctness!) • Picks pi threads to schedule at step i • Time-weighted average is: PA = (p1 + p2 + … + pT)/T M. Herlihy & N. Shavit (c) 2003
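For example (hypothetical numbers, just to exercise the formula): if the kernel grants p1 = 3, p2 = 1, and p3 = 4 processors over T = 3 steps, then PA = (3 + 1 + 4)/3 ≈ 2.7.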
Greed is Good • Greedy scheduler • Schedules as much as it can • At each time step M. Herlihy & N. Shavit (c) 2003
Theorem • Greedy scheduler ensures actual time T ≤ T1/PA + T∞(P-1)/PA M. Herlihy & N. Shavit (c) 2003
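To make the bound concrete, here is a minimal simulation sketch in Java (assumptions: a tiny hand-made DAG, a fixed P processors available at every step so that PA = P, and unit-time threads; none of this comes from the slides). It runs a greedy schedule and compares the number of steps against T1/P + T∞(P-1)/P.

import java.util.*;

// Greedy scheduling of a thread DAG on P processors.
public class GreedySim {
  public static void main(String[] args) {
    int P = 2;
    // Example DAG (hypothetical): node -> successors.
    // 0 spawns 1 and 2; 1 spawns 3 and 4; all join at 5, which precedes 6.
    int[][] succ = { {1, 2}, {3, 4}, {5}, {5}, {5}, {6}, {} };
    int n = succ.length;

    int[] indeg = new int[n];
    for (int[] s : succ) for (int v : s) indeg[v]++;

    Deque<Integer> ready = new ArrayDeque<>();
    for (int v = 0; v < n; v++) if (indeg[v] == 0) ready.add(v);

    int steps = 0, executed = 0;
    while (executed < n) {          // in a DAG, some node is always ready here
      steps++;
      List<Integer> batch = new ArrayList<>();
      while (!ready.isEmpty() && batch.size() < P)  // greedy: run as much as we can
        batch.add(ready.poll());
      for (int u : batch) {
        executed++;
        for (int v : succ[u]) if (--indeg[v] == 0) ready.add(v);
      }
    }

    int work = n;                    // T1: one unit of work per node
    int span = criticalPath(succ);   // T∞: longest path, counted in nodes
    double bound = (double) work / P + (double) span * (P - 1) / P;
    System.out.println("steps = " + steps + ", bound = " + bound);
  }

  // Longest path in the DAG via memoized depth-first search.
  static int criticalPath(int[][] succ) {
    int[] memo = new int[succ.length];
    int best = 0;
    for (int v = 0; v < succ.length; v++) best = Math.max(best, longest(v, succ, memo));
    return best;
  }

  static int longest(int v, int[][] succ, int[] memo) {
    if (memo[v] != 0) return memo[v];
    int max = 0;
    for (int w : succ[v]) max = Math.max(max, longest(w, succ, memo));
    return memo[v] = 1 + max;
  }
}

For this example DAG with P = 2 the simulation takes 5 steps, comfortably under the bound of 6.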
Proof Strategy: in the bound T ≤ T1/PA + T∞(P-1)/PA, the term to bound is T∞(P-1). M. Herlihy & N. Shavit (c) 2003
Put Tokens in Buckets (figure): two buckets, work and idle. A token goes into the work bucket for a thread that is scheduled and executed, and into the idle bucket for a thread that is scheduled but not executed. M. Herlihy & N. Shavit (c) 2003
At the end … Total #tokens = work tokens + idle tokens M. Herlihy & N. Shavit (c) 2003
At the end … the work bucket holds T1 tokens M. Herlihy & N. Shavit (c) 2003
Must Show: the idle bucket holds ≤ T∞(P-1) tokens M. Herlihy & N. Shavit (c) 2003
Every Move You Make … • Scheduler is greedy • At least one node ready • Number of idle threads in one step • At most pi-1 ≤ P-1 M. Herlihy & N. Shavit (c) 2003