Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Asymmetry Aware Scheduling Algorithms for Asymmetric Processors Nagesh Lakshminarayana Sushma Rao Hyesoon Kim Computer Science Georgia Institute of Technology

Outline • Background and Problem • Application characteristics on AMP/SMP • LJFPF Policy • CJFPF Policy • Conclusion

PEB PEB PEA Interconnect PEB PEB Heterogeneous Architectures • A particularly interesting class of parallel machines is Heterogeneous Architecture: • Multiple types of Processing Elements (PEs) available on the same machine

Special accelerator Multicore CPU + GPU IBM Cell processor Heterogeneous Architectures • Heterogeneous architectures are becoming very common: Focus of this talk Fast core Fast core Slow core Slow core Slow core Slow core Asymmetric Processors

Scheduling Problem: Multiple applications Non-scalable applications Fast core Fast core Slow core Slow core Slow Core Slow core Slow core Scalable applications Fast Core

Scheduling Problem: Multi-threaded application Fast core Fast core Slow core Slow core Slow core Slow core

Problem How to schedule multi-threaded applications on Asymmetric Multiprocessors (AMP)?

Experimental Methodology • Use a 1.87GHz two-socket Quad-core machine to measure the performance • Use SpeedStep technology to emulate an AMP

Performance Results on AMP/SMP

Slow-Limited Applications Fast core Fast core Slow core Slow core Slow core Slow core barrier

Middle-perf Benchmarks Similar to a slow-limited benchmark but sequential section is much longer barrier

Unstable Benchmarks barrier barrier Asymmetric workloads Lots of barriers

PARSEC Benchmarks

Outline • Background and Problem • Applications on AMP/SMP • LJFPF Policy • CJFPF Policy • Conclusion

LJFPF Policy • Longest Job to a Fast Processor First Slow core Fast core Slow core Fast core barrier

How Does the Scheduler Know • Length of work? • Current mechanism: application sends the information • On-going work: Prediction mechanism

Evaluation • Matrix Multiplication Sequential version Parallel version Symmetric workload Parallel version Asymmetric workload

Asymmetric Workload (Matrix Multiplication)

Real Application • ITK (Medical image processing tool kit) • Open source but a real application

Evaluation: MultiRegistration • Kernel loop has 50 iterations 50 % 8 ≠0 • Divide 50 iterations into 7, 7, 7, 7, 6, 6, 5, 5

Results: ITK Benchmark 2.3%

Critical Section Lock Lock

Critical Section Limited Workloads Case (a) Case (b) Critical section Useful work waiting

Critical Section Effects Half-half performs similar to all-fast

CJFPF Policy • Critical Job to a Fast Processor First Policy Fast core Slow core Slow core Slow core

CJFPF Results Longer critical section The benefit of the CJFPF policy decreases

Conclusion • We evaluated the characteristics of multi-threaded applications on AMPs. • Barriers and critical sections are important factors. • Propose two new scheduling policies: Longest job to fast core first (LJFPF), critical job to fast core first (CJFPF) • Scheduling polices improve performance for asymmetric workloads. • Future work • Develop a prediction mechanism • Evaluate symmetric workloads on AMPs • Other kinds of heterogeneous architectures

Thank you!

Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Presentation Transcript

Scheduling Algorithms

Power-aware scheduling

9. Code Scheduling for ILP-Processors

Scheduling Algorithms

Cache Utilization-Aware Scheduling for Multicore Processors

Scheduling algorithms for CIOQ switches

ACCESS: Smart Scheduling for Asymmetric Cache CMPs

Optimal Algorithms for Task Scheduling

Scheduling Algorithms

Energy Aware Real Time Systems - Scheduling algorithms

Temperature-Aware Job Scheduling

Scheduling algorithms for CIOQ switches

Parallel Algorithms for array processors

Scheduling Algorithms

Scheduling Algorithms

Be-Nice Scheduling for embedded SMT processors

Parallel Algorithms for array processors

“Temperature-Aware Task Scheduling for Multicore Processors”

Energy Aware Real Time Systems - Scheduling algorithms

Power-aware scheduling

Approximation Algorithms for Scheduling

Asymmetric Cryptographic Algorithms