TimeGraph: GPU Scheduling for Real-Time Multi-Tasking Environments
Xiaozhen Li
Agenda
• The Project
• Introduction
• System Model
• TimeGraph System Architecture
• TimeGraph GPU Scheduling
• TimeGraph GPU Reservation
• Evaluation
What is TimeGraph
• TimeGraph is a GPU command scheduler, running as part of the GPU device driver, that protects GPU applications from performance interference.
• Main contributions:
  • Priority-driven scheduling of GPU commands issued from the user-space runtime
  • GPU resource reservation
Project
• Main Contributors:
  • Shinpei Kato: Carnegie Mellon University
  • Karthik Lakshmanan: Carnegie Mellon University
  • Ragunathan (Raj) Rajkumar: Carnegie Mellon University
• Project Main Page:
  • http://rtml.ece.cmu.edu/projects/timegraph/
Related Publications
• TimeGraph: GPU Scheduling for Real-Time Multi-Tasking Environments (USENIX ATC'11)
• Resource Sharing in GPU-accelerated Windowing Systems (RTAS'11)
• Operating Systems Challenges for GPU Resource Management (OSPERT'11)
• RGEM: A Responsive GPGPU Execution Model for Runtime Engines (RTSS'11)
Burgeoning of GPU Applications
The growing number of GPU applications creates a real-time multi-tasking requirement.
Problem
• Real-time applications should be prioritized.
• Important applications must be well-isolated from others.
GPU Software Stack (figure)
Contributions
• Design and implement a GPU scheduler that provides prioritization and isolation capabilities for GPU applications
• Adopt an event-driven scheduler model that synchronizes the GPU with the CPU in a responsive manner
• Two scheduling policies and two resource reservation policies
Scope and Limitations
• A generic multi-core CPU and an on-board GPU
• Independent of libraries, compilers, and runtime engines
• Designed and implemented for Nouveau
• Ported to the PSCNV open-source driver
Driver Model
• Some important terms:
  • Direct Rendering Infrastructure (DRI)
  • X Window System
  • Fermi
  • Tesla
  • GPU channels
Components
• Scheduling policies:
  • Predictable-Response-Time (PRT)
  • High-Throughput (HT)
• Reservation policies:
  • Posterior Enforcement (PE)
  • Apriori Enforcement (AE)
Specification
TimeGraph reads a specification file (/etc/timegraph.spec) every time a new GPU channel is allocated to a task. Each entry has the format <name:sched:resv:prio:C:T>; an illustrative entry is sketched below.
• Priority Assignment
• Admission Control
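An illustrative entry, following the field order above; the values and the millisecond unit are assumptions for the sake of the example, not taken from the paper:

```
# /etc/timegraph.spec -- one entry per line, format <name:sched:resv:prio:C:T>
# Hypothetical example: run "mplayer" under the PRT scheduling policy with a
# PE reserve of capacity C = 10 ms every period T = 40 ms at priority 2.
mplayer:PRT:PE:2:10:40
```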
Scheduling Policies: The Predictable-Response-Time (PRT) Policy
• When the GPU is not idle, GPU commands are queued.
• When the GPU becomes idle, queued GPU commands are dispatched.
Scheduling Policies: The High-Throughput (HT) Policy
• When the GPU is not idle, GPU commands are queued only if their priority is lower than that of the currently-executing GPU context.
• When the GPU becomes idle, queued GPU commands are dispatched.
A dispatch sketch for both policies follows.
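A minimal C sketch of the two dispatch rules, assuming hypothetical helper names (gpu_is_idle, enqueue, dispatch, and so on) that are not part of the TimeGraph or Nouveau source:

```c
/* Hypothetical sketch of the PRT and HT dispatch rules.  All types and
 * helper functions below are illustrative, not the actual TimeGraph code. */

struct cmd_group { int prio; /* priority of the issuing task */ };

int  gpu_is_idle(void);                      /* assumed helpers           */
int  current_ctx_prio(void);
void enqueue(struct cmd_group *g);           /* put into the wait queue   */
void dispatch(struct cmd_group *g);          /* send commands to the GPU  */
struct cmd_group *dequeue_highest_prio(void);

enum tg_policy { TG_PRT, TG_HT };

void tg_submit(struct cmd_group *g, enum tg_policy policy)
{
    if (gpu_is_idle()) {
        dispatch(g);                         /* idle GPU: dispatch now    */
        return;
    }
    if (policy == TG_PRT) {
        enqueue(g);                          /* PRT: always queue if busy */
    } else if (g->prio < current_ctx_prio()) {
        enqueue(g);                          /* HT: queue only behind a   */
    } else {                                 /* higher-priority context   */
        dispatch(g);
    }
}

/* On a completion interrupt the highest-priority queued group goes next. */
void tg_on_gpu_idle(void)
{
    struct cmd_group *g = dequeue_highest_prio();
    if (g)
        dispatch(g);
}
```

The only difference between the two policies is the busy-GPU branch: PRT always queues, while HT dispatches immediately unless a higher-priority context currently owns the GPU.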
Interrupt Setting and Handling
• Completion Notifier:
  • Add the NOTIFY command to generate an interrupt from the GPU to the CPU
  • Add the NOP command after the NOTIFY command
  • Add the SERIALIZE command right before the NOTIFY command
• Interrupt Association:
  • Add the SET_REF command before the SERIALIZE command inserted above
  • Add another SERIALIZE command before the SET_REF command
  The final GPU command group tail is: SERIALIZE, SET_REF, SERIALIZE, NOTIFY, NOP (sketched below).
• Task Wake-Up
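A hedged C sketch of how that tail could be appended to a command group before submission; the emit_* helpers and the sequence-number argument are assumptions, only the command order comes from the slides:

```c
/* Illustrative only: append the interrupt-related tail to a GPU command
 * group in the order SERIALIZE, SET_REF, SERIALIZE, NOTIFY, NOP.
 * struct cmd_group and the emit_* helpers are hypothetical. */
struct cmd_group;

void emit_serialize(struct cmd_group *g);
void emit_set_ref(struct cmd_group *g, unsigned int seq);
void emit_notify(struct cmd_group *g);
void emit_nop(struct cmd_group *g);

void tg_append_tail(struct cmd_group *g, unsigned int seq)
{
    emit_serialize(g);     /* wait for the preceding commands to finish */
    emit_set_ref(g, seq);  /* write a reference value identifying g     */
    emit_serialize(g);     /* make sure SET_REF completes before NOTIFY */
    emit_notify(g);        /* raise the GPU-to-CPU completion interrupt */
    emit_nop(g);           /* padding after NOTIFY                      */
}
```

The SET_REF value presumably lets the interrupt handler associate the completion interrupt with the right command group and wake the corresponding task.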
Conclusion
• The PRT policy schedules GPU command groups based on task priorities so that high-priority tasks remain responsive on the GPU.
• The HT policy is suitable for tasks that should execute as fast as possible.
Basic Idea
• Each task is assigned a reserve that is represented by capacity C and period T.
• The budget e is the amount of time that the task is entitled to execute on the GPU.
• TimeGraph uses a popular rule for budget consumption:
  • The budget is decreased by the amount of time consumed on the GPU.
  • It is replenished by at most capacity C once every period T.
Reservation Support: The Posterior Enforcement (PE) Policy
• Command groups are submitted to the GPU only if the budget is greater than zero.
• An over-run penalty is imposed at the next budget replenishment: the budget for the next period is given by e = min(C, e + C).
A sketch of this accounting follows.
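A minimal sketch of the PE accounting, assuming a hypothetical tg_reserve structure and microsecond time units (neither taken from the paper):

```c
/* Illustrative PE reservation accounting; names and units are assumptions. */
struct tg_reserve {
    long capacity_c;   /* C: budget granted per period (us)   */
    long period_t;     /* T: replenishment period (us)        */
    long budget_e;     /* e: currently remaining budget (us)  */
};

/* Charge the GPU time actually consumed; e can go negative on an overrun. */
void pe_consume(struct tg_reserve *r, long used)
{
    r->budget_e -= used;
}

/* Once every period T:  e = min(C, e + C).  A previous overrun (negative e)
 * is therefore paid back out of the next replenishment. */
void pe_replenish(struct tg_reserve *r)
{
    long e = r->budget_e + r->capacity_c;
    r->budget_e = (e < r->capacity_c) ? e : r->capacity_c;
}

/* PE admission test: submit the next command group only while e > 0. */
int pe_can_submit(const struct tg_reserve *r)
{
    return r->budget_e > 0;
}
```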
Reservation Support: The Apriori Enforcement (AE) Policy
• First predicts the GPU execution cost.
• A GPU command group can be submitted to the GPU only if its predicted cost is no greater than the budget.
• The next replenishment amount depends on the predicted cost x and the currently-remaining budget e (see the admission sketch below).
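For comparison, a sketch of the AE admission test, reusing the hypothetical tg_reserve structure above; predict_cost() stands in for the command-profiling lookup described later and is not a real TimeGraph function:

```c
struct cmd_group;                              /* as in the earlier sketches */
long predict_cost(const struct cmd_group *g);  /* hypothetical profiler call */

/* AE admission: submit only if the predicted cost x fits in the budget e. */
int ae_can_submit(const struct tg_reserve *r, const struct cmd_group *g)
{
    return predict_cost(g) <= r->budget_e;
}
```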
Conclusion
• The PE policy is aimed at light-weight reservation, allowing tasks to overrun their reserves to an extent. It should be used when isolation is required.
• The AE policy reduces reserve overruns by predicting GPU execution costs a priori, at the expense of additional overhead. It should be used if extremely time-critical applications are concurrently executed on the GPU.
Command Profiling
• Prerequisites:
  • GPU applications repeatedly create GPU command groups with the same methods and data sizes.
  • GPU execution costs depend highly on methods and data sizes, but not on data values.
Command Profiling (table: average GPU execution cost per method)
Command Profiling
• Preemption impact:
  • The same GPU command group may consume very different GPU execution times when the GPU contexts (channels) are switched.
  • If the context is switched, TimeGraph does not update the average GPU execution cost. Instead, it saves the difference between the actual GPU execution time and the average GPU execution cost as the preemption impact, keeps updating it, and takes it into account when computing the predicted cost (see the sketch below).
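A hedged sketch of the profiling update, assuming a running average keyed by method and data size; the table layout, the averaging scheme, and every name here are assumptions rather than the actual TimeGraph implementation:

```c
#include <stddef.h>

/* Illustrative history-table entry keyed by (method, data size). */
struct tg_profile {
    unsigned int  method;
    size_t        data_size;
    long          avg_cost;        /* running average GPU execution cost */
    long          preempt_impact;  /* extra cost observed across context
                                      switches (the preemption impact)   */
    unsigned long samples;
};

/* Record a completed command group that matches this profile entry. */
void tg_profile_update(struct tg_profile *p, long actual, int ctx_switched)
{
    if (ctx_switched) {
        /* A context switch happened: leave the average untouched and
         * remember the extra time as the preemption impact. */
        p->preempt_impact = actual - p->avg_cost;
        return;
    }
    /* No context switch: fold the sample into the running average. */
    p->avg_cost = (p->avg_cost * (long)p->samples + actual)
                  / (long)(p->samples + 1);
    p->samples++;
}

/* Predicted cost used by the AE policy: average plus preemption impact. */
long tg_profile_predict(const struct tg_profile *p)
{
    return p->avg_cost + p->preempt_impact;
}
```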
Experimental Conditions
• Platform: NVIDIA GeForce 9800 GT; 1 GB of video memory; Linux 2.6.35 kernel; Intel Xeon E5504 CPU; 4 GB of main memory
• GPU-intensive workloads: OpenGL 3-D games
• Periodic workload: MPlayer
• Regularly-behaved workload: the Gallium3D Engine demo program
• Misbehaving workload: a GPU command bomb
• SPECviewperf is used to evaluate throughput