The Optimum Pipeline Depth for a Microprocessor

The Optimum Pipeline Depth for a Microprocessor Fang Pang Oct/01/02

The choice of the structure of the pipeline is fundamental in the design of a microprocessor. Is there an optimum pipeline depth for a microprocessor that gives the best performance?

There is a tradeoff between the greater throughput of a deeper pipeline and the larger penalty for hazards in the deeper pipeline.This tradeoff leads to an optimum design point.

Two intuitive ways to see that performance will be optimal for a specific pipeline depth : • CPI (Cycles / Instruction) • Cycle time of a processor

The true measure of performance in the processor is theaverage time it takes to execute an instruction. This is the time / Instruction (TPI) , the inverse of the MIPs ( Million Instructions per second ) number.The TPI is just the product of the cycle time and the CPI.

How a Processor spend its time? T = TBZ + TNBZ (TBZ : the time that the execution unit is doing useful work; TNBZ: the time that the execution is stalled by any of pipeline hazards.)

Processor’s busy time (TBZ) TBZ = NI* ts (NI: the number of instructions; ts: the time for an instruction to pass each stage of the pipeline ) ts =to + tp /p (to:the latch overhead for the technology used;tp: the total logic delay of the processor; p:the number of pipeline stages in the design) TBZ = NI* ( to +tp /p)

For a superscalar processor, multiple instructions may be executed at the same time. TBZ = ( NI /a ) * ( to +tp /p) (a: measure of the average degree of superscalar processing whenever the e-unit is busy)

Processor’s not-busy time (TNBZ) Considering each pipeline hazard causes a full pipeline stall TNBZ = NH*tpipe (NH:the number of pipeline hazards; tpipe: the total pipeline delay) tpipe= ts* p = ( to +tp /p)* p = to *p+tp The total pipeline delay is just the product of each pipeline stage delay, ts, and the number of pipeline stages in the processor. TNBZ= NH*( to *p+tp)

Considering each pipeline hazard has its own not-busy time (thazard : each hazard’s not-busy time; h:the fraction of the total pipeline delay encountered by each particular hazard, between 0 and 1) (γ: the fraction of the total pipeline delay averaged over all hazards) TNBZ=NH * (to * p+tp )*γ

Processor time T = TBZ + TNBZ = (NI /a)*( to + tp / p)+ NH*(to * p + tp)* γ NH / NI: depend on the workload being executed and microarchitecture (EX: branch prediction accuracy) a , γ : depend on microarchitecture and the workload to: depends ontechnology tp : depends on technology and microarchitecture

Optimum pipeline depth Popt2= ( NI tp )/ (a NHγto) • When we can have deeper pipeline? • NHdecreases, workloads have fewer hazards. • todecreases, technology reduces the latch overhead, relative to the total logic path, tp. • γdecreases, the fraction of the pipeline that hazards stall decreases.

Simulator result

Optimum pipeline depth’s various dependencies

Dependence on the degree of superscalar processing (a)

Dependence on the degree of pipeline hazard (NH,γ)

Summary A theory has been presented of the optimum pipeline depth for a microprocessor. The theory has been tested by simulating a variable depth pipeline model, and the two are found to be in excellent agreement. It is found that the competition between "storing" instructions in a deeper pipeline to increase throughput and limiting the number of pipeline stalls from various pipeline hazards, results in an optimum pipeline depth.

Discussion

The Optimum Pipeline Depth for a Microprocessor

The Optimum Pipeline Depth for a Microprocessor

Presentation Transcript

Characteristics of a Microprocessor

Validating A Modern Microprocessor

The Microprocessor

MICROPROCESSOR

MICROPROCESSOR

“The Depth”

Optimum

Microprocessor

THE 8086 MICROPROCESSOR

The first microprocessor

Programming the Microprocessor

The Picaxe Microprocessor

IRAM: A Microprocessor for the Post-PC Era

8. The microprocessor

The R10000 Microprocessor

Microprocessor

Microprocessor

Effect of pipeline depth in CPU efficiency

Microprocessor

Programming the Microprocessor

Foods For Optimum Health

IRAM: A Microprocessor for the Post-PC Era