200 likes | 367 Views
Impact of Parameter Variations on Multi-core chips E. Humenay, D. Tarjan, K. Skadron Department of Computer Science University of Virginia. Motivation. Process variations are projected to severely impact the yield of high-performance semiconductors
E N D
Impact of Parameter Variations on Multi-core chips E. Humenay, D. Tarjan, K. Skadron Department of Computer Science University of Virginia
Motivation • Process variations are projected to severely impact the yield of high-performance semiconductors • Multi-core architectures have become the future trend of high-performance chips • Understanding how process variations interact with CMPs is required
Variation Types • PVT Variations • Process • Voltage • Temperature This work primarily focuses on process variations
Process Variations • P variations stem from a variety of sources • Within-Die (WID) • Die-to-Die (D2D) • Wafer-to-Wafer (W2W) • Core-to-Core (C2C)
WID Variations • WID variations can be further sub-divided • Systematic (WIDsys) • Random (WIDrand) • Threshold voltage, Vth, and effective channel length, Leff, are the 2 parameters most susceptible to random variations • Systematic Variations cause parameter values to be spatially correlated • Can be modeled as deterministic or random • WID variations cause C2C variations
Drain Induced Barrier Lowering (DIBL) • Ideally, Vth and Leff values are independent of each other • The DIBL effect introduces a dependency • DIBL causes there to be an exponential dependency between Leff and sub-threshold leakage
Modeling Methodology • In order to estimate the impact of P variations on delay it is necessary to have a critical path (CP) model • Prior CP models vary inputs into RC delay equation for Monte-Carlo analyses. • Simplicity comes at the expense of accuracy.
CP Modeling: Prior Work • Fmax GCP model (Bowman, JSSC ‘02) • Ncp ~ Number of critical paths • Lcp ~Number of gates in critical path (Logic Depth) • Marculescu DAC ’05 • Ncp ~ stage’s device count. Ncp Lcp
Importance of Ncp • As Ncp increases mean delay increases and delay variation decreases Ncp
Modified CP Model • Goal: More accurately describe each functional unit’s delay distribution in order to determine which functional units will affect the final frequency distribution • Improvements • Considering wire delay when determining Lcp • Better Ncp assignments • Importance of Weff:
Modified CP Model • Categorize each stage as being either SRAM or combinational logic • SRAM • L1s • TLBs • Register File • Rename Map • Issue Queue • Logic • Execution Units • Decode Stage • Issue Select
SRAM model • Modified version of CACTI 4.0 is used to estimate fraction of access time susceptible to device variations • Ncp ~ number of read ports • Weff is dependent on unit type • L1 caches are assumed to be optimized for area (minimal sized Weff) • Time critical SRAM units have larger widths (Assume 5x larger than min) • Only consider variation in SRAM access time
Combinational Logic Model • Logic model is based off of Sklansky adder • Delay modeled with Horowitz delay equation • Critical path is carry circuitry • Weff is chosen to alleviate fan-out delay
WIDrand: SRAM delay • Because of large Ncp L1 is likely to be slowest SRAM unit • Nominal Frequency is 3GHz
WIDrand: SRAM vs. Logic • L1 will also be slower than logic
WIDsys Pattern • WIDsys model is derived from actual measurements (Friedberg ISQED’05) Fast, High-leakage Leff 28 POWER4-like core scaled to 45nm 27 14mm 26 Slow, Low-leakage 25 14mm
Impact of WIDsys on Delay • WIDsys can cause frequency from core-to-core to differ by as much as 5% • Large Lcp value causes combinational logic units to be more affected by WIDsys variation
Random Leakage Variation • WIDrand will not have an impact on leakage at the architectural level since total leakage is an aggregate sum Number of Transistors
C2C Leakage Variation • Figure shows core leakage when considering all possible core locations on a die • 3 different magnitudes of DIBL are considered • BSIM suggests .15 (best-case)
Conclusions • L1 caches will determine the WID mean frequency. Variations in other units will not directly affect the frequency distribution • Considering wire delay in CP model causes device variations to have less of an impact on the frequency distribution • WID variations do not result in significant C2C frequency differences • At 45nm, C2C sub-threshold leakage variation may be as much as 45%