660 likes | 1.17k Views
Computer Aided Analog Circuit Design for Reliability. Elie Maricau and Georges Gielen ESAT–MICAS K.U.Leuven, Belgium emaricau@esat.kuleuven.be. Contents. Introduction Reliability Effect Modeling Reliability Simulation Reliability-aware Design Conclusions.
E N D
Computer Aided Analog Circuit Design for Reliability Elie Maricau and Georges GielenESAT–MICAS K.U.Leuven, Belgium emaricau@esat.kuleuven.be
Contents • Introduction • Reliability Effect Modeling • Reliability Simulation • Reliability-aware Design • Conclusions
Towards Systems-on-Chip (SoC) • Move to increased levels of integration • reduced cost, size/volume, power • improved performance • Increasing chip complexity • integrated heterogeneous systems • mixed hardware/software • mixed RF/analog/digital [Staszewski, ISSCC ‘08]
Scaling to atomistic scale devices… • Nanometer CMOS scaling problems: • Noise problems (signal integrity) • Leakage (digital) • Channel length modulation • Velocity saturation • Mobility degradation • Drain induced barrier lowering (DIBL) • Parasitic effects • IC reliability • … [ITRS 09]
Spatial Unreliability manufacturing process variations random defects IC Reliability … [Chandra IOLTS 09]
the IC manufacturing suffers from defects and from inherent fluctuations results in faulty chips and in fluctuations in circuit performances yield smaller than 100% affects profitability of IC manufacturing process Spatial Unreliability [Bernstein, IBM Journal 06]
Spatial Unreliability manufacturing process variations random defects Dynamic Unreliability workload dependent temperature variations EMC IC Reliability … [Chandra IOLTS 09]
Power, Voltage, Temperature variations EMC EOS ESD … Dynamic Unreliability
Spatial Unreliability manufacturing process variations random defects Dynamic Unreliability workload dependent temperature variations EMC Temporal Unreliability ageing effects IC Reliability … [Chandra IOLTS 09]
IC level Electro migration Stress voiding Bias Temperature Instability (BTI) Hot Carrier Injection (HCI) Time Dependent Dielectric Breakdown (TDDB) ... PCB Corrosion Solder cracking ... Packaging Bond wire sheering ... Temporal Unreliability
Reliability Assessment Device Level Accelerated stress tests on individual devices Device failure criterion is chosen arbitrarily (e.g. DVTH>50mV) Circuit Level Test for Reliability (TFR) e.g. screening, life test, burn-in,… Design for Reliability (DFR) Transistor aging models Reliability simulation tools DFR TFR
Contents • Introduction • Reliability Effect Modeling • Reliability Simulation • Reliability-aware Design • Conclusions
What do we need? • Compact models for all important unreliability effects • Include all important factors • e.g. W,L, Vgs, Vds, T, … • Include interaction effects • e.g. Vds-Vgs for HCI • Cover a broad range of factor values • e.g. Vgs = [0V … 1.5V], W=[0.08mm-10mm] • Model time-varying stress effects • e.g. Vgs(t)= VGS + sin(0.5,1e6)
Reliability in Nanometer CMOS n+ n+ • Process variability • Hot Carrier Degradation • NBTI (PBTI) • Time Dependent Dielectric Breakdown [ITRS 2009]
Process Variability Line edge roughness Random dopant fluctuations [Pelgrom, JSSC 89] [Bernstein et al. IBM Journal 2006]
Hot Carrier Degradation channel hot carrier A well known phenomenon (>25 years) Interface traps due to impact ionization Dominant for NMOS in saturation high VDS high VGS Impact at device level DVTH, Db, Dgo n+ n+
Hot Carrier Degradation • ESAT-MICAS model [Maricau ESREF08] • Based on Reaction-Diffusion (RD) model • Includes all important transistor parameters (Vgs, Vds, L, T) • DC and AC voltage stress
Hot Carrier Degradation [Maricau ESREF08]
Negative Bias Temperature Instability p+ p+ • new phenomenon (<5 years) • important for pMOS • traps due to electro-chemical reaction with SiH • large VGS • temperature activated • relaxation phenomenon • Interface traps: permanent part • Oxide traps: recoverable part • Impact at device level • DVTH, Db, Dgo
Negative Bias Temperature Instability • ESAT-MICAS model [Maricau EL10] • Model permanent (P) and recoverable (R) component • Includes important transistor parameters (Vgs, T) • DC and AC voltage stress
Negative Bias Temperature Instability [Maricau EL10]
Time Dependent Dielectric Breakdown PMOS and NMOS Statistical phenomenon Gate current increases high VGS soft BD – Ig noise hard BD – kW gate resistance tSBD tHBD
Time Dependent Dielectric Breakdown • Soft Breakdown • 65nm technology • Example SBD: • 1V gate stress • 10 year stress time • Time to BD follows a Weibull distribution [Maricau DATE11]
Transistor Reliability in Sub 65nm CMOS • Aging becomes worse • EOT reduces • Eeff increases • New materials (High-k) • PBTI • SiO2 Interfacial Layer • NBTI, HC, TDDB remains [Gielen DATE11]
Transistor Reliability in Sub 65nm CMOS • Everything becomes stochastic • NBTI • PBTI • Hot Carrier degradation • Soft Breakdown • Process variability [Huard, IRPS08]
Bias Temperature Instability • Stochastic BTI model • Individual charges can change DVTH • Poisson distribution for number of trapped charges • (N=mean number of traps) • Exponential distribution for the impact of an individual defect (h = average impact) • DVTH=f(Vgs,T) • s(VTH)=f(1/(WL)) [Gielen DATE11]
Transistor Model [Maricau, ESREF08, DATE11]
Contents • Introduction • Reliability Effect Modeling • Reliability Simulation • Reliability-aware Design • Conclusions
Reliability Simulation • IC Analysis: Performance(t)? • Time-varying stress (Analog!) • Gradual OP shift (iteration in software) • Similar to ELDO and RelXpert [Maricau, DATE09, TCAD10]
Example: LC-VCO • 5 GHz low phase noise • High output swing • High LC-tank Q-factor • Protective gate-capacitors (DC-bias not shown) • UMC 90nm
Nominal Simulation • AC simulation shows sudden Vout degradation (due to go degradation) • No frequency degradation • Failure due to Hot Carrier damage
Variability awareness? Tfail,nom Tfail,20% • Process variability introduces stress variability • Transistor aging + process variability = yield(t) [Maricau TCAD10]
Variability Aware Reliability Simulation Deterministic Reliability Simulation Performance Space (Circuit Dependent) Factor Space (Process Variability) ? Yield (Application Dependent)
Performance Space Exploration ? • Option 1: Monte-Carlo • Monte-Carlo loop around nominal reliability simulation • Chi-square goodness-of-fit to find a good PDF at every time-point • Accurate but very slow • Option 2: Design of experiments (DoE) + Response Surface Model (RSM) • Goal: faster while maintaining accuracy • Means • DoE: make every sample count! • Monte-Carlo on RSM
Variability Aware Reliability Simulation • Factor Space Exploration • Screening • Linear model • Detect interactions • Regression • Interactions • Weak non-linear effects • Polynomial RSM • Residual analysis • Error estimation [Maricau DATE 10]
Stochastic Reliability Simulation • Stochastic Unreliability Effects • Breakdown • BTI in sub 45 nm Technologies • Circuit Aging • Time-dependent transistor parameter shift • e.g. DVTH=f(t) • Time-dependent transistor parameter standard deviation • e.g. s(VTH)=f(t)
Stochastic Reliability Simulation • Treat aging effects as a time-dependent factor • Spatial factors • Time-dependent factors • More factors means longer simulation time! • Factor minimization based on sensitivity analysis [Maricau DATE 11]
Sensitivity Analysis Current Mirror • 8 transistors + 2 resistors: 37 factors • After sensitivity analysis: 25 factors
6-bit Current-Steering DAC • Transimpedance amplifier • Not sensitive to aging • Current sources • SBD effects cause time-dependent transistor mismatch
IDAC Simulation Results [Maricau DATE 11]
Contents • Introduction • Reliability Effect Modeling • Reliability Simulation • Reliability-aware Design • Conclusions
Design for Failure Resilience • Intrinsically robust circuits • Worst-case overdesign to account for aging effects • Consumes extra power and area • Self-healing circuits • adapt circuits at run-time to compensate for the degradation • reconfiguration or tuning of the circuit • digital calibration • required performance is kept, although degradation is present Tools are needed to analyze the circuit at design time and to find adequate solutions!
Intrinsically Robust Circuits [Maricau DATE 10]
Self-healing Circuits Run-time adaptability adapt circuits at run-time to compensate for the degradation reconfiguration or tuning of the circuit required performance is kept, although degradation is present concept of Knobs and Monitors:
Example: A High-Voltage Line Driver • Output driver overview: • Equivalent Model: [Serneels, ISSCC2007]
Example: A High-Voltage Line Driver • Guarantee minimum efficiency • Breakdown monitors [De Wit DRVW08]
Example: A High-Voltage Line Driver • Measurements are ongoing (65nm CMOS)