390 likes | 489 Views
The ABOVE (Angle-Based On-Chip Variation Estimation) Technique for a Process Variation-Immune Design. Rasit Onur Topaloglu University of California San Diego rtopalog@cse.ucsd.edu.
E N D
The ABOVE (Angle-Based On-Chip Variation Estimation) Technique for a Process Variation-Immune Design Rasit Onur Topaloglu University of California San Diegortopalog@cse.ucsd.edu Dennis Lau National Semiconductordennis.lau@nsc.com Hosam HaggagNational Semiconductorhosam.haggag@nsc.com
Rasit Onur Topaloglu Outline • Motivation • Process and on-chip variation model • Simulation Cost Reduction • Validation methodology • Experimental Results • Conclusions • Implementation
Motivation Rasit Onur Topaloglu • Logic path correlations are accounted for, but it is not satisfactory Cell locations contain a valuable systematic information • Process variations, if not considered properly, may cause chips to fail or prone designs to be impossible to attain a spec. Increases design time and reduces yield • Wafer sizes increasing, control in manufacturing progressing and feature sizes shrinking A simplistic process model is possible • Currently, the Synopsys static timing analysis tool, PrimeTime, neglects cell locations by default
Rasit Onur Topaloglu Oxide Distribution on Wafer 30cm wafer, 0.13m, SEM Oxide distribution seems to be circular & continuous Ref: Intel Technology Journal, Vol. 06, Issue 2, May 2002
Rasit Onur Topaloglu Process and OCV Models
Rasit Onur Topaloglu 1.05 1.1 1.2 1.15 Modeling of Process Variations : “Volcano Model” Equ-speed circles on wafer Cell speed Cell speed Distance from center Distance from center • Variation curves are circular • Increasing or decreasing cell speeds along radius • Effects such as oxide variation, threshold voltage variation, lumped as cell speed variation
Rasit Onur Topaloglu Acquisition of the Volcano Model • Design companies create their own test chips with different library blocks arranged in an oscillator or pulse time measurement device .. 1 1 1 1 1 • The test chips provides average cell speeds for each cell type on a particular region of wafer wherever the test chip is located • Test chips are replicated over wafer • A statistical analyses such as least squares fit can then be used to fit the model on data to identify the (DC) distribution center, following a normalization step to set cell speed at DC to 1
Rasit Onur Topaloglu Second Order Extensions to the Model • If test chips are small, a large number of them can be replicated over the wafer • It becomes possible to identify more than one distribution center • Elliptic contours may turn out to be more suitable
Rasit Onur Topaloglu The ABOVE Technique • Particular location where a chip sits on wafer not known before fabrication • The idea is to place a chip on particular locations on wafer and make sure that designing for these locations sufficient although the real chip might fall somewhere else on wafer • The ABOVE technique is developed assuming Volcano model is valid; it can potentially be migrated to other process models
Rasit Onur Topaloglu Virtual Locations • Simulated locations shall not necessarily correspond to actual locations allocated for chips on wafer
Rasit Onur Topaloglu Dominant Angles • Simulated locations usually will be on different sides of the distribution center, hence these locations can be identified by an angle with respect to the DC (distribution center) • The reason is that chips on these locations will have process variations effecting them from different directions fast cells DC slow cells • Notice how chips on different places have differing speed variations on them
Rasit Onur Topaloglu Process curves on chip chip1 -model -real 1 2 chip2 • Since chips are small, circle arcs approximated to be straight lines Linear Approximation of Process Variation Curves DC 1 • On-chip variation available as std. dev. only Assumption : max. on-chip speed variation • Cell speeds will be effected depending on angle wrt DC
Rasit Onur Topaloglu Location of Chip Matters max deviation 0 deviation DC • Equ-lines are taken to be parallel to each other and normal to the line • that connects distribution center and closest corner of chip • Each cell assigned a speed deration such that nearest and farthest cell has 0 and maximum deviations from nominal respectively
Rasit Onur Topaloglu Calculation of Speed Variation p A B |A//B| / |B| ratio is used to find process variation effect at location p • Multiply this ratio by maximum on-chip variation to find cell speed
Rasit Onur Topaloglu Simulation Cost Reduction
Rasit Onur Topaloglu B A 1.05 1.1 1.15 1.2 Hypothesis I : Chips on Same Angle • It is satisfactory to check the location that covers more contours as compared to other locations on the same dominant angle • In the example, location A&B on same dominant angle, but it is satisfactory to check B as it contains more contours lines
Rasit Onur Topaloglu Hypothesis II : Dominant Angles • There exist a number of angles for which the timing check of a chip at virtual locations at these angles will satisfy the check of all remaining angles in between them • In the example, 8 dominant locations checked, it is not necessary to check the location indicated by the purple box.
Rasit Onur Topaloglu Hypothesis III : Equivalence of Complementary Angles 2 3 1 4 8 5 7 6 • Checking for increasing process variation at an angle is equivalent to checking decreasing across-chip variation at its complementary angle • In the example, checking for increasing process variations at location 1-2-3-4 is equivalent to checking for decreasing process variations at locations 5-6-7-8
Rasit Onur Topaloglu Hypothesis IV : Passiveness of Common Variations 1.1 1.32 1.05 1.26 1 1.2 • Spatial variations are dominant in terms of resulting in worst case estimations as compared to common variations in all cells • In the example, process contours are normalized with respect to 1.2, which is the common variation of the cell. Checking either cell is satisfactory
Rasit Onur Topaloglu Choice of Dominant Angles • For a first order approximation, angles can be chosen as equally distributed • In our experiments, we have used 8 angles separated by 45o each 2 3 1 4 8 5 7 6 • An analytical solution needs to be implemented
Rasit Onur Topaloglu DC1 DC2 Extending ABOVE to include Second Order Modeling Effects • For models with more than one DC, a chip can be affected by various directions • A cross-over line is assumed for certain locations, number of simulations increase • In the example, more locations would appear on the cross-over line
Rasit Onur Topaloglu Test and Validation of Proposed Methods
Rasit Onur Topaloglu Comparison with Probabilistic Cell Speeds For each dominant location angle { Run script that changes cell speeds of a chip using a uniform distribution given max. on-chip variation Run script that changes cell speeds of a chip using a Gaussian distribution given max. on-chip variation Compare minimum setup times and hold times with a location based deterministic run } Used to show that location based variations can be deteriorating as compared to probabilistic models due to systematic variation
Rasit Onur Topaloglu Proof I : Checking Validity of Method for Chips on Same Angle For a number of variations up to max on-chip variation { Run script that changes cell speeds of a chip at angle } Compare minimum setup times and hold times runs Used to show that for chips at same angle, simulating the chip that includes most contours is satisfactory
Rasit Onur Topaloglu Proof II : Checking Validity of Dominant Angles For a number of angles other than dominant angles { Run script that changes cell speeds of a chip given that angle Check that minimum setup or hold times are higher than found using dominant locations } Used to show that simulating for chips at dominant locations satisfactory for making sure that it will work on any location
Rasit Onur Topaloglu Proof III : Checking Validity of Complementary Angles For each dominant angle { Compare results of ABOVE with the complementary angle } Used to show that simulating for chips at complementary locations with differing magnitudes is equivalent
Rasit Onur Topaloglu Proof IV : Checking Validity of Common Variations For a number of common variations { Compare results and check if they give reasonably close results } Used to show that simulating for chips need not be done for various common variations
Rasit Onur Topaloglu 0.1242 when less variation used • Hypothesis I supported Experimental Results Setup (max delay) 0.1243 Nominal Location based 0.1001 Uniform random 0.1206 Gaussian random 0.1234 • Up to 20% variation in minimum slack observed on a microprocessor testcase • Or, try setting clock to 1GHz whereas your chip can run @ 800MHz on most locations on wafer
Rasit Onur Topaloglu 1.74 3.92 5.15 1.74 -1.70 -3.29 1.71 1.40 0.53 0.53 max delays paths max delays paths for setup max data min clock delays for setup overest. underestimate Where Location Based Method fits in PrimeTime? WC TYP BC BC/WC OCV Setup (max delay) Hold (min delay) Location based falls here, more realistic than both directions
Rasit Onur Topaloglu A Qualitative Comparison with Pre-existing OCV Methods • PVT effects are considered as being lumped similarly • Either a bounding-box or a pair-wise deration used while taking capture blocks are reference. Both approaches are local solutions with increased computation; ABOVE brings a global approach by taking DC’s as reference • [https://solvnet.synopsys.com/retrieve/013562.html]
Rasit Onur Topaloglu Conclusions • Dominant locations provide a means to reduce simulation time, yet integrate more accurate process variation effects • Probabilistic models fail to be satisfactory as they neglect deterministic systematic relationship between cells • Location based variation fits on a more realistic scale as compared to built-in PrimeTime methods
Rasit Onur Topaloglu Future Directions • Proper selection of dominant locations • Incorporation of interconnect delay variations A layout based mathematical approach
Rasit Onur Topaloglu Implementation
Usage of Astro Output Rasit Onur Topaloglu #this PERL script is used to input Astro data from the files cells.txt and locations.txt and output cellnloc.txt file that will be read by PrimeTime to update cell speeds. #!\bin\perl open (INFILE,"cells.txt"); open (IN2FILE,"locations.txt"); open (OUTFILE,">cellnloc.txt"); $line1=<INFILE>; @lines2=<IN2FILE>; $file2length=$#lines2; $_=$line1; s/{//; s/}//; @cellnames=split(/, /,$_); $maxcellno=$#cellnames; #shows one less than total in fact print "maxcellno=$maxcellno "; $found=1;#for reporting an assertation when cells are not found in locations.txt taken from Astro floorplan for ($i=0;$i<=$maxcellno;++$i) { foreach $line (@lines2) {$_=$line; if (/$cellnames[$i]/) { ($discard,$location)=split(/'/,$_); push(@locations,$location); print OUTFILE "$cellnames[$i]'$location"; $found=0; last; } } if ($found) {print OUT2FILE "$cellnames[$i]";} $found=1;}
Rasit Onur Topaloglu Example of Output for Previous Step # The following is example entries in the text file cellnloc.txt acquired through previous script "U1"'(121.48 267.47) "U11"'(151.38 392.93) "U12"'(189.56 378.17) "U13"'(150.46 444.59) "U14"'(166.56 319.13)
Creating PrimeTime Script Rasit Onur Topaloglu #The following PERL script is used to use cellnloc.txt and create a PrimeTime script to be sourced by the main PrimeTime script to update cell speeds according to the ABOVE technique. #!\bin\perl open (INFILE,"cellnloc.txt"); #maximum on-chip variation of 20% is specified here $maxdiagder=0.2; #a dominant angle is specified here in radians $angle=0.78537; #chip size specified here according to Astro output $wid=811.06; $len=808; #set to 0 for slow operating mode my $fast=0; #constant definitions my $pi=3.1415; my $piover2=$pi/2; my $piover4=$pi/4; open (OUTFILE1,">derate_cell.pt"); @lines=<INFILE>; foreach $line1 (@lines){ #strip away left and right parentheses $line1=~ s/\(//; $line1=~s/\)//; $line1=~s/ /'/; $_=$line1; ($cname,$loc1,$loc2)=split(/'/,$_);
Rasit Onur Topaloglu Implementing Operations According to a Dominant Angle if ($angle <=atan2($len,$wid)){ $refx=$wid; $refy=$wid*sin($angle)/cos($angle); $hyporef=sqrt($refx*$refx+$refy*$refy); $hypoB=(($refy)*($len-$refy)/$hyporef)+$hyporef; $Bx=cos($angle)*$hypoB; $By=sin($angle)*$hypoB; } elsif ($angle <=$piover2){ $refx=$len*cos($angle)/sin($angle); $refy=$len; $hyporef=sqrt($refx*$refx+$refy*$refy); $hypoB=(($refx)*($wid-$refx)/$hyporef)+$hyporef; $Bx=cos($angle)*$hypoB; $By=sin($angle)*$hypoB; } . .
Rasit Onur Topaloglu Implementation of Projection and Normalization #dot product of A and B $dotprod=($loc1*$Bx)+($loc2*$By); #x value for A projected on B $projx=$dotprod*$Bx/($hypoB*$hypoB); #y value for A projected on B $projy=$dotprod*$By/($hypoB*$hypoB); #length of A projected on B $line=sqrt ($projx*$projx+$projy*$projy); #length of reference line if ($fast==1){ $derval=1+($line/$hypoB)*$maxdiagder; } else { $derval=1-($line/$hypoB)*$maxdiagder; } print OUTFILE1 "set_timing_derate $derval $cname\n"; close INFILE; close OUTFILE1;
Rasit Onur Topaloglu Output of Previous Step # The following is example entries in the PrimeTime file derate_cell.pt created by the script above. set_timing_derate 1.02995586023229 "U1" set_timing_derate 1.03732892757626 "U11" set_timing_derate 1.04674376741548 "U12" set_timing_derate 1.03710206396567 "U13" set_timing_derate 1.04107217715089 "U14"