Mixed-Size Placement with Fixed Macrocells using Grid-Warping

Mixed-Size Placement with Fixed Macrocells using Grid-Warping Zhong Xiu*, Rob Rutenbar *Advanced Micro Devices Inc., Department of Electrical and Computer Engineering, Carnegie Mellon University

Placement by Grid-Warping • In [Zhong et al, DAC04], we showed first grid-warping placer • In [Zhong et al, DAC05], we showed our timing-driven placer • Fundamentally new idea for placement improvement • Imagine we place the gates on the surface of a flexible elastic sheet • We stretch the sheet to improve the placement Quadratic Initial placement Warp Placement surface Improved warped result Recurse& descendto continue

Grid Warping: Attractive Features • Novel paradigm for placement: optimize the grid, not the gates • Think of “gravity” – we reshape curvature of space to move the mass • Flexibly nonlinear • Free to warp anyway we like; not driven primarily by linear solves • Low-dimensional optimization problem • We only need to control the sheet, we don’t move gates individually • Early prototypes – WARP1, WARP2 – perform well • Competitive on wirelength with other published placers • As fast – or faster – than many other analytical placers

Organization of this Talk • What’s missing? To handle Mixed-Size Placement • Wirelength/Timing optimization is necessary but not sufficient • We must be able to handle mixed-size placement with fixed macrocells • First, we review basic mechanics of grid-warping • Second, we show how to extend grid-warping for mixed-size placement with fixed macrocells • Finally, we show our results and future directions

Review: Mechanics of Grid-Warping • It’s conceptually useful to think of warping as distorting a regular mesh placed on the elastic placement surface… • ..but this is not actually how we implement warping Quadratic Initial placement Warp Placement surface Improved warped result Recurse& descendto continue

Restore grids and pull gates back Initial placement mass and grids Warp grids and acquire gates We Formulate Warping in an “Inverse” Way • We warp to “acquire” a new set of gates in each unit grid area… • … then “pull” gates back to the undistorted grid, to move them

And We Do Not Use a Regular Warping Grid 2x2 Warping grid 4x4 Warping grid • Instead, we use a grid defined by a set of slicing cuts • It turns out this allows a greater range of motion for the gates • Yes—a lot like quadrisection or partitioning, but more general • The cuts need not be axis parallel • Because gates are fully placed in each region, we get real wirelength

Quadratic Placement Pre-Warping Nonlinear Grid-Warping Loop Improvement Decompose/Recurse Legalize (Domino) Complete Grid Warping Flow • Complete flow has several steps • We review them briefly here

Quadratic Placement Pre-Warping Nonlinear Grid-Warping Loop Improvement Decompose/Recurse Legalize (Domino) Complete Grid Warping Flow • Quadratic place onto elastic sheet • Note: pure quadratic wirelength • No reweighting steps

Quadratic Placement Pre-Warping Nonlinear Grid-Warping Loop Improvement Decompose/Recurse Legalize (Domino) Complete Grid Warping Flow • Geometric pre-conditioning step • Spreads gates out quickly, uniformly, to improve final wirelen

Quadratic Placement Pre-Warping Nonlinear Grid-Warping Loop Improvement Decompose/Recurse Legalize (Domino) Complete Grid Warping Flow • Nonlinear optimizer iteratively perturbs warping grid on sheet

Quadratic Placement Pre-Warping Nonlinear Grid-Warping Loop Improvement Decompose/Recurse Legalize (Domino) Complete Grid Warping Flow stretched • Nonlinear optimizer iteratively perturbs warping grid on sheet • ..each new warping is quickly “stretched” back to a full placement • Use this to eval cost function, which tracks ‘rectilinear wirelen + capacity’

Quadratic Placement Pre-Warping Nonlinear Grid-Warping Loop Improvement Decompose/Recurse Legalize (Domino) Complete Grid Warping Flow • Nonlinear optimizer delivers a final warped placement • Standard improvement step runs hMetis to optimize location of gates placed near partition cuts

Quadratic Placement Pre-Warping Nonlinear Grid-Warping Loop Improvement Decompose/Recurse Legalize (Domino) Complete Grid Warping Flow • Recurse: in this case, 4 new placements inside 4 regions • Continue until ~few gates/region

Quadratic Placement Pre-Warping Nonlinear Grid-Warping Loop Improvement Decompose/Recurse Legalize (Domino) Complete Grid Warping Flow • Warping flow delivers a final, but still slightly illegal, placement • Use Domino (T.U. Munich) to legalize to final detailed placement

Problem: Warped Placement with Macrocells • Assumptions • We focus on the fixed-macro case • The core problem • Warping is intrinsically weak at separating large macros and small gates • All instances modeled as points; elastic “stretching” keeps nearby points close

Handling Fixed Macrocells • 4 new geometric solutions • Inside warping, a geometric “hash” function that greedily re-locates gates that warp on top of macrocells • …and a new net model (QP) • During partition improvement, closer attention to size imbalances • New backend (FastPlace) Quadratic Placement Pre-Warping Nonlinear Grid-Warping Loop Improvement Re-Warping Decompose/Recurse Legalize (FastPlace)

Problem: nonlinear warping drops gates on top of fixed macros Solution: “hash” them off, inside warping loop Inside warping, inside each eval of global cost func, check if each cell overlaps macro If so, we push it to the nearest boundary that has enough space for the cell We chop chip up into small grids, store “nearest boundary” info in a hash table Note No attempt to manage density or wirelength in this solution, just legality (1) Geometric Hashing During Warping M M M M

If a column (row) contains two or more cells, a new cell is introduced and is connected to all the internal cells and all the propagated pins. If a column (row) contains only one cell, star model is used and no additional variables is introduced. (2) New Net Models • 1st QP & QPs in re-warping • For 2-pin nets, use the clique model, and set weight to 1. • For nets with 3 or more pins, use star model and introduce a new variable, each net has a weight: #Pins/(#Pins - 1). (FastPlace, ISPD’04) • 2nd QP and beyond • At 2nd layer of QP and beyond, use Jens Vygen’s net-split technique • Hybrid models, make QP faster and conserve quality

(3) Better Consideration of Capacity • Get rid of “Pre-Warping” • Cost Function: use a single-sided cost function, under-filled regions receive no capacity penalty • hMetis: the total area of all cells in each region does not exceed the capacity of that region • Re-Warping: all of the above mentioned modifications are applied here Quadratic Placement Pre-Warping Nonlinear Grid-Warping Loop Improvement Re-Warping Decompose/Recurse Legalize (FastPlace)

Use ideas from FastPlace [ICCAD’05] Swap based detail placement algorithm – greedy algorithm Find the “optimal region” for a cell, if the cell is not in this region, try to swap it with a cell or space in that optimal region Detailed backend flow First pass legalize FengShui 5.1 from SUNY Binghm Local repair for small macros/overlaps Local greedy swaps Wirelength minimization Chu’s FastPlace legalizer ideas Our new overall flow Run WARP as global placement Sort all the cells and ~legalize them Run the new detailed backend (4) New Backend

Macrocell Results – ISPD’02 • Wirelength • 3-4% better than Feng Shui • Competitive with BonnPlace • Run time • 2X Fengshui • ~ Competitive with BonnPlace • Exact comparison difficult – BonnPlace is running on an 1.45GHz IBM 4-processor server and is explicitly parallelized software • We’re on a 2.0GHz Xeon.

ISPD’02 Layouts ibm01 ibm04

More Macrocell Results – ISPD’05 • Versus APlace (Kahng@UCSD) • ~7% more wirelen • Total 41.75 hours for all 6 designs for Warp on 2.8GHz Xeon • Aside: ISPD’05 contest results: • APlace 1.00 • mFAR 1.06 • Dragon 1.08 • mPL 1.09 • FastPlace 1.16 • Capo 1.17 • NTUP 1.21 • Fengshui 1.50 • Kraftwerk+Domino 1.84

ISPD’05 Layouts adaptec2 adaptec4

ISPD’05 Layouts bigblue2 bigblue3

Conclusion and Future Work • Placement with macrocells – WARP3 – competitive • New techniques such as geometric hashing, improved hybrid net model • Can produce very good quality mixed-size placements reasonably quickly • Future Work • Improve both quality and runtime • Better handle macrocells and routing congestion • “Hybrid” layout strategies (warping, but a flatter, analytical style)

Mixed-Size Placement with Fixed Macrocells using Grid-Warping

Mixed-Size Placement with Fixed Macrocells using Grid-Warping

Presentation Transcript

Image Warping using Empirical Bayes

Reliable and Efficient Grid Data Placement using Stork and DiskRouter

Fixed, Random, and Mixed Effects ANOVA

Warping

MAPLE: Multilevel Adaptive PLacEment for Mixed-Size Designs

Routability Driven Analytical Placement for Mixed-Size Circuit Designs

A Size Scaling Approach for Mixed-size Placement

An Analytical Placer for Mixed-Size 3D Placement

Mixed-Size Placement with Fixed Macrocells using Grid-Warping

Handling Complexities in Modern Large-Scale Mixed-Size Placement

An Effective Clustering Algorithm for Mixed-size Placement

Computational grid size

Video Frames Interpolation Using Adaptive Warping

Optimizing Routability in Large-Scale Mixed-Size Placement

Grid Size

Constraint Graph-Based Macro Placement for Modern Mixed-Size Circuit Designs

An Analytic Placer for Mixed-Size Placement and Timing-Driven Placement

Warping

Warping

Using Agile Approach with Fixed Budget Projects

An Analytic Placer for Mixed-Size Placement and Timing-Driven Placement

Reliable and Efficient Grid Data Placement using Stork and DiskRouter