1 / 15

Standard-Cell Mapping Revisited

Standard-Cell Mapping Revisited. Alan Mishchenko Robert Brayton (with special thanks to Satrajit Chatterjee and Niklas Een) Department of EECS UC Berkeley. Overview. Introduction Why mapping is a fundamental problem History of technology mapping in ABC

wilmere
Download Presentation

Standard-Cell Mapping Revisited

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Standard-Cell Mapping Revisited Alan Mishchenko Robert Brayton (with special thanks to Satrajit Chatterjee and Niklas Een) Department of EECS UC Berkeley

  2. Overview • Introduction • Why mapping is a fundamental problem • History of technology mapping in ABC • Motivation to continue working on mapping • Technical part • How one new idea led to three new mappers in one year • Several other ideas • Our most recent work • Wrap-up • Preliminary experimental results • Conclusions and future work 2

  3. Pros and Cons of Load-Independent Delay Model • Approximations are inevitable • An approximation is “proper” when it allows us to simplify a problem without missing essential points • In our experience, load-independent delay model is a “proper” approximation • It simplifies mappers, allows them to scale, and leads to next stages when more accurate models are used • Gain-based approach, itself an approximation, enables this delay model • From now on, we use load-independent model

  4. Delay Optimality • Computing best arrival time at each node in a topological order from inputs to outputs ensures that the earliest possible arrival time at the outputs is found

  5. Area Recovery • Two complementary heuristics are traditionally used for area recovery • Global view heuristic (area flow) • Combines area of a cone and fanout count by computing an “average” area per fanout • Local view heuristic (exact area) • Provides a detailed view of each gate and allows mapping to be locally optimized • (These are somewhat similar to global and detailed placement)

  6. Mapper Pseudo-Code • Pre-compute functions implementable using the library • Currently, we only look at single gates (no “super-gates”) • Enumerate cuts for the subject graph • In practice, we enumerate all K-feasible cuts but store only those that have matches with the library • Iterate over the subject graph • Forward passes • First pass computes best delay • Next few passes minimize area-flow under delay constraints • Next few passes minimize exact-area under delay constraints • Backward passes • First backward pass produces a legal mapping to be incrementally improved • All backward passes compute required times • Write out the mapped network

  7. New Idea Used • Keep not one, but two cuts at each node • A delay-oriented cut • Guarantees that the node can meet required times • An area-oriented cut • Allows for area optimization to kick in, if possible • How it impacts the implementation • Different procedure to assign matches • Different way of computing required times • Different procedure to produce a legal mapping • Consequences • The QoR improves • The implementation is more complex but not prohibitively so

  8. Impact on Area Recovery • Area flow computation remains roughly the same • Exact area computation took some time to implement correctly and efficiently • The main difficulty was • Using exact area in the forward pass is prohibitive in terms of runtime • Using exact area in the backward pass requires clever way of propagating required times • After several failures, an efficient implementation was found

  9. Several Other Ideas Used • Integers instead of floating-point numbers can be used to represent timing information • Makes implementation platform-independent • Cuts with matches do not have to be recomputed by the mapper in each round • We precompute and store them to reduce runtime • Area recovery using the exact-area heuristic can be efficiently performed in a reverse topo order • Reduces runtime while keeping the same quality

  10. Comparison of ABC Mappers (delay, before synthesis)

  11. Comparison of ABC Mappers (delay, after synthesis)

  12. Comparison of ABC Mappers (area, before and after synthesis)

  13. Conclusions • Introduced technology mapping • Elaborated on one interesting idea • Reviewed current results and future work

  14. Future Work • Small ideas • Skip dominated matches (2x less runtime) • Select better alternative cuts during area recovery (area 1-2% better) • Big ideas • Use load-dependent delay model • Combine mapping with buffering and sizing

  15. Abstract • Technology mapping is one of the fundamental problems, along with such problems as circuit restructuring and SAT solving. • One flavor of technology mapping looks into minimizing both area and delay, or rather area under delay constraints. • A new approach to building delay/area-aware mappers for standard cells and FPGAs was recently proposed. The main idea of this approach is to store two cuts at each node (delay-oriented and area-oriented) rather than one cut, as in much of previous work. • This presentation surveys our experience developing a standard-cell mapper based on these ideas

More Related