1 / 38

Practical Approximation Algorithms for Separable Packing LPs

Practical Approximation Algorithms for Separable Packing LPs. F.F. Dragan (Kent State) A.B. Kahng (UCSD) I. Mandoiu (UCLA/UCSD) S. Muddu (Sanera Systems) A. Zelikovsky (Georgia State). Outline. VLSI design motivation Global routing via buffer-blocks Separable packing ILP formulations

jewel
Download Presentation

Practical Approximation Algorithms for Separable Packing LPs

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Practical Approximation Algorithms for Separable Packing LPs F.F. Dragan (Kent State) A.B. Kahng (UCSD) I. Mandoiu (UCLA/UCSD) S. Muddu (Sanera Systems) A. Zelikovsky (Georgia State)

  2. Outline • VLSI design motivation • Global routing via buffer-blocks • Separable packing ILP formulations • PTAS for separable packing LPs • Analysis • Experimental results

  3. Outline • VLSI design motivation • Global routing via buffer-blocks • Separable packing ILP formulations • PTAS for separable packing LPs • Analysis • Experimental results

  4. Outline • VLSI design motivation • Global routing via buffer-blocks • Separable packing ILP formulations • PTAS for separable packing LPs • Analysis • Experimental results

  5. Outline • VLSI design motivation • Global routing via buffer-blocks • Separable packing ILP formulations • PTAS for separable packing LPs • Analysis • Experimental results

  6. VLSI Global Routing

  7. Buffer Blocks VLSI Global Routing Buffered

  8. Problem Formulation Global Routing via Buffer-Blocks (GRBB) Problem Given: • BB locations and capacities • List of multi-pin nets • upper-bound on #buffers for each source-sink path • L/U bounds on the wirelength b/w consecutive buffers/pins Find: • Buffered routing of a maximum number of nets subject to the given constraints

  9. Integer Program Formulation

  10. Integer program changes • Split each BB vertex r of G into two copies, r’ and r’’ • Impose capacity constraint on the sets of vertices {r’,r’’} Enforcing Parity Constraints • Inverting buffers change the polarity of the signal • Each sink has a given polarity requirement • Parity constraints for the #buffers on each routed source-sink path • A path may use two buffers in the same buffer block

  11. Combining with compaction

  12. Combining with compaction

  13. Combining with compaction Set capacity constraints: cap(BB1) + cap(BB2)  const.

  14. Integer program changes • Replace each BB vertex r of G by a set X(r) of vertices (one for each buffer type) • Modify edge set of G to take into account non-uniform driving strengths • Impose capacity constraint on the sets of vertices X(r): GRBB with Buffer Library • Discrete buffer library: different buffer sizes/driving strengths • Need to allocate BB capacity between different buffer types

  15. “Relax+Round” Approach to GRBB • Solve the fractional relaxation • Exact linear programming algorithms are impractical for large instances • KEY IDEA: use an approximation algorithm • allows fine-tuning the tradeoff between runtime and solution quality • Round to integer solution • Provably good rounding [RT87] • Practical runtime (random-walk based)

  16. Outline • VLSI design motivation • Global routing via buffer-blocks • Separable packing LP formulations • PTAS for separable packing LPs • Analysis • Experimental results

  17. Separable Packing LP

  18. Previous Work • MCF and packing/covering LP approximation: [FGK73,SM90, PST91,G92,GK94,KPST94,LMPSTT95,R95,Y95,GK98,F00,…] • Exponential length function to model flow congestion [SM90] • Shortest-path augmentation + final scaling [Y95] • Modified routing increment [GK98] • Fewer shortest-path augmentations [F00] • We extend speed-up idea of [F00] to separable packing LPs

  19. Separable Packing LP Algorithm w(X) , f  0,  =  For i = 1 to N do For k = 1, …, #nets do Find min weight feasible Steiner tree T for net k While weight(T) < min{ 1, (1+) } do f(T)= f(T) + 1 For every X do w(X)  ( 1 +  (T,X)/cap(X) ) * w(X) End For Find min weight feasible Steiner tree T for net k End While End For  = (1+) End For Output f/N

  20. Outline • VLSI design motivation • Global routing via buffer-blocks • Separable packing ILP formulations • PTAS for separable packing LPs • Analysis • Experimental results

  21. Runtime • Choose #iterations N such that all feasible trees have weight 1 after N iterations (i.e.,  1) • Tree weight lower bound is  initially, and is multiplied by (1+) in each iteration Dual LP:

  22. Theorem: For every <.15, the algorithm finds factor 1/(1+4 ) approximation by choosing where L is the maximum number of vertices in a feasible Steiner tree. For this value of , the running time is Approximation Guarantee

  23. Outline • VLSI design motivation • Global routing via buffer-blocks • Separable packing ILP formulations • PTAS for separable packing LPs • Analysis • Experimental results

  24. Implementation choices

  25. Provably Good Rounding • Store fractional flows f(T) for every feasible Steiner tree T • Scale down each f(T) by 1- for small  • Each net k routed with prob. f(k)={ f(T) | T feasible for k } • Number of routed nets  (1- )OPT • To route net k, choose tree T with probability = f(T) / f(k) • With high probability, no BB capacity is exceeded Problem: Impractical to store all non-zero flow trees

  26. use random walk from source to sink Random-Walk 2-TMCF Rounding • Store fractional flows f(T) for every valid routing tree T • Scale down each f(T) by 1- for small  • Each net k routed with prob. f(k)={ f(T) | T routing for k } • Number of routed nets  (1- )OPT • To route net k, choose tree T with probability = f(T) / f(k) • With high probability, no BB capacity is exceeded Practical: random walk requires storing only flows on edges

  27. T3 T1 S T2 Random-Walk MTMCF Rounding SourceSinks

  28. T3 T1 S T2 Random-Walk MTMCF Rounding SourceSinks

  29. The MTMCF Rounding Heuristic • Round each net k with probability f(k), using backward random walks • No scaling-down, approximate MTMCF < OPT • Resolve capacity violations by greedily deleting routed paths • Few violations • Greedily route remaining nets using unused BB capacity • Further routing still possible

  30. Implemented Heuristics • Greedy buffered routing: • For each net, route sinks sequentially along shortest paths to source or node already connected to source • After routing a net, remove fully used BBs • Generalized MCF approximation + randomized rounding • G2TMCF • G3TMCF (3-pin decomposition) • G4TMCF (4-pin decomposition) • GMTMCF (no decomposition, approximate DRST)

  31. Experimental Setup • Test instances extracted from next-generation SGI microprocessor • Up to 5,000 nets, ~6,000 sinks • U=4,000 m, L=500-2,000 m • 50 buffer blocks • 200-400 buffers / BB

  32. % Routed Nets vs. Runtime

  33. Conclusions and Ongoing Work • Provably good algorithms and practical heuristics based on separable packing LP approximation • Higher completion rates than previous algorithms • Extensions: • Combine global buffering with BB planning • Buffer “site” methodology  tile graph • Routing congestion (channel capacity constraints) • Simultaneous pin assignment

  34. % Sinks Connected

  35. Runtime (sec.)

  36. Resource Usage #nets = 4,764 #sinks = 6,038 400 buffers/BB

  37. Resource Usage for 100% Completion #nets = 4,764 #sinks = 6,038 MTMCF wastes routing resources!

More Related