1 / 24

Shimin Chen (LBA Reading Group Presentation)

Introspective 3D Chips S. Mysore, B. Agrawal, N. Srivastava, S. Lin, K. Banerjee, T. Sherwood (UCSB), ASPLOS 2006. Shimin Chen (LBA Reading Group Presentation). Motivation. Focus: run-time monitoring for development Tool overhead a mount of analysis at test-time

Download Presentation

Shimin Chen (LBA Reading Group Presentation)

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Introspective 3D ChipsS. Mysore, B. Agrawal, N. Srivastava, S. Lin, K. Banerjee, T. Sherwood (UCSB), ASPLOS 2006 Shimin Chen (LBA Reading Group Presentation)

  2. Motivation • Focus: run-time monitoring for development • Tool overhead amount of analysis at test-time • Previous research: specialized on-chip h/w modules • At odds with economics of consumer microprocessors • May require significant amount of area • Often introduce interconnect congestion • Replicated on every processors whether used or not • Challenge: enabling these techniques with a minimum of impact on typical end-user systems

  3. Solution: Add-On using 3D • Optionally adding a layer to a processor specifically for analysis • Developers: processors with this layer • End users: processors without this layer

  4. Outline • Introduction • Benefits of Introspection in 3D • Quantifying the Technology (Methodology) • Architectural Ramifications (Evaluation) • Conclusion

  5. Benefits of Introspection in 3D • Cutting interconnect impact • Reducing cost for commodity parts • Enabling more powerful software analysis

  6. Cutting Interconnect Impact • Previous: gathering data from all over chip for centralized analysis • Global interconnect • Cross almost every design block • Consume significant top metal layer • Run at high speed • Require wire buffering & even pipeline latches • Reserve silicon for buffers

  7. Cutting Interconnect Impact • Previous: global interconnect • 3D: Area for inter-layer vias localized to positions of taps

  8. Reducing Cost for Commodity Parts • 225 million PCs in use vs. 0.7 million programmers • Need to consider two costs: • Cost of a consumer system: • cirtuit that drives the post and the vertical column of vias • Cost of a developer system: • adding an extra layer

  9. Enabling More Powerful SW Analysis • More h/w resources allocated to analysis • Area • power

  10. Outline • Introduction • Introspection in 3D • Quantifying the Technology • Architectural Ramifications • Conclusion

  11. Posts: 5um x 5um cross 30 - 40 um high (compare normal metal wire: 1um x 1um) Cross Section of 3D Chip

  12. Estimating Interconnect Overhead • Optimal buffer size and inter-buffer separation • 2D interconnect overhead • 3D interconnect overhead • Metalization area

  13. Number of Vertical Posts Estimate that 1024 bits of profile data will be generated per cycle (?)

  14. Gathering Profile Data on Pentium 4

  15. Example HW Monitor 16KB 32KB 32KB 32KB RISC ARM 16KB 130nm technology, area: 16mm2, power: 2.7W

  16. Outline • Introduction • Introspection in 3D • Quantifying the Technology • Architectural Ramifications • Conclusion

  17. Four Types of Systems to Compare • Basic System (Sbase) • System with integrated profiling HW (Sintegrated) • System with profiling HW stacked (Sstacked) • System with profiling stubs (Sstubs)

  18. Routability • Based on Pentium 4 analysis • Sintegrated: • Total wire length=5682.3 mm • Total buffers=~20,000 • Sstacked: • Total buffers=1024 (one per post)

  19. Area for Wires and Buffers

  20. Power

  21. Thermal

  22. Thermal

  23. Conclusion • Economic argument: cost of specialized H/W is decoupled from consumer market • H/W stubs add only 0.021 mm2 area and 0.9% power

  24. Thank you!

More Related