1 / 10

Hy-C A Compiler Retargetable for Single-Chip Heterogeneous Multiprocessors

Hy-C A Compiler Retargetable for Single-Chip Heterogeneous Multiprocessors. Philip Sweany 8/30/2013. Hybrid Computing. Heterogeneous processors on single chip “ CPU ” FPGA ASIC N “ CPU ” s, M FPGAs, K ASICs Tradeoffs of performance, power, flexibility. Generic Hybrid Architecture.

tavon
Download Presentation

Hy-C A Compiler Retargetable for Single-Chip Heterogeneous Multiprocessors

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Hy-CA Compiler Retargetable for Single-Chip Heterogeneous Multiprocessors Philip Sweany 8/30/2013

  2. Hybrid Computing • Heterogeneous processors on single chip • “CPU” • FPGA • ASIC • N “CPU”s, M FPGAs, K ASICs • Tradeoffs of performance, power, flexibility

  3. Generic Hybrid Architecture CPU 1 FPGA 1 FPGA 2 CPU 2 Shared Memory CPU m FPGA n Multi-CPU Multi-FPGA

  4. Generic Hy-C Tools Source Code Objectives/Constraints System Specification Partitioning CPU Compiler FPGA Synthesis CPU Power-Performance Model FPGA Power-Performance Model Optimization Control

  5. OMAP Resources (old) Veyron Tesla Shared Memory Ducati Multi-CPU

  6. OMAP Processor Resources • Chiron • 2 x 600 MHz (2 symmetric processors each at 600 MHz with shared L2) • Power 600uW / MHz • Tesla • DSP Sub-System (C64x derivative); 400 MHz, 8-wide ILP • Power 200uW / MHz • Ducati • 200 MHz (targeted for control, low latency code) • Power 100uW / MHz

  7. “Canonical” Resources StrongArm C64x Shared Memory WimpyArm FPGA

  8. “Canonical” Processor Resources • StrongArm • 2 x 600 MHz (2 symmetric processors each at 600 MHz with shared L2) • Power 600uW / MHz • C64x • DSP Sub-System (C64x derivative); 400 MHz, 8-wide ILP • Power 200uW / MHz • WimpyArm • 200 MHz (targeted for control, low latency code) • Power 100uW / MHz • FPGA fabric

  9. Hy-C for Canonical Chip Source Code Objectives/Constraints System Specification Partitioning C64x Wimpy Strong FPGA Optimization Control

  10. Open Issue(s) • How should we describe the architecture? • How should we describe the optimization constraints? • How/when shall we implement this beast? • How will we evaluate the “performance” of the generated code?

More Related