1 / 6

MOTIVATION AND OBJECTIVE

MOTIVATION AND OBJECTIVE. Discrete Signal Transforms (DSTs) DFT, DCT: major performance component in many applications Hardware accelerated but at high area cost Example: 4P DFT formulation and others Distributed (dedicated) hardware architectures (DHAs)

dolorese
Download Presentation

MOTIVATION AND OBJECTIVE

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. MOTIVATION AND OBJECTIVE Discrete Signal Transforms (DSTs) DFT, DCT: major performance component in many applications Hardware accelerated but at high area cost Example: 4P DFT formulation and others Distributed (dedicated) hardware architectures (DHAs) Partitioning plays key role in performance and resourceoptimization. Partitioning beyond multi-device Multi-core GPPs: IBM CELL BE, Intel Core Duo System-on-Chip: Network-On-Chip Need tools to aid in partition exploration, mapping, implementation  design automation Can we improve partitioning by introducing high-level DST considerations? DST Partitioning Distributed Hardware Architecture

  2. DMAGIC METHODOLOGY DMAGIC = DST Mapping using Algorithmic and Graph Interaction and Computation Architectural Description KPA DST Formulation DST Formulation Hypergraph Formulation To DFG Partition/ Placement DFG Cost and Indicators KPA Formulation Estimators Formulation Manipulator Heuristic Control Rule Selection High-level partition solution 2

  3. TOOLS Kronecker to Graph Tool KTG Operator matrices: Identity, Transform, Permutation, Unitary, Unitary Transpose, Twiddle KPA operations: Tensor product (), Direct Sum ( ), Matrix Multiplication Weighted DFG with level information Graph partitioning/placement algorithm • P/P inspired by Kernighan Lin bipartition heuristic • Extended to k-way partitioning for heterogeneous channels • Cost function sensible to DHA main concerns • DST graphical considerations heuristics Cost Function weight of channel i required comm. through i 3

  4. FORMULATION EXPLORATION Use DST rules to explore space of equivalent formulations in search for one that better suits the target architecture. Combinatorial explosion of the solution space. Find rules amenable to hardware implementation. Conducted experiments to assess the impact of transformations on partition quality. Results used to devise exploration strategy. Objective: Challenges: Approach: Experiments: Effect of inter-column permutations Have effect on solution quality, yet hard to establish heuristic. Effect of node granularity Effect of breakdown strategy using DST decomposition rule 10 10 Greedy ‘top-down’ formulation exploration using breakdown strategy 5 5 2 8 Observations on exhaustive small sized transforms 4 4 4 1 2 2 4 4

  5. RESULTS • Latency Reduction: Average = 23.3%, Peak = 34.1% • Run-Time Reduction: Average = 98.0%, Peak = 99.9% 5

  6. CONCLUSIONS AND CONTRIBUTIONS • Multiple opportunities to improve partitioning by taking advantage of DST and DHA features. • Graph level: regularity of permutations and operations. • Graph partitioning and area estimator can be made more sensible to DHA and DST concerns • Algorithmic level • Reformulation has significant impact on partition quality • Improvements over generic methodology • Latency reduction (23.3% avg, 34.1% max), Runtime (98.8% avg, 99.9% max) • Tools: • KTG: automated and extensible methodology for conversion of Kronecker product algebra (KPA) formulations into DFGs • Architectural model and high-level estimator for the implementation of distributed DSTs • Graph partitioning heuristic for k-way for DSTs to DHAs • New heuristic for exploring DST formulation space • New arbitrary decomposition DCT formulation 6

More Related