1 / 30

Weng-Fai WONG 黄荣辉 Dept. of Computer Science National University of Singapore

Performance and Energy Bounds for Multimedia Applications on Dual-processor Power-aware SoC Platforms. Weng-Fai WONG 黄荣辉 Dept. of Computer Science National University of Singapore. Joint work in collaboration with Zhu Yongxin, Samarjit Chakraborty. Background.

Download Presentation

Weng-Fai WONG 黄荣辉 Dept. of Computer Science National University of Singapore

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Performance and Energy Bounds for Multimedia Applications on Dual-processor Power-aware SoC Platforms Weng-Fai WONG 黄荣辉 Dept. of Computer Science National University of Singapore Joint work in collaboration with Zhu Yongxin, Samarjit Chakraborty

  2. Background • SoC platforms become more complicated than classic embedded systems by carrying out multiple tasks: • to record music received by software radio • to play games while downloading another one • to talk over GPRS/3G mobile phone which stays online checking emails • …. • Needs to quickly explore design space of SoC for multimedia processing • Emergence of multi-core technology

  3. Background • Analytical approaches are necessary due to unacceptable overheads of simulation practices to study multiple design tradeoffs • Many efforts for performance enhancement to ensure the quality of service such as a guaranteed playback rate • A few power-awareness efforts • dynamic voltage and frequency scaling (DVFS) • dynamic power management (DPM)

  4. A Motivating Problem • Under a performance constraint, how to minimize energy dissipation by trading off among: • dynamic frequency and voltage scaling policies, • multiple frequencies of processors, • processor customization catering for applications

  5. Related Work • Yanhong Liu, Alexander Maxiaguine, Samarjit Chakraborty, and Wei Tsang Ooi. Processor frequency selection in energy-aware SoC platform design for multimedia applications. RTSS 2004. • Alexander Maxiaguine, Yongxin Zhu, Samarjit Chakraborty, and Weng-Fai Wong. Tuning soc platforms for multimedia processing: Identifying limits and tradeoffs. CODES+ISSS 2004 • L. Cai and Y.H. Lu. Energy Management Using Buffer Memory for Streaming Data, IEEE Transactions on Computer-aided Design of Integrated Circuits and Systems, 24(2):141-152, 2005 • Validation of the models against simulation results and metrics on physical processors • Co-optimization of performance and power

  6. Methodology • Network calculus models to identify the upper and lower bounds using variability characterization curves

  7. Variability Characterization Curves • Workload curves • Consumption curves

  8. Variability Characterization Curves • Production curves • Service curves • number of available cycles, subject to schedulers such as duty cycles • Number of activations

  9. Power Model • Active time • where Li is the length of activation on the i-th PE, Ωi is the frequency of the PEi • Leakage power • where Isubnis the sub-threshold current, Vbs is the body bias voltage, and Ij is the reverse bias junction current • Switching overhead • where ρi is the scheduling period of PEi, Dwakeup is the wake-up delay, pidlep,i is the dynamic power of PEi in the idle mode

  10. Power Model (cont’d) • PE’s energy • Buffer’s energy • where Qmaxi is the maximum buffer fill level of the i-th buffer, pbi is the i-th buffer’s dynamic power • Total energy

  11. Experiment Setup • Map an MPEG-2 decoder onto PE1 and PE2 • Setting 1: parameters of Intel 80200 Xscale processor • Setting 2: parameters based on Transmeta Crusoe processor scaled up to 70nm technology • Buffer’s specifications are Micro SDRAM parameters

  12. Experiment Setup (cont’d)

  13. A Motivating Problem • Under a performance constraint, how to minimize energy dissipation by trading off among: • dynamic frequency and voltage scaling policies, • multiple frequencies of processors, • processor customization catering for applications

  14. How do scheduling policies affect the constraint?

  15. Results on Underflow Possibilities Underflow possibilities associated with scheduling periods (733MHz)

  16. Results on Underflow Possibilities (cont’d) Underflow possibilities associated with varying duty cycles (633MHz)

  17. Which is more sensitive to schedulers, the buffer’s energy or PE’s energy?

  18. Bounds of Buffer’s Energy Bounds of buffer’s maximum energy associated with the same frequencies of PEs with SDRAM buffers under varying duty cycles

  19. Bounds of Total Energy Bounds of maximum total energy associated with the same frequencies of PEs with SDRAM buffers under varying duty cycles

  20. A Motivating Problem • Under a performance constraint, how to minimize energy dissipation by trading off among: • dynamic frequency and voltage scaling policies, • multiple frequencies of processors, • processor customization catering for applications

  21. How to reduce energy by choosing frequencies without undermining the quality of service?

  22. Choosing Frequencies along the Boundary Bounds of maximum total energy associated with the combinations of frequencies of PEs with SDRAM buffers a duty cycle of 0.9

  23. Choosing Frequencies along the Boundary (cont’d) • Noting the surface almost monotonously increases with the frequencies except for the starting point • Choosing frequency combinations along the boundary of the area can minimize energy without violating the performance constraint

  24. A Motivating Problem • Under a performance constraint, how to minimize energy dissipation by trading off among: • dynamic frequency and voltage scaling policies, • multiple frequencies of processors, • processor customization catering for applications

  25. What to trade off if the frequencies are fixed?

  26. Shifting of the Best Duty Cycle Bounds of maximum total energy associated with the combinations of frequencies of PEs with data cache buffers varying duty cycles

  27. Summary • An analytical framework based on VCC to identify both performance and energy bounds • Studied the impacts of scheduler policies • Explored the tradeoffs of frequencies • Explored processor customizations

  28. Next Steps • Include more hardware details • Hierarchical cache systems • Communication mechanisms such as buses • Co-optimization algorithms • Detailed validations of the model

  29. EASEL: Engineering Architectures and Software for the Embedded Landscape

  30. Thank You!

More Related