1 / 23

EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs The University of Texas at Austin Spring semester, 2010. Compile-time outline of what we will do. Introduction and Focus on the Fundamentals Tradeoffs Mechanisms: run-time and compile-time

inoke
Download Presentation

EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. EE 382N Microarchitecture Yale Patt, instructor Eiman Ebrahimi, Khubaib, TAs The University of Texas at Austin Spring semester, 2010

  2. Compile-time outline of what we will do • Introduction and Focus on the Fundamentals • Tradeoffs • Mechanisms: run-time and compile-time • Approaches to concurrency • Impact of Multi-core

  3. Lecture 1: Introduction and Focus

  4. Outline • A science of tradeoffs • The transformation hierarchy • The algorithm, the compiler, the microarchitecture • The microarchitecture view • The physical view • Speculation • Design points • Design Principles • Role of the Architect • Numbers • Embedded processors – because… • Thinking outside the box

  5. Trade-offs, the overriding consideration:What is the cost? What is the benefit? • Global view • Global vs. Local transformations • Microarchitecture view • The three ingredients to performance • Physical view • Wire delay (recently relevant) • Power, energy (recently relevant) • Soft errors (recently relevant) • Partitioning (since the beginning of time)

  6. Trade-offs, the overriding consideration:What is the cost? What is the benefit? • Global view • Global vs. Local transformations • Microarchitecture view • The three ingredients to performance • Physical view • Wire delay (recently relevant) • Power, energy (recently relevant) • Soft errors (recently relevant) • Partitioning (since the beginning of time)

  7. Problem Algorithm Program ISA (Instruction Set Arch) Microarchitecture Circuits Electrons

  8. The Triangle(originally from George Michael) • Only the programmer knows the ALGORITHM • Pragmas • Pointer chasing • Partition code, data • Only the COMPILER knows the future (sort of ??) • Predication • Prefetch/Poststore • Block-structured ISA • Only the HARDWARE knows the past • Branch directions • Cache misses • Functional unit latency

  9. Trade-offs, the overriding consideration:What is the cost? What is the benefit? • Global view • Global vs. Local transformations • Microarchitecture view • The three ingredients to performance • Physical view • Wire delay (recently relevant) • Power, energy (recently relevant) • Soft errors (recently relevant) • Partitioning (since the beginning of time)

  10. Trade-offs, the overriding consideration:What is the cost? What is the benefit? • Global view • Global vs. Local transformations • Microarchitecture view • The three ingredients to performance • Physical view • Wire delay (recently relevant) • Power, energy (recently relevant) • Soft errors (recently relevant) • Partitioning (since the beginning of time)

  11. Outline • A science of tradeoffs • The transformation hierarchy • The algorithm, the compiler, the microarchitecture • The microarchitecture view • The physical view • Speculation • Design points • Design Principles • Role of the Architect • Numbers • Embedded processors – because… • Thinking outside the box

  12. Speculation • Why good? – improves performance • How? – we guess • Branch prediction • Way prediction • Data prefetching • Value prediction • Address prediction • Recent visibility: Memory disambiguation • Why bad? – consumes energyl

  13. Design Principles • Critical path design • Bread and Butter design • Balanced design

  14. Numbers(because comparch is obsessed with numbers) • The Baseline – Make sure it is the best • Superlinear speedup • Recent example, one core vs. 4 cores with ability to fork • The Simulator you use – Is it bug-free? • Understanding vs “See, it works!” • 16/64 • You get to choose your experiments • SMT, throughput: run the idle process • Combining cores: what should each core look like • You get to choose the data you report • Wrong path detection: WHEN was the wrong path detected • Never gloss over anomalous data

  15. Embedded processors vs General Purpose(because it represents 99%+ of the market) • Mostly everything about general purpose applies • Advantage: easier to design wholistically • advantage of special purpose or limited purpose • Greater use of ASICs • Although that could change • Why VLIW is good (though bad for general purpose) • Partitioning is critical • Memory latency • Reconfigurability

  16. Finally, people are always telling you:Think outside the box

  17. I prefer: Expand the box

  18. Something we are all familiar with:Look-ahead Carry Generators • They speed up ADDITION • But why do they work?

  19. Addition 12 9 21 182378 645259 827637

More Related