Reducing Cache Traffic and Energy with Macro Data Load
Lei Jin and Sangyeun Cho*
Dept. of Computer Science, University of Pittsburgh
Motivation
• Data cache access is a frequent event
  • 20–40% of all instructions access the data cache
• Data cache energy can be significant (~16% of chip energy in the StrongARM [Montanaro et al. 1997])
• Reducing cache traffic leads to energy savings
• Existing approaches
  • Store-to-load forwarding
  • Load-to-load forwarding
  • Use available resources to keep data for reuse
    • LSQ [Nicolaescu et al. 2003]
    • Reorder buffer [Önder and Gupta 2001]
Macro Data Load (ML)
• Previous works are limited by exact data matching
  • Same address and same data type required for a forwarding hit
• ML exploits spatial locality in cache-port-wide data
  • Accessing port-wide data is essentially free
  • Naturally fits the datapath and LSQ width
  • Recent processors support 64-bit ports
  • Many accesses are narrower than 64 bits
[Figure: data access patterns without ML vs. with ML]
ML Potential
• ML uncovers more reuse opportunities than exact matching
• ML is especially effective when resources are limited
[Charts: CINT2k, CFP2k, and MiBench results]
ML Implementation
• Architectural changes
  • Relocated data alignment logic
  • Sequential LSQ-cache access
• Net impact
  • The LSQ becomes a small fully associative cache with FIFO replacement
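The net impact can be sketched as follows, assuming a small illustrative buffer (the 8-entry size, `MLBuffer` type, and helper names are invented for the example, not taken from the paper). Every completed access deposits its port-wide block at the FIFO head, overwriting the oldest entry; a later load searches all entries associatively before going to the data cache.

```c
#include <stdint.h>
#include <stdbool.h>
#include <stddef.h>

#define LSQ_SIZE 8   /* illustrative entry count */

/* With ML, the LSQ acts as a tiny fully associative cache of
   port-wide (64-bit) blocks with FIFO replacement. */
typedef struct {
    uint64_t block_addr[LSQ_SIZE];
    uint64_t data[LSQ_SIZE];
    bool     valid[LSQ_SIZE];
    size_t   head;               /* FIFO insertion point */
} MLBuffer;

/* Deposit a block; FIFO replacement overwrites the oldest entry. */
static void ml_insert(MLBuffer *b, uint64_t addr, uint64_t data)
{
    b->block_addr[b->head] = addr & ~7ULL;   /* 8-byte aligned */
    b->data[b->head] = data;
    b->valid[b->head] = true;
    b->head = (b->head + 1) % LSQ_SIZE;
}

/* Fully associative lookup: search every entry for the load's block.
   A hit forwards the data and avoids a data cache access. */
static bool ml_lookup(const MLBuffer *b, uint64_t addr, uint64_t *data)
{
    for (size_t i = 0; i < LSQ_SIZE; i++) {
        if (b->valid[i] && b->block_addr[i] == (addr & ~7ULL)) {
            *data = b->data[i];
            return true;
        }
    }
    return false;
}
```

The sequential LSQ-cache access in the slide corresponds to calling `ml_lookup` first and issuing the cache access only on a miss, trading a little latency for reduced cache traffic and energy.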
Result: Energy Reduction
• Up to 35% energy reduction (MiBench)!
• More effective than previous techniques
[Charts: CINT, CFP, and MiBench energy results]