80 likes | 233 Views
Lect 13: Cache Memory. Cache Memory. Large main memory system: High-capacity, relatively slow speed dynamic RAMs and EPROMs: 60ns or 70ns Processor speed: requires DRAMs with 30 or 40ns, Latest processors : 10ns 20ns Cache memory: small high-speed expensive static RAMs
E N D
Cache Memory • Large main memory system: High-capacity, relatively slow speed dynamic RAMs and EPROMs: 60ns or 70ns • Processor speed: requires DRAMs with 30 or 40ns, Latest processors : 10ns 20ns • Cache memory: small high-speed expensive static RAMs • Typically from 16KB to 256KB, xMB • Hit Ratio and Miss Ratio • Types of cache organization • Direct-mapped cache • N-way set associative cache • Fig 10.42, Fig 10.43 • Posted Write
82385DX Cache Controller • 82385 Cache Controller • Improves 386 DX performance • Reduce Average CPU wait states to nearly zero • Zero wait state posted memory writes • Hit rates up to 99% • SW transparent • Synchronous Dual Bus Architecture • Maps Full 386 DX address space (4GB) • Flexible Cache Mapping Polices • Direct Mapped or 2-way Set Associative • Supports Non-cacheable memory space • Unified Cache for Code and Data • 132-pin PGA package • Fig 4.44
Architecture of an 80386DX microcomputer with an 82385-based Cache Memory • Fig 10.45 • 386 local bus, 385 local bus, system bus • Signal Interfaces of the 82385DX • Fig 10.46 • Configuration Input • M/S (master/slave select): • 2W/D (two-way set associative or direct-mapped) • 385/386 Interface signals • Cache Memory Control Signals • CALEN (cache address latch enable) • CT/R (cache transmit/receive) • CS0-CS3(cache chip select) : four 8K bytes SRAMs • COEA, COEB, CWEA, CWEB(cache output enable, cache write enable)
Cache Address (1 of 8K Double words) A15 A14 A5 A4 A2 Line Select (1 of 8 lines) 17-bit Tag (1 of 217 pages) Set Address (1 of 1024 sets) • 82385 Local bus interface signals • BBE0-BBE3, BADS, BNA, BLOCK, BREADY • Bus Arbitration signals • 385 bus data transceiver and address latch control signals • See Fig 10.47, Fig 10.48 • Direct-Mapped Cache Organization • See Fig 10.50 • 386 DX Address Bus Bit Fields • 4GB: • 8K double word(32kB) page X 217 • page: 1024 sets of 8 double words (8x32 bits) • line: 32-bit Double word A31
Cache Directory Entry • 17-bit Tag: see Fig 10.51 • Tag valid bit • Line valid bits • Direct-Mapped Cache Operation • Read Hit • 10-bit set address -- select one of 1024 entries • 17-bit tag field with the tag • if Match and Tag and Line Valid bits are set --> hit • drives the cache • Read Miss • Mem -> cache and 386DX • Line miss - update the line valid bit only • Tag Miss - the upper address bits overwrite the previously stored tag the appropriate line valid bit is set, the other seven line valid bits are cleared • Write Hit • update memory and cache • directory is unaffected
Snoop Hit • cache: unaffected • the affected line: invalidated • Cache Flush • clear all tag valid bits • Write Miss • cache: unaffected • directory: unaffected • Two-way Set Associative Cache Organization • See Fig 10.53 • 4GB: • Two banks of 4K double word • 512 sets in each bank • line: 32-bit Double word Cache Address (1 of 4K Double words) A31 A14 A13 A5 A4 A2 Line Select (1 of 8 lines) 18-bit Tag (1 of 218 pages) Set Address (1 of 512 sets)
LRU replacement Algorithm • LRU bit • Cache Directory Entry • See Fig 10.54 • Two-way Set Associative Cache Operation • Read Hit • The two tags of this set are simultaneously compared with A14- A31 Both tag valid bits are checked Appropriate line valid bits are checked Update LRU bit • Read Miss • Check LRU bit • Mem-> the bank that the LRU bit points to • Update Tag, Tag valid bit, line valid bit