150 likes | 334 Views
32-bit Pipelined RISC Processor. Group 1. aka “Go Us”. CS m152b Lab section 1. Alice Wang Ann Ho Jason Fong. IF/ID. ID/EX. EX/MEM. MEM/WB. Register File. PC Control. ALU. Data Memory. Instruction Memory. General Review of a Pipelined Processor. ?. Two 16-bit words.
E N D
32-bit Pipelined RISC Processor Group 1 aka “Go Us” CS m152b Lab section 1 Alice Wang Ann Ho Jason Fong
IF/ID ID/EX EX/MEM MEM/WB Register File PC Control ALU Data Memory Instruction Memory General Review of a Pipelined Processor
? Two 16-bit words Data Memory 32-bit word One request 16-bit interface Request 32-bits Processor Memory 16-bits 16-bits Memory Controller Design 32-bit Design challenge: 32-bit processor with 16-bit memory interface On every memory access, need to get two words from memory Solution: Clock memory controller twice as fast as rest of processor Results in a memory access on the rising and falling edge of the processor’s clock cycle 32-bit
funct rt opcode rd rt rs rs rt opcode opcode rt opcode opcode opcode rs rd rs funct 21 19 24 22 31 28 31 28 27 25 24 22 24 22 21 19 31 28 31 28 31 28 27 25 31 28 24 22 27 25 27 25 3 0 3 0 imm16 imm16 imm16 imm16 15 0 15 0 15 0 15 0 Instruction Format General instruction format 4 bits remaining 28 bits vary according to instruction type R-type instruction unused unused I-type instruction unused unused J-type instruction unused unused
R-type instructions J-type instructions I-type instructions
Our_mult ALU with Multiplier
Multiplier Uses a series of shifts and additions 0 0 0 0 0 0 0 0 0 0 0 Example: 13 x 11 = 01101 x 01011 LO HI 01101 01101 x = 01101 = 01101 + + = 1 1 0 1 0 1 1 0 1 1 multiplicand 01011 1 1 multiplier = 143 0 0 1 0 0 0 1 1 1 1
Forward From ALU output Forward From memory output ID/EX EX/MEM MEM/WB Data Forwarding
PC Adder 1 Hold PC value Insert NOP NOP IF/ID Hardware NOP Insertion
Data Forwarding and Stall Insertions:Observed Results Sample program: Bubble-sort 6 numbers Assembler insertion of NOPs • Machine code size: 66 words of memory • Execution time: ~750 clock cycles Hardware data forwarding and NOP insertion: • Machine code size: 35 words of memory • Execution time: ~400 clock cycles
Data Forwarding and Stall Insertions:Observed Results Benefits: • Savings in memory and execution time • Much simpler assembler Drawbacks: • Hardware is now more complex Tradeoff between hardware complexity and software complexity Also demonstrates benefits of understanding the underlying architecture when designing an assembler
Conclusion • Some problems we encountered: • Off by one stage in pipeline • Lack of experience with VHDL • Order of bits from memory • In Conclusion... • Knowledge from previous courses • Further research • Simple RISC processor • Pipelining • Multiplier • Data Forwarding and Hardware NOP’s
References • Hennessey and Patterson, Computer Organization and Design (2nd Ed.), 1998, pages 476-495 • Donaldson, John L., “Pipeline Hazards”, • http://occs.cs.oberlin.edu/faculty/jdonalds/317/lecture08.html • Ercegovac, Intro To Digital Systems • Institute of Electronics, Information and Communication Engineers. "High Speed and Very Compact Two's Complement Serial/Parallel Multipliers Using Xilinx's FPGA"., Abdelkrim Kamel Oudjida 19 Nov. 2002, http://search.ieice.org/2001/pdf/e84-a_5_1339.pdf
Applause Please Thank You