390 likes | 443 Views
This chapter delves into the testing challenges of Field Programmable Gate Arrays (FPGAs), with a focus on their programmability and complexity. It covers various testing approaches, including Built-In Self Test (BIST) and diagnosis of different FPGA resources. It also discusses the architecture, configuration, and testing problems unique to FPGAs, providing insights into routing resources and embedded processor-based testing. The text explores the dynamic partial reconfiguration capability of FPGAs and the evolving FPGA architectures over time.
E N D
Chapter 12 Field Programmable Gate Array Testing
What is this chapter about? • Field Programmable Gate Arrays (FPGAs) • Have become a dominant digital implementation media • Reconfigurable to implement any digital logic function • Focus on • Testing challenges due to programmability and complexity • Overview of testing approaches • Test and diagnosis of various resources • New frontiers in FPGA testing
FPGA Testing • Overview of FPGAs • Architecture, Configuration, & Testing Problem • Testing Approaches • BIST of Programmable Resources • Logic Resources • Logic Blocks, I/O Cells, & Specialized Cores • Diagnosis • Routing Resources • Embedded Processor Based Testing • Concluding Remarks
Field Programmable Gate Arrays • Configuration Memory • Programmable Logic Blocks (PLBs) • Programmable Input/Output Cells • Programmable Interconnect Typical Complexity = 5 million – 1 billion transistors
Basic FPGA Operation • Writing configuration memory (configuration) defines system function • Input/Output Cells • Logic in PLBs • Connections between PLBs & I/O cells • Changing configuration memory data (reconfiguration) changes system function • Can change at anytime • Even while system function is in operation • Dynamic partial reconfiguration 1110011010001000100101010001011100010100101010101001001000100010101001001001100100100001111000110010100010000110010001010001001001001000101001010101001001001010001010010100010100101001000100101010111010101010101010101010101111011111000000000000001101001111100001001110000011100100101000000001111100100100010100111001001010000111100011100010010101010101010101010010100101010100100101010101010101001001001
PC PC PC PC FPGA Architectures • Early FPGAs • NxN array of unit cells • Unit cell = CLB + routing • Special routing along center axes • I/O cells around perimeter • Next Generation FPGAs • MxN array of unit cells • Added small block RAMs at edges • More Recent FPGAs • Added larger block RAMs in array • Added multipliers • Added Processor Cores (PC) • Latest FPGAs • Added DSP cores w/multipliers • I/O cells along columns for BGA
Truth table A B S Logic symbol A 0 1 Z S A B Z 0 0 0 0 0 0 1 0 0 1 0 1 0 1 1 1 1 0 0 0 1 0 1 1 1 1 0 0 1 1 1 1 S Z B 0 1 Combinational Logic Functions • Gates are combined to create complex circuits • Multiplexer example • If S = 0, Z = A • If S = 1, Z = B • Common digital circuit • Heavily used in FPGAs • Select input (S) controlled by configuration memory bit
0 1 0 1 0 1 1 0 0 1 0 1 0 1 0 1 0 1 0 1 0 1 Z Truth table 1 A B S Multiplexer 0 1 Z S A B Z 0 0 0 0 0 0 1 0 0 1 0 1 0 1 1 1 1 0 0 0 1 0 1 1 1 1 0 0 1 1 1 1 1 0 1 B A S Look-up Tables • Using multiplexer example • Configuration memory holds truth table • Input signals connect to select inputs of multiplexers to select output value of truth table for any given input value
Carry & Control Logic Flip-flop/ Latch carry out LUT/ RAM Output Q output Input[1:4] 4 3 Control clock, enable, set/reset carry in Basic Prog. Logic Block (PLB) Structure • Look-up table (LUT) for combinational logic • Store truth table in LUT (typically 3 to 6 inputs) • Some LUTs can also act as RAM/shift register • Flip-flops for sequential logic • Programmable clock enable, set/reset • Special logic • Large logic functions with Shannon expansion • Fast carry for adders and counters
Data In 0 1 1 0 0 0 1 1 en3 en4 en5 en7 en2 en0 en1 en6 0 1 0 1 0 1 0 1 0 1 0 1 0 1 Z In0 In1 In2 Address Decoder Write Enable In0 In1 In2 Look-up Table Based RAMs • Normal LUT mode performs read operations • Address decoder with write enable generates load signals to latches for write operations • Small RAMs but can be combined for larger RAMs Write Address Read Address
Input/Output Cells • Bi-directional buffers • Programmable for input or output signals • Tri-state control for bi-directional operation • Flip-flops/latches for improved timing • Set-up and hold times • Clock-to-output delay • Pull-up/down resistors • Routing resources • Connections to core of array • Programmable I/O voltage & current levels Tri-state Control to/from internal routing resources Bi- directional Buffer Output Data Pad Input Data
Wire A config bit Wire B Interconnect Network • Wire segments of varying length • xN = N PLBs in length • Typical values of N = 1, 2, 4, 6, 8 • Long lines • xH = half the array in length • xL = full array in length • Programmable Interconnect Points (PIPs) • Transmission gate connects to 2 wire segments • Controlled by configuration memory bit • Four basic types of PIPs
Programmable Interconnect Points • Break-point PIP • Connect or isolate 2 wire segments • Cross-point PIP • 2 nets straight through • 1 net turns corner and/or fans out • Compound cross-point PIP • Collection of 6 break-point PIPs • Can route 2 isolated signal nets • Multiplexer PIP • Directional and buffered • Main routing resource in recent FPGAs • Select 1-of-N inputs for output • Decoded MUX PIP – N configuration bits select from 2N inputs • Non-decoded MUX PIP – 1 configuration bit per input
= PLBs = routing resources = special cores = I/O cells Recent Architectural Trends • Addition of specialized cores: • Memories • Single and dual-port RAMs • FIFO (first-in first-out) • ECC (error correcting codes) • Digital signal processors (DSPs) • Multipliers • Accumulators • Arithmetic/logic units (ALUs) • Embedded processors • Hard core (dedicated processors) • With dedicated program/data memories • Otherwise, programmable RAMs in FPGA used for program/data memories • Soft core (synthesized from a HDL)
FPGA Resources • Types and sizes of resources vary with FPGA family • Example: LUTs vary from 3-input to 6-input • 4-input LUTs are most common • Typical ranges for some commercially available FPGAs
clock PROM with Config Data data out CCLK FPGA in Master Mode Din Dout CCLK FPGA in Slave Mode Din Dout CCLK FPGA in Slave Mode Din Dout Configuration Interfaces • Master mode (Serial or Parallel options) • FPGA retrieves configuration from ROM at power-up • Slave (Serial or Parallel options) • FPGA configured by external source (i.e., a P) • Used for dynamic partial reconfiguration • Boundary Scan Interface • 4-wire IEEE standard serial interface for testing • Write and read access to configuration memory • Interfaces to FPGA core internal routing network • Not available in all FPGAs
FPGA Configuration Memory • PLB addressable • Good for partial reconfiguration • X-Y coordinates of PLB location to be written • “Z” coordinate identifies which resources will be configured • Frame addressable • Vertical or horizontal frame • Vertical frames most common • Access to all PLBs in frame • Only portion of logic and routing resources accessible in a given frame • Many frames required to configure PLBs & routing
Configuration Techniques • Full configuration & readback • Simple configuration interface • Automatic internal calculation of frame address • Long download time for large FPGAs • Partial reconfiguration & readback • Only change portions of configuration memory with respect to reference design • Reduces download time for reconfiguration • Requires a more complicated configuration interface • Command Register (CMR) • Frame Length Register (FLR) • Frame Address Register (FAR) • Frame Data Register (FDR)
Configuration Techniques • Compressed configuration • Requires multiple frame write capability • Write identical frames of config data to multiple frame addresses • Extension of partial reconfiguration interface capabilities • Frame address is much smaller than frame of configuration data • Reduces download time for initial configuration depending on • Regularity of system function design • % utilization of array • Unused portions written with default configuration data
FPGA Testing Taxonomy • On-line test while system is operational • Off-line test while system is out-of-service • Application-dependent testing tests only those FPGA resources used by intended system function • Application-independent testing tests all FPGA resources
FPGA Test Configurations • More test configurations required for routing resources than for logic resources • Data below from publications on actual test configuration implementations in commercial FPGAs
LUT C 8x1 Cout Smux D2-0 SOmux 3 LUT S 8x1 Sout CEmux D3 SRmux C7 C6 C5 C4 C3 C2 C1 C0 CB5 FF CB4 D2-0 CB3 Clock Enable Set/Reset 111 110 101 100 011 010 001 000 LUT out Clock CB0 CB1 CB2 0 1 0 1 0 1 0 1 = Configuration Memory Bit CB A Simple PLB Architecture • Two 3-input LUTs • Can implement any 4-input combinational logic function • Can implement full adder • Carry in LUT C • Sum in LUT S • 1 flip-flop • Programmable: • Active levels • Clock edge • Set/reset • 22 configuration memory bits • 8 per LUT • C7-C0 and S7-S0 • 6 control bits • CB5-CB0
Test Configurations for Simple PLB • All configuration memory bits must be tested for both logic values (0 and 1) assuming exhaustive input patterns • Output effects for each logic value must be observed • Exclusive-OR (XOR) and exclusive-NOR (XNOR) functions are good for testing LUTs • Put opposite functions in adjacent LUTs to produce opposite logic values at inputs to subsequent logic functions • Fault coverage results below are based on collapsed single stuck-at gate-level fault model (174 faults total)
BIST for FPGAs • Basic idea: • Program some logic resources to act as • Test pattern generators (TPGs) • Output response analyzers (ORAs) • Resources under test • Logic resources as blocks under test (BUTs) • Routing resources as wires under test (WUTs) • Goal: • Minimize number of test configurations to minimize download time • Download time dominates total test time
TPG and ORA Implementations • TPG implementation depends on test algorithm • May be implemented in different resources (see table below) • Multiple TPGs prevent faulty TPG from escaping detection • Lower bound on number of PLBs per TPG, TPLB = BIN NFF • BIN = number of inputs to BUT • NFF = number of FFs/PLB • ORAs most efficiently implemented in PLBs • Number of PLBs needed for ORAs, OPLB = (NBUT×BOUT) NFF • BOUT = number of outputs from BUT • NBUT = number of BUTs
TPG Algorithms • Small logic functions (PLBs, IOBs) can be tested with pseudo-random test patterns • LFSRs or counting patterns • Large logic functions (RAMs, DSPs) require specialized test algorithms for high fault coverage • Below are examples of typical RAM test algorithms Notation: w0 = write 0 (or all 0’s), r1 = read 1 (or all 1’s) ↑= address up, ↓= address down, ↨ = address either way
BUTj output BUTk output BUTj output BUTk output Pass/ Fail shift data shift mode Pass/ Fail BUTj output1 BUTk output1 BUTj outputn BUTk outputn Pass/ Fail Output Response Analyzers • Comparison-based • XOR with OR feedback from flip-flop • Latches mismatches observed due to faults • Results retrieval • ORA with shift register • Requires additional logic • Configuration memory readback • Read contents of ORA flip-flops • Good with partial configuration memory readback capabilities
Basic Comparison =BUT =TPG =ORA Logic Resource BIST Architectures • Basic comparison • Multiple TPGs drive alternating columns (rows) of blocks under test (BUTs) • BUTs in center of array observed by 2 sets of ORAs and compared with 2 other BUTs • BUTs along edges of array observed by only 1 set of ORAs • Some loss of diagnostic resolution • Originally used to test PLBs • Later used to test specialized cores
Circular Comparison =BUT =TPG =ORA Logic Resource BIST Architectures • Circular Comparison • Multiple TPGs drive alternating columns (rows) of blocks under test (BUTs) • All BUTs observed by 2 sets of ORAs and compared with 2 other BUTs • Good diagnostic resolution • Originally used to test specialized cores • Later used to test PLBs and I/O cells
Expected Results expected results test patterns =BUT =TPG =ORA Logic Resource BIST Architectures • Expected Results comparison • Multiple TPGs • One set of TPGs drive BUTs • Other set of TPGs produce expected results for comparison with outputs of BUTs • BUTs observed by 1 set of ORAs and compared with expected results from TPGs • Simple diagnosis since failing ORA position indicates faulty BUT • Good when expected results can be algorithmically generated easily • Example: RAM test algorithms • Originally used to test RAM cores
Logic Resource Diagnostic Procedure • Record ORA results; 1= failure indication. • For every set of 2 or more consecutive ORAs with 0s, enter 0s for all BUTs observed by these ORAs; the BUTs are fault-free. • For every adjacent 0 and 1 followed by an empty space, enter 1 to indicate BUT is faulty; continue while such entries exist. • If an ORA indicates a failure but both BUTs monitored by the ORA are fault-free, one of the following conditions exist: • A fault in routing resources between one of the BUTs and the ORA, • ORA is faulty, or • There are more than 2 consecutive BUTs with equivalent faults (for circular comparison only); reorder circular comparison and repeat test and diagnostic procedure. • Remaining BUTs marked as unknown may be faulty; reorder circular comparison or rotate basic comparison architecture by 90, repeat test and diagnostic procedure.
Diagnostic Procedure Examples • Note that B4 and B5 have equivalent faults in Example A • Circular comparison provides better diagnostic resolution • Also indicates when more than 2 consecutive BUTs with equivalent faults (Example C)
TPG WUTs T T T T T ORA O O O O O STAR Testing Routing Resources • Comparison-based BIST approach • Developed for on-line FPGA BIST • Testing restricted to routing resources for 2 rows or 2 columns of PLBs • Small Self-Test AReas (STARs) • Comparison-based ORA • Later applied to off-line BIST • Fill FPGA with STARs • Tests run concurrently • Diagnostic resolution to STAR • Easier BIST development • But more BIST configurations FPGA
parity-check based-ORA TPG TPG C1 C0 Par WUTs WUTs parity bit + ORA ORA Testing Routing Resources • Original parity-based BIST approach • Parity bit routed over fault-free resources • What is fault-free until you’ve tested it? • Modified parity-based approach • N-bit up-counter with even parity, and • N-bit down-counter with odd parity • Gives opposite logic values for • Stuck-on PIPs & bridging faults • Parity used as test pattern • N+1 wires under test • Good for small PLBs • like our simple PLB example • Make STARs as small as possible • Better diagnostic resolution • Easier BIST development
=TPG =ORA Testing Routing Resources • Testing typically separated by routing resources • Global - interconnects non-adjacent logic resources • Local - interconnects adjacent logic resources and connects logic resources to global routing • Additional test configurations swap positions of TPGs and ORAs to reverse direction of signal flow to test directional, buffered routing resources • Multiplexer PIPs are a good example local routing adjacent PLBs global routing local routing PLB feed-through
Reducing Test Time • Orient BIST architecture to configuration memory • Align along rows/columns depending on FPGA structure • Downloading BIST configurations • Compressed configuration for initial download • Partial reconfiguration for subsequent downloads • Reduce number of frames written between configurations • Keep routing constant between BIST configurations • Optimize order of BIST configuration application • Retrieving BIST results • Partial configuration memory readback • Eliminates ORA logic for scan chain • Allows concurrent testing of more resources • Minimize number of frames to be read • Dynamic partial reconfiguration • Read BIST results after a series of BIST configurations • Slight loss in diagnostic resolution
= BUT = ORA Processor core, TPGs and interface to ICAP circuitry Processor core, TPGs and interface to ICAP circuitry Embedded Processor Based BIST • New area of R&D in FPGA testing • Basic idea: • Embedded processor core • Hard or soft core • Configures FPGA for BIST • Via internal configuration access port (ICAP) • Alternative: download initial BIST configuration • Executes BIST sequence • May provide TPG functionality • Retrieves BIST results • May perform diagnostic procedure • Reconfigures FPGA for subsequent BIST configurations • Soft core requires two test sessions to test area occupied by processor core during first test session Test session #1 Test session #2
Embedded Processor BIST • Overall reduction in total test time • Algorithmic reconfiguration faster than external download • 10 to 25 times faster • Results below from actual implementation in commercial FPGA • Can be loaded into processor program memory for on-demand BIST and diagnosis of FPGA • Good for fault-tolerant applications where system function is reconfigured around diagnosed fault(s)
Concluding Remarks • Growing use of FPGAs in systems and SOCs • FPGA testing is necessary but difficult due to • Programmability • Complex programmable interconnect network • Constantly growing size and changing architectures • Incorporation of new and different specialized cores • Test & diagnosis allows fault-tolerant applications • New FPGA capabilities assist in testing solutions • Dynamic partial reconfiguration and readback • Configuration/reconfiguration by embedded processor cores