620 likes | 773 Views
A Hybrid Approach of Failed Disk Recovery Using RAID-6 Codes: Algorithms and Performance Evaluation. Yinlong Xu University of Science and Technology of China. A joint work with Liping Xiang, John C.S. Lui and Qian Chang. I would like to thank John C.S. Lui , Raymond W. Yeung ,
E N D
A Hybrid Approach of Failed Disk Recovery Using RAID-6 Codes: Algorithms and Performance Evaluation YinlongXu University of Science and Technology of China A joint work with Liping Xiang, John C.S. Lui and Qian Chang
I would like to thank John C.S. Lui, Raymond W. Yeung, Patrick B.C. Lee, Alfred C.L. Ho!
Outline • Background • A Hybrid Recovery Approach for Single Disk Failure • Row-Diagonal Optimal Recovery (RDOR) for Single Disk Failure • A Recovery Scheme with Minimum Disk Reads • Balancing Disk Reads • Optimizing Memory Usage • Performance Evaluation • Summary 3/60
Outline • Background • Hybrid Recovery Approach for Single Failure • Row-Diagonal Optimal Recovery (RDOR) for Single Disk Failure • A Recovery Scheme with Minimum Disk Reads • Balancing Disk Reads • Optimizing Memory Usage • Performance Evaluation • Summary 4/60
Remark: • This work can be applied to two RAID-6 codes, RDP and EVENODD. • This talk takes RDP as an example. 5/60
RDP Code • In a form of a (p1)×(p+1) matrix, p is a prime number. • The first p1 columns are information columns. • The last two are parity columns (row parity, diagonal parity). • Note: With RDP code,all information data is recoverable when any two disks fail. Row parity d0,4= d0,0 d0,1 d0,2 d0,3 Diagonal parity d0,5= d0,0 d2,3 d3,2 d1,4 Missing Diagonal 6/60
Outline of our work • Problem: The recovery of single disk failure in RDP coded systems • Motivation: RDP code tolerates two disk failures, but the probability of single disk failure is much higher than double disk failures. • Contributions: • Give the lower bound of disk reads • Propose a recovery scheme, s.t. • Disk reads matches the lower bound, reduced by 1/4. • Balancing disk reads • Minimum extra memory usage: (p1)/2 blocks • XOR operations: No more than conventional scheme 7/60
A Naive Recovery Scheme for Single Disk Failure of RDP Code –– Case(1) • Case 1: Single information disk fails • Row parity disk and other information disks are used for the recovery. Disk6 d0,1 d1,1 d2,1 d3,1 The recovery of Disk 1 8/60
A Naive Recovery Scheme for Single Disk Failure of RDP Code –– Case(2) • Case 2: Single parity disk fails • The recovery is equivalent to the parity encoding Disk6 d0,5 d1,5 d2,5 d3,5 The recovery of diagonal parity disk 9/60
Features of the Naive Recovery Scheme • Only uses single parity column for single disk failure recovery, however, there are two parity columns in the array. • (p1)2 symbols are read from the disks for the recovery. 10/60
Questions • Whether the disk reads can be reduced for the recovery of single disk failure? • What iftwo parity disks are used for single disk failure recovery? 11/60
Some Benefits from Reducing Disk Reads • Speeding up the recovery • Relieving system bus load • Relieving disk load • Enhancing user’s service performance • Saving disk energy • … 12/60
Outlines • Background • A Hybrid Recovery Approach for Single Failure • Row-Diagonal Optimal Recovery (RDOR) for Single disk Failure • A Recovery Scheme with Minimum Disk Reads • Balancing Disk Reads • Optimizing Memory Usage • Performance Evaluation • Summary 13/60
Row Parity or Diagonal Parity? • Either row parity or diagonal parity can be used to recover an erasure symbol d0,1 can be recovered by row parity d0,1 can also be recovered by diagonal parity 14/60
Notes: There are 4 overlapping symbols which need to be readtwice. If the 4 overlapping symbols are per-stored in memory, the number of disk reads is reduced to 164=12<16. A Hybrid Recovery Approach for SingleDisk Failure • Using diagonal parity to recover d0,1; • Using diagonal parity to recover d1,1; • Using row parity to recover d2,1; • Using row parity to recover d3,1. Overlapping symbols 15/60
Consideration of Hybrid Recovery Approach • By using memory read instead of disk read • The recovery process will be speeded up Note: Memory read is 100 times faster than disk read • Communication load of the storage system will be reduced • During the recovery, the more overlapping symbols, the fewer symbols to be read from disks. • Questions • What is the lower bound of disk reads for single disk failure recovery? • How to design a recovery scheme which matches this lower bound? 16/60
Outlines • Background • A Hybrid Recovery Approach for Single Failure • Row-Diagonal Optimal Recovery (RDOR) for Single Failure • Recovery Scheme with Minimum Disk Reads • Balancing Disk Reads • Optimizing Memory Usage • Performance Evaluation • Summary 17/60
Diagonal parity Row parity Row Parity Sets • Ri= {di,k|0 k p1}-----the i-throw parity set. E.g. R0={d0,0, d0,1, d0,2, d0,3, d0,4}. • Each symbol in Rican be recovered by other symbols inRi. Because d0,4=d0,0d0,1d0,2d0,3 , so d0,1=d0,0d0,2d0,3d0,4 18/60
Row parity Diagonal parity Diagonal Parity Sets • Dj= {di,k|(i+k) mod p = j, 0 i p2, 0 k p} is the j-thdiagonal parity set. E.g. D1={d1,0, d0,1, d3,3, d2,4, d1,5} • Each symbol in Djcan be recovered by other symbols inDj. d0,1=d1,0d3,3d2,4d1,5 19/60
Overlapping Symbols • There is just one common (named overlapping) symbol between each pair of Ri and Dj. e.g. R1∩D3={d1,2} R1 D3 20/60
Special Cases of Parity Sets Only belong to their row parity sets Only belong to their diagonal parity sets • Disk p can only be recovered by diagonal parity. • This work only consider the recovery of Disk k, with k ≠ p. 21/60
Recovery Combination • A combination of parity sets (Ri, … , Dj) is corresponding to a recovery scheme. • E.g. Using recovery combination (D1, D2, R2, R3) to recover Disk 1. 22/60
Recovery Combination • A combination of parity sets (Ri, … , Dj) is corresponding to a recovery scheme. • E.g. Using recovery combination (D1, D2, R2, R3) to recover Disk 1. • Using D1 to recover d0,1; 23/60
Recovery Combination • A combination of parity sets (Ri, … , Dj) is corresponding to a recovery scheme. • E.g. Using recovery combination (D1, D2, R2, R3) to recover Disk 1. • Using D1 to recover d0,1; • Using D2 to recover d1,1; 24/60
Recovery Combination • A combination of parity sets (Ri, … , Dj) is corresponding to a recovery scheme. • E.g. Using recovery combination (D1, D2, R2, R3) to recover Disk 1. • Using D1 to recover d0,1; • Using D2 to recover d1,1; • Using R2 to recover d2,1; 25/60
Recovery Combination • A combination of parity sets (Ri, … , Dj) is corresponding to a recovery scheme. • E.g. Using recovery combination (D1, D2, R2, R3) to recover Disk 1. • Using D1 to recover d0,1; • Using D2 to recover d1,1; • Using R2 to recover d2,1; • Using R3 to recover d3,1. 26/60
Disk0 Disk1 Disk2 Disk3 Disk4 Disk5 Number of Overlapping Symbols • Assumption • Disk k is in erasure • p1 symbols d0,k, d1,k, … , dp-2,kneed to be recovered • A Recovery Scheme • t erasure symbols from diagonal parity sets • The other p1t symbols from row parity sets • Conclusion • t(p1t) = (t(p1)/2)2+(p1)2/4overlapping symbols • When t=(p1)/2, the number of overlapping symbols is maximized. 27/60
Disk0 Disk1 Disk2 Disk3 Disk4 Disk5 Lower Bound of Disk Reads for SingleFailure Recovery • The maximum number of overlapping symbols is (p1)2/4. • A maximum of (p1)2/4 symbols may be reduced from disk read for recovery. • Conclusion: The lower bound of disk reads for recovery is (p1)2(p1)2/4 =3(p1)2/4. Symbols be read Overlapping symbols 28/60
Read Optimal Recovery Scheme • Any recovery combination consists of (p1)/2 row and (p1)/2 diagonal parity sets is read optimal. -----Named Row-Diagonal Optimal Recovery (RDOR). • Conclusion: RDOR reduces approximately 25%disk reads compared with the naive scheme. 29/60
Outlines • Background • Hybrid Recovery Approach for Single Failure • Row-Diagonal Optimal Recovery (RDOR) for Single Failure • A Recovery Scheme with Minimum Disk Reads • Balancing Disk Reads • Optimizing Memory Usage • Performance Evaluation • Summary 30/60
Example: Two Read Optimal Recovery Combinations (R0, R1, R2, D3, D4, D5) (D0, D1, R2, D3, R4, R5) Disk reads: 5 4 3 3 4 5 3 4 4 4 4 4 4 3 Unbalanced Balanced 31/60
Problem and Questions • During recovery, the disk with the most read operations may slow down the recovery. • Questions • To reduce the recovery time, what is a balanced and read-optimal recovery scheme? • It reads the same (or almost the same) number of symbols from different disks. 32/60
Average Reads from Each Disk • The minimum number of disk reads for recovery is 3(p1)2/4. • To achieve read optimal, (p1)/2 symbols will be read from Disk p(diagonal parity disk). • Conclusion: Average number of symbols to be read from the other surviving disks (except for Disk k and Disk p) is [3(p 1)2/4 (p 1)/2] / (p 1)= (3p 5)/4. Note: A balanced read-optimal recovery should read (p1)/2 symbols from Disk p and (3p 5)/4 symbols from each of other disks 33/60
Example: A Balanced Example (D0, D1, R2, D3, R4, R5) E.g. p=7 Total: 3(p 1)2/4 =27 Disk 7: (p 1)/2=3 Each of other disks: (27 3)/6=4 4 4 4 4 4 4 3 Balanced 34/60
Recovery Sequence • Define a recovery sequence x0, x1, ... , xp2, xp1 corresponds to a recovery combination, where • xi=0 means that di,k is recovered from its row parity set • xi=1 means that di,k is recovered from its diagonal parity set • E.g. (D0, D1, R2, D3, R4, R5) 1 1 0 1 0 0 0 Additional symbol 35/60
Condition of Read Optimal and Balanced Recovery Sequence • Recovery sequence {xi}0≤i≤p1 is read optimal and balanced if and only if • Read optimal x0+x1+…+xp2+xp1=(p1)/2 (1) • Symbols in missing diagonal and added row are recovered by row parity. xp1 k=xp1=0 (2) • (3p 5)/4 symbols to be read from Disk j (0≤j≤p1, j≠k) (3) 36/60
Read Optimal and Balanced Recovery –– An Example • (D0, D1, R2, D3, R4, R5) is a read optimal and balanced recovery combination for p=7 and k=0. • Corresponding recovery sequence x0x1...x5x6=1101000 satisfies: • x0+x1+…+x5+x6=(p1)/2=3 (1) • xp1 k=xp1=0 (2) 37/60
Disk 3 x2=0, d2,3 is read x0=1, d4,3 is used Condition of Read Optimal and Balanced Recovery Sequence (Cont.) • When xi=0 or x<i+jk>p=1, di,j in Disk j is used for recovery. • When di,j is used for recovery, xi(1x<i+jk>p)=0. • The number of symbols that need to be read in Disk j is Number of symbols not read from Disk j 38/60
Read Optimal and Balanced Recovery –– An Example (Cont.) • Recovery sequence x0x1...x5x6=1101000 also satisfies: • 4 symbols to be read from Disk j (0≤j≤6, j≠0) (3) Disk 3 d4,3 39/60
Read Optimal and Balanced Recovery –– An Example (Cont.) • Recovery sequence x0x1...x5x6=1101000 also satisfies: • 4 symbols to be read from Disk j (0≤j≤6, j≠0) (3) Disk 3 E.g. x0=1, d4,3 is used; d4,3 40/60
Read Optimal and Balanced Recovery –– An Example (Cont.) • Recovery sequence x0x1...x5x6=1101000 also satisfies: • 4 symbols to be read from Disk j (0≤j≤6, j≠0) (3) Disk 3 E.g. x0=1, d4,3 is used; x1=1, d5,3 is used; d5,3 41/60
Read Optimal and Balanced Recovery –– An Example (Cont.) • Recovery sequence x0x1...x5x6=1101000 also satisfies: • 4 symbols to be read from Disk j (0≤j≤6, j≠0) (3) Disk 3 E.g. x0=1, d4,3 is used; x1=1, d5,3 is used; x2=0, d2,3 is used; d2,3 42/60
Read Optimal and Balanced Recovery –– An Example (Cont.) • Recovery sequence x0x1...x5x6=1101000 also satisfies: • 4 symbols to be read from Disk j (0≤j≤6, j≠0) (3) Disk 3 E.g. x0=1, d4,3 is used; x1=1, d5,3 is used; x2=0, d2,3 is used; x3=1, d0,3 is used; d0,3 43/60
Read Optimal and Balanced Recovery –– An Example (Cont.) • Recovery sequence x0x1...x5x6=1101000 also satisfies: • 4 symbols to be read from Disk j (0≤j≤6, j≠0) (3) Disk 3 E.g. x0=1, d4,3 is used; x1=1, d5,3 is used; x2=0, d2,3 is used; x3=1, d0,3 is used; x4=0, d4,3 is used; d4,3 44/60
Read Optimal and Balanced Recovery –– An Example (Cont.) • Recovery sequence x0x1...x5x6=1101000 also satisfies: • 4 symbols to be read from Disk j (0≤j≤6, j≠0) (3) Not be read Disk 3 E.g. x0=1, d4,3 is read; x1=1, d5,3 is read; x2=0, d2,3 is read; x3=1, d0,3 is read; x4=0, d4,3 is read; x5=0, d5,3 is read; d5,3 45/60
Recovery Set • Given a recovery sequence {xi}0≤i≤p1, define A={ i | xi=1, 0≤i≤p1} as the recovery set. x0x1...x5x6=1101000 A={0,1,3} 46/60
Recovery Set As if and only if i∈ A and<i+t>p∈A, xix<i+t>p= 1. So • Balanced Recovery Set • A corresponds to a balanced sequence, if and only if • For any t (1≤ t≤ p1), t has a multiplicity of (p3)/4 in the multi-set MA={a1a2| a1, a2∈A, a1≠a2} 47/60
The Existence of Read Optimal and Balanced Recovery Set • By using the concept of (partial) difference-set, we have the following conclusion. • Given a prime number p, and the nonzero squares set D={i2|1≤i≤(p−1)/2} in Fp is a difference-set. • There is g∈Fp, s.t. A=D+g corresponds to a read-optimal and balanced recovery sequence {xi}0≤i≤p−1 for the recovery of Disk k (k≠p). 48/60
Reviewing on Read Balance Problem Find out the average number of disk reads on each disk . Define recovery sequence and recovery set to describe recovery scheme. Find out the constraint conditions that a recovery set is read optimal and balanced. Using the concept of (partial) difference set to solve these constraint conditions. The read optimal and balanced recovery scheme corresponds to the solved recovery set. 49/60
Outlines • Background • Hybrid Recovery Approach for Single Failure • Row-Diagonal Optimal Recovery (RDOR) for Single Failure • Recovery Scheme with Minimum Disk Reads • Balancing of Disk Reads • Optimizing Memory Usage • Performance Evaluation • Summary 50/60