Availability in Globally Distributed Storage Systems

Availability in Globally Distributed Storage Systems Presented By Ala`a Ibrahim Daniel Ford, Franc¸ois Labelle, Florentina I. Popovici, Murray Stokely, Van-AnhTruong,LuizBarroso, CarrieGrimes, and Sean Quinlan

OUTLINE • Introduction • Disks failures • Correlated Failures • Fault Tolerance MechanismsMarkov Model of Stripe Availability • Markov Model Findings • Conclusions

Data Center

Data Center Components Server Components Interconnects  Racks Cluster of Racks

Data Center Components ALL THESE COMPONENTS CAN FAIL Server Components Interconnects  Racks Cluster of Racks

Cell, Stripe and Chunk Stripe 1 Stripe 2 Stripe 1 Stripe 2 GFS Instance 1 GFS Instance 2 Chunks Chunks Chunks Chunks CELL 2 CELL 1

Failure Sources • Failure Sources • Hardware – Disks, Memory etc. • Software – chunk server process • Network Interconnect • Power Distribution Unit • Availability • Reasons of unavailable • Overloaded • Crash or restart • Hardware error • Automated repair processes

Disks failures Node restarts Planned machine reboots Unplanned machine reboots Unknown

Fault Tolerance Mechanisms • Replication (R = n) • ‘n’ identical chunks (replication factor) are placed across storage nodes in different rack/cell/DC • Erasure Coding ( RS (n, m)) • ‘n’ distinct data blocks and ‘m’ code blocks • Can recover utmost ‘m’ blocks from the remaining ‘n-m’ blocks

Replication 5 replicas 1 Chunk Fast Encoding / Decoding Very Space Inefficient

Erasure Coding ‘n’ data blocks ‘m’ code blocks Encode ‘n + m’ blocks

Erasure Coding ‘n’ data blocks ‘m’ code blocks Encode ‘n’ data blocks ‘n + m’ blocks Decode Highly Space Efficient Slow Encoding / Decoding

Correlated Failures • Failure Domain • Set of machines that simultaneously fails from a common source of failure • Failure Burst • Sequence of node failures each occurring within a time window ‘w’ of the next • Window 120 s

Correlated Failures… Failure Burst (Window Size)

Markov Model • Chunk placement policy • Cell Simulation • trace-based simulation • Priority queue

Markov Chain

Conclusion • The findings provides a feedback for improving • Replication and encoding schemes • Recovery rate

Availability in Globally Distributed Storage Systems