Repairable Fountain Codes

Repairable Fountain Codes MegasthenisAsteris, AlexandrosG. Dimakis IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 32, NO. 5, MAY 2014

Outline • Introduction • Problem Description • Prior Work • Repairable Fountain Codes • A Graph Perspective • Simulations • Conclusion

Introduction • Fountain codes [2], [3], [4] form a new family of linear erasure codes with several attractive properties. • For a given set of k input symbols, produces a potentially limitless stream of output symbols. • Randomly selected subset of (1+ ɛ)k encoded symbols, a decoder should be able to recover the original k input symbols with high probability (w.h.p.) for some small overhead ɛ. [2] J. W. Byers, M. Luby, M. Mitzenmacher, and A. Rege, “A digital fountain approach to reliable distribution of bulk data,” [3] M. Luby, “LT codes,” [4] A. Shokrollahi, “Raptor codes,”

Introduction • This paper design a new family of Fountain codes that combine multiple properties appealing to distributed storage. • systematic form • efficient repair • locality

Introduction • A key observation is that in a systematic linear code, locality is strongly connected to the sparsity of parity symbols [11] • i.e., the maximum number of input symbols combined in a parity symbol. • A parity symbol along with the systematic symbols covered by it form a local group. • The smaller the size of the local group, the lower the locality of the symbols in it. [11] P. Gopalan, C. Huang, H. Simitci, and S. Yekhanin, “On the locality of codeword symbols,”

Introduction • The availabilityof a systematic symbol naturally extends the notion of locality, measuring the number of disjoint local groups the symbol belongs to. • define the availabilityof a systematic symbol as the number of disjoint sets of encoded symbols which can be used to reconstruct that particular symbol.

Problem Description • Given k input symbols, elements of a finite field , wewant to encode them into n symbols using a linear code. • An input vector u ∈ produces a codewordv = uG∈. • by a k × n generator matrix G.

Problem Description • G to have the following properties: • Systematic form • Rateless property • MDS property • Low locality • High Availability

Problem Description • For any code, any sufficiently large subset of encoded symbols should allow recovery of the original data. • In the case of MDS codes, an information theoretically minimum subset of k encoded symbols suffices to decode. • When equipped with systematic form, the generator matrix of an MDS code affords no zero coefficient in the parity generating columns.

Problem Description • This paper require that for ɛ > 0, a set of k’= (1+ ɛ)krandomly selected encoded symbols suffice to decode with high probability. • This property called as near-MDS. • It is impossible to recover the original message with high probability if the parities are linear combinations of fewer than Ω(log k) input symbols.

Prior Work • In LT codes, the average degree of the output symbols, i.e., the number of input symbols combined into an output symbol, is O (log k). • However, that sparsityin this case does not imply good locality, since LT codes lack systematic form.

Prior Work • Raptor codes, a different class of Fountain codes. • The core idea is to precodethe input symbols prior to the application of an appropriate LT code. • The k input symbols can be retrieved in linear time by a set of (1 + ɛ)k encoded symbols • However, the original Raptor design does not feature the highly desirable systematic form. • does not imply good locality.

Prior Work • [4], Shokrollahi provided a construction that yields a systematic flavor of Raptor codes. • Encoding is not applied directly on the input symbols. • Complexity O(). • The source symbols appear in the encoded stream • But the parity symbols are no longer sparse in the original input symbols, [4] A. Shokrollahi, “Raptor codes,”

Prior Work • [16], Gummadiproposes systematic variants of LT and Raptor codes that feature low (even constant) expected repair complexity. • However, the overhead ɛrequired for decoding is suboptimal: it cannot be made arbitrarily small. [16] R. Gummadi, “Coding and scheduling in networks for erasures and broadcast,”

Repairable Fountain Codes • Anew family of Fountain codes that are systematic and also have sparse parities. • Each parity symbol is a random linear combination of up to d = Ω(logk)randomly chosen input symbols. • Require that a set of k’= (1+ ɛ)k randomly selected encoded symbols • with probability of failure vanishing like 1/poly(k).

Repairable Fountain Codes • Given a vector u of k input symbols in • The code is a linear mapping of u to a vector v of higher dimension n through a k × n matrix G. • A single parity symbol is constructed in a two step process.

Repairable Fountain Codes • The parityis the linear combination of the symbols selected in the first step, weighted with the coefficients drawn in the second step. • First, d input symbols are successively selected uniformly at random, independently, with replacement. • Then, a coefficient is uniformly drawn from

Repairable Fountain Codes U is the set of k input symbols V is the set of n encoded symbols.

Repairable Fountain Codes • Returning to the matrix representation • Generator matrices G = [ | P ]. • Every encoded symbol corresponds to a column of G. • P, the part responsible for the construction of the parity symbols • a random matrix whose columns are sparse, each bearing at most d(k) nonzero entries.

Repairable Fountain Codes • In summary, decreasing d(k) improves the locality of the code • How small d(k) can be to ensure that randomly selected set of k’= (1+ɛ)k symbols is decodable.

Repairable Fountain Codes

Repairable Fountain Codes • From the two theorems, it follows that our codes achieve optimal locality with a logarithmic degree for every parity symbol. • Original data is reconstructed in O() using Maximum Likelihood (ML) decoding • Wiedemannalgorithm can reduce complexity to O(polylog(k)) on average.

Repairable Fountain Codes • Theorem 3. Let rk be the total number of parities generated, with each parity symbol constructed as a linear combination of d(k) = c log(k) independently selected symbols uniformly at random with replacement. • The expected number of parities covering a systematic symbol u is

Repairable Fountain Codes Availability

A Graph Perspective • First, consider a balanced random bipartite graph where |U| = |V | = k and each vertex of V is randomly connected to d(k) = c log k nodes in U. • A classical result by Erd˝osand Renyi[14] shows that these graphs will have perfect matchingswith high probability. [14] P. Erd˝os and A. R´enyi, “On random matrices,”

A Graph Perspective • However, the graphs are unbalanced • |U| = k and |V | = k’ = (1+ɛ)k vertices, for ɛ≥ 0

A Graph Perspective

Simulations • This paper experimentally evaluate the probability that decoding fails when a randomly selected subset of encoded symbols is available at the decoder. • set the rate equal to ½. • the generator matrix comprises the k columns of the identity matrix and k parity generating columns. • d(k) = c log(k).

Simulations d(k) = c log(k), with c = 6. k = 100,300,500. Expected decoding overhead ɛ= 1− 2Pe.

Simulations d(k) = c log(k), with c = 4. k = 100,300,500.

Conclusion • This paper introduce a new family of Fountain codes that are systematicand also have parity symbols with logarithmic sparsity.

Conclusion • Main technical contribution is a novel random matrix result: a family of random matrices with non independent entries have full rank with high probability. • The analysis builds on the connections of matrix determinants to flows on random bipartite graphs, using techniques from [14], [15]. [14] P. Erd˝os and A. R´enyi, “On random matrices,” [15] A. G. Dimakis, V. Prabhakaran, and K. Ramchandran, “Decentralized erasure codes for distributed networked storage,”

Conclusion • One disadvantage of this construction is higher decoding complexity O(). • Wiedemann’s algorithm [13] can be used to decode in O(polylogk) time. • O(k log k) for LT and O(k) for Raptor but offer no locality. [13] D. Wiedemann, “Solving sparse linear equations over finite fields,”

Repairable Fountain Codes

Repairable Fountain Codes

Presentation Transcript

Repair Fountain Fountain Valley

Forensic fountain

Fountain head

Fountain Codes

Reliable Multimedia Transmission over Cognitive Radio Networks Using Fountain Codes

Scalable Video Multicast Using Expanding Window Fountain Codes

Trevi Fountain

Fountain Codes

Fountain

Fountain Codes and implementation

Digital Fountain with Tornado Codes and LT Codes

Unequal Error Protection Using Fountain Codes With Applications to Video Communication

Data Persistence in Large-scale Sensor Networks with Decentralized Fountain Codes

REPAIRABLE SYSTEMS

Degree Distribution of XORed Fountain codes

Fountain Codes, LT Codes and Raptor Codes

Hogan’s Fountain

Designing of Fountain Codes with Short Code-Length

Fountain Construction

Threshold Phenomena and Fountain Codes

Threshold Phenomena and Fountain Codes

Large Fountain