MapReduce
CSE 454
Slides based on those by Jeff Dean, Sanjay Ghemawat, and Dan Weld.

What’s the Problem?
So far: classification + IR. Simple enough, counting a bunch of words…
100 TB datasets
Scanning on 1 node: 23 days
On 1000 nodes: 33 mins
Sounds great, but what about MTBF?
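The scan times quoted above can be reproduced with simple arithmetic. The sketch below is illustrative only: the per-node read rate of 50 MB/s is an assumption (the slides do not state it; it is the rate that makes 100 TB take about 23 days on one node), and the per-node MTBF of 1000 days is a purely hypothetical figure used to show why failures matter at scale. The function names are made up for this example.

```python
SECONDS_PER_DAY = 86_400


def scan_seconds(dataset_bytes, nodes, bytes_per_sec_per_node):
    """Time to scan the dataset if the work splits evenly across nodes."""
    return dataset_bytes / (nodes * bytes_per_sec_per_node)


def cluster_mtbf_days(node_mtbf_days, nodes):
    """Mean time between failures anywhere in the cluster,
    assuming independent nodes with a constant failure rate."""
    return node_mtbf_days / nodes


DATASET_BYTES = 100 * 10**12   # 100 TB, from the slide
NODE_THROUGHPUT = 50 * 10**6   # ASSUMPTION: 50 MB/s sequential read per node
NODE_MTBF_DAYS = 1000          # ASSUMPTION: hypothetical per-node MTBF

one_node_days = scan_seconds(DATASET_BYTES, 1, NODE_THROUGHPUT) / SECONDS_PER_DAY
thousand_node_minutes = scan_seconds(DATASET_BYTES, 1000, NODE_THROUGHPUT) / 60
failure_interval_days = cluster_mtbf_days(NODE_MTBF_DAYS, 1000)

print(f"1 node:     {one_node_days:.1f} days")
print(f"1000 nodes: {thousand_node_minutes:.1f} minutes")
print(f"Some node fails about every {failure_interval_days:.1f} days")
```

Under these assumptions a single scan finishes well before the next expected failure, but a cluster of this size would still see a node fail roughly daily, so any long-running or repeated job has to tolerate failures.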