70 likes | 211 Views
MapReduce. Powering Hadoop. Overview. Overview What is MapReduce How Does It Divide Work Example Conclusion References. What Is MapReduce. Originally created by Google Used to query large data-sets Extracts relations from unstructured data Can draw from many disparate data sources.
E N D
MapReduce Powering Hadoop
Overview • Overview • What is MapReduce • How Does It Divide Work • Example • Conclusion • References
What Is MapReduce • Originally created by Google • Used to query large data-sets • Extracts relations from unstructured data • Can draw from many disparate data sources
How It Divides Work http://docs.basho.com/riak/1.3.0/tutorials/querying/MapReduce/
Example • After the Boston Marathon bombing • Twitter and other social media sites were analyzed • Searched for keywords: bomb, terrorist, attack, etc. • Related the users making and mentioned in posts to their geographic location • Used to find “hotspots” • Helped to narrow the search for those responsible
Conclusion • Pits computing power against unstructured data • Can be used to index and search unstructured data • Lends itself well to distributed computing systems
References • http://docs.basho.com/riak/1.3.0/tutorials/querying/MapReduce/ • http://www.informationweek.com/big-data/news/software-platforms/how-foreign-names-trip-up-terrorism-data/240154159 • http://blog.data-miners.com/2010/01/hadoop-and-mapreduce-normalizing-data.html