70 likes | 179 Views
http://www.excelonlineclasses.co.nr/ excel.onlineclasses@gmail.com. Excel Online Classes offers following services :. Online Training Development Testing Job support Technical Guidance Job Consultancy Any needs of IT Sector. Nagarjuna K. HDFS & IO formats. AGENDA.
E N D
http://www.excelonlineclasses.co.nr/ excel.onlineclasses@gmail.com http://www.excelonlineclasses.co.nr/
Excel Online Classes offers following services: • Online Training • Development • Testing • Job support • Technical Guidance • Job Consultancy • Any needs of IT Sector http://www.excelonlineclasses.co.nr/
Nagarjuna K HDFS & IO formats http://www.excelonlineclasses.co.nr/
AGENDA • Understanding MapReduce • Map Reduce - An Introduction • Word count – default • Word count – custom http://www.excelonlineclasses.co.nr/
Anatomy of MR . INPUT DATA NODE 1 NODE 2 NODE 2 Map Map Map Interim data Interim data Interim data Reduce Reduce Reduce Node to store output Node to store output Node to store output http://www.excelonlineclasses.co.nr/
Hadoop data types • MR has a defined way of keys and values types for it to move across cluster • Values Writable • Keys WritableComparable<T> • WritableComparable = Writable+Comparable<T> http://www.excelonlineclasses.co.nr/
Frequently used key/value http://www.excelonlineclasses.co.nr/