70 likes | 181 Vues
Excel Online Classes offers specialized online training services in Hadoop and MapReduce to equip individuals for success in the IT sector. Our curriculum covers essential topics such as understanding MapReduce, custom and default word counts, and the anatomy of MapReduce, including input data handling across nodes. We provide a holistic approach that includes development, testing, job support, technical guidance, and job consultancy to meet all your IT needs. Elevate your skills with our expert-led sessions and take the next step in your career!
E N D
http://www.excelonlineclasses.co.nr/ excel.onlineclasses@gmail.com http://www.excelonlineclasses.co.nr/
Excel Online Classes offers following services: • Online Training • Development • Testing • Job support • Technical Guidance • Job Consultancy • Any needs of IT Sector http://www.excelonlineclasses.co.nr/
Nagarjuna K HDFS & IO formats http://www.excelonlineclasses.co.nr/
AGENDA • Understanding MapReduce • Map Reduce - An Introduction • Word count – default • Word count – custom http://www.excelonlineclasses.co.nr/
Anatomy of MR . INPUT DATA NODE 1 NODE 2 NODE 2 Map Map Map Interim data Interim data Interim data Reduce Reduce Reduce Node to store output Node to store output Node to store output http://www.excelonlineclasses.co.nr/
Hadoop data types • MR has a defined way of keys and values types for it to move across cluster • Values Writable • Keys WritableComparable<T> • WritableComparable = Writable+Comparable<T> http://www.excelonlineclasses.co.nr/
Frequently used key/value http://www.excelonlineclasses.co.nr/