Hadoop Developer Training

Welcome to the Course!

  • Become a Certified Hadoop Developer
  • Welcome to the Course

Introduction to Hadoop

  • Big Data – Big value
  • Understanding Big Data
  • Hadoop and other Solutions
  • Distributed Architecture – A Brief Overview
  • Hadoop Releases

Hadoop Setup

  • Setup Hadoop
  • Linux (Ubuntu) – Tips and Tricks
  • HDFS commands
  • Running a MapRed Program

HDFS Architecture and Concepts

  • HDFS Concepts
  • HDFS Architecture
  • HDFS Read and Write
  • Special Commands

Understanding MapReduce

  • MapReduce Introduction
  • Understanding MapReduce
  • Running First MapReduce Program
  • Combiner And Tool Runner
  • Recap Map, Reduce and Combiner Part

MapReduce Types and Formats

  • MapReduce Types and Formats
  • Experiments with Defaults
  • IO Format Classes
  • Experiments with File Output – Advanced Concept

Classic MapReduce and YARN

  • Anatomy of MapReduce job run
  • Job Run- Classic MapReduce
  • Failure Scenarios – Classic Map Reduce
  • Job Run – YARN
  • Failure Scenario – YARN
  • Job Scheduling in MapReduce
  • Shuffle and Sort
  • Performance Tuning Features

Advanced MapReduce Concepts

  • Looking at Counters
  • Hands on – Counters
  • Sorting Ideas with Partitioner – Part 1
  • Sorting Ideas with Partitioner – Part 2
  • Map Side Join Operation
  • Reduce Side Join Operation
  • Side Distribution of Data
  • Hadoop Streaming and Hadoop Pipes

Introduction to Hadoop Ecosystem

  • Introduction to Pig
  • Introduction to Hive
  • Introduction to Sqoop
  • Knowing Sqoop
  • Introduction to Ecosystem

Final Frontier: Preparation for CDH-410 Certification Exams

  • Final Exam Part 1
  • Final Exam Part 2
  • Quiz 33
  • Questions and Answer
  • Final Exam Part 3