Develop Big Data processing applications using Apache Hadoop, learning essential tools for managing and analyzing large datasets in Hadoop ecosystems.

Course Objectives

  • Hadoop & Big Data
  • HDFS
  • MapReduce
  • Pig
  • Hive

Upcoming Schedules

Course Prerequisites

Required

  • Comfortable with Java programming language (most programming exercises are in java)
  • Comfortable in Linux environment (be able to navigate Linux command line, edit files using vi / nano)

Course Outline

Introduction to Hadoop
arrow iconarrow icon

  • Hadoop history, concepts
  • Eco system
  • Distributions
  • High level architecture
  • Hadoop myths
  • Hadoop challenges
  • Hardware / software
  • Lab : first look at Hadoop