course detail

Hadoop

Learn Hadoop Training In Chennai At BITA  Academy– No 1 Hadoop Training Institute In Chennai. Call 956600-4616 For More Details. Register today for learning Hadoop in Chennai.


Hadoop is an open source, Java-based programming framework that supports the processing and storage of extremely large data sets in a distributed computing environment. It is part of the Apache project sponsored by the Apache Software Foundation.

Hadoop makes it possible to run applications on systems with thousands of commodity hardware nodes, and to handle thousands of terabytes of data

Hadoop is here to stay and lead the industry in helping the business with numerous ways to store, retrieve and analyze data.

HADOOP SYLLABUS:

INTRODUCTION

Big Data

3Vs

Role of Hadoop in Big data

Hadoop and its ecosystem

Overview of other Big Data Systems

Requirements in Hadoop

UseCases of Hadoop

HDFS

Design

Architecture

Data Flow

CLI Commands

Java API

Data Flow Archives

Data Integrity

WebHDFS

Compression

MAPREDUCE

Theory

Data Flow (Map – Shuffle – Reduce)

Programming [Mapper, Reducer, Combiner, Partitioner]

Writables

InputFormat

Outputformat

Streaming API

ADVANCED MAPREDUCE PROGRAMMING

Counters

CustomInputFormat

Distributed Cache

Side Data Distribution

Joins

Sorting

ToolRunner

Debugging

Performance Fine tuning 

ADMINISTRATION – Information required at Developer level

Hardware Considerations – Tips and Tricks

Schedulers

Balancers

NameNode Failure and Recovery

HBase

NoSQL vs SQL

CAP Theorem

Architecture

Configuration

Role of Zookeeper

Java Based APIs

MapReduce Integration

Performance Tuning

HIVE

Architecture

Tables

DDL – DML – UDF – UDAF

Partitioning

Bucketing

Hive-Hbase Integration

Hive Web Interface

Hive Server

OTHER HADOOP ECOSYSTEMS

Pig (Pig Latin , Programming)

Sqoop (Need – Architecture ,Examples)

Introduction to Components (Flume, Oozie,ambari)

 

Free Demo Classes