India-flagINDIA : +91-8099902123   usa-flagUSA : 201-710-8393
Call Me Back

training courses

HADOOP Developer online Training

United Global Soft is established with the aim of providing interactive learning. We provide online trainings to candidates from India, USA, UAE, UK, Canada, Singapore, Malaysia, Australia, Saudi Arabia, New Zealand and Many Other Countries. We provide training by a Real Time Professionals. We are driven by our belief in the potential of new technologies to enrich the world.

We provide classroom training in Hyderabad and online training for candidates based in other locations.

We do provide online training on Hadoop Developer by Industry experts.

Course Description

Hadoop Big Data Training course helps you learn the core techniques and concepts of Big Data and Hadoop ecosystem. It equips you with in-depth knowledge of writing codes using MapReduce framework and managing large data sets with HBase. The topics covered in this course mainly includes- Hive, Pig and setup of Hadoop Cluster.

Course Outcomes
  • Understand Big Data and Hadoop ecosystem
  • Work with Hadoop Distributed File System (HDFS)
  • Write MapReduce programs and implementing HBase
  • Write Hive and Pig scripts
Pre-requisites
  • You should have knowledge of programming in C++ or Java or any other Object Oriented Programming language
  • Hadoop Introduction :
      Background and Fundamentals
    • Hadoop Evolution
    • Why Hadoop ?
    • About Bigdata : volume velocity and variety of data
    • Hadoop as an ecosystem and its Daemons
    • Hadoop as a framework and its features
    • Building Hadoop cluster
    • Components in Hadoop eco System
    • Where is Hadoop used?
    • Where Hadoop is not used ?
    • Which companies are using Hadoop?
    • MapReduce Programming Model
    • MapReduce Architecture
    • MR as a Programming model
    • Various data types
    • Snippet of a MR job
    • Mapper, Reducer, and Driver
    • Map side joins
    • Reduce side joins
    • Combiner class(mini reducer)
    • Distributed cache
    • Partitions wrt performance optimization
    • HDFS (Hadoop Distributed File System)
    • significance of HDFS
    • Features of HDFS
    • Daemons of Hadoop
    • Namenode
    • Datanode
    • Secondary Namenode
    • Job Tracker
    • Task Tracker
    Accessing HDFS
  • through CLI
  • Data storage in HDFS
  • replication and blocks
  • 2) Hive
    • Hive Introduction
    • Hive Architecture
    • Hive Query Language (HiveQL) as SQL Dialect
    • Hive as a warehouse
    • Queries in Hive
    • Loading data from various sources into Hadoop
    • various Joins
    • Functions in Hive
    • User defined functions in Hive (UDF)
    • Data Types in Hive
    • Complex data types in Hive
    • working with views in Hive
    • Buckets in Hive
    • Hive Serde
    3) Pig
    • Why Pig when we can work with Hive
    • Similarities between Hive and Pig
    • Differences between hive and Pig
    • Pig Architecture
    • Loading data from various sources into Pig
    • Pig Scripting
    • User defined functions in Pig
    • Working with Complex data types
    • How to store the processed data by Pig permanently ?
    3) Sqoop
    • Introduction to Sqoop
    • Sqoop an utility tool
    • Working with various RDMSs
    • Working with MySql database practical scenario
    • Various import scenarios
    • Export data
    4) Oozie
    • Usage in live scenarios
    • a scheduling tool
    5) HBase
    • Hadoop Database Introduction
    • Storage in HBaseArchitecture of Hbase
    • Scans
    5) Zookeeper
    • Introduction to Zookeeper
    • Usage in Hadoop Cluster
    6) Flume
    • Introduction to Flume
    • Usage in Live applications
    7) Solr
    • Introduction to Solr
    • Usage in Live applications
    8) Hue
  • Introduction to Hue
  • Usage in Live applications 9) Impala
  • Introduction to Impala
  • Usage in Live applications Impala Architecture
  • Data retrieval using Impala
  • 10) Dashboard applications using Tableau
  • Connecting Tableau with Impala
  • 11) Linux basics
  • 12) IDEs: Eclipse
  • 13) IDEs : Netbeans
  • 14) MapReduce programs in Java
  • 15) Code Debugging Techniques
  • 16) creating Jar files and working with Hadoop
  • 17) Performance Tuning
  • 18) Resume preparation
  • 19) Interview Guidance
  • Every Session with Practical scenario is our strength
  • Assistance in installation and Hadoop Cluster set-up
  • Complete Study Material ( soft copy )
  • Lab Assistance
  • If you want to know more about Hadoop Training do not hesitate to call +91-9393 002 123 or mail us on info@unitedglobalsoft.com