
  Hadoop and Architecture Online Training and Placement

  Hadoop and Architecture Training for OPT CPT Students



Delving Deep into Hadoop – Course Contents

 


Introduction to Hadoop and Architecture

 



 Hadoop 1.0 Architecture

 


  • Introduction to Hadoop & Big Data
  • Hadoop Evolution
  • Hadoop Architecture
  • Networking Concepts
  • Use cases: where Hadoop fits in

 

Hadoop 2.0 Architecture

 

  • Limitations of Hadoop 1.0 Architecture
  • Features of Hadoop 2.0 Architecture
  • HDFS Federation
  • High Availability of the NameNode
  • YARN – Yet Another Resource Negotiator
  • Developing Applications on YARN
  • Non-MR applications on top of YARN

Quiz on Architecture Concepts



Cluster Installation


Hadoop Cluster Installation

 


  • Types of Hadoop Cluster
  • Installing a Pseudo-Distributed Mode Cluster
  • Walkthrough of built-in scripts, directories, configuration files and port numbers
  • Discussion of real-world cluster sizing

Detailed documentation on Installation Procedure



Distributed File System - HDFS

 

 

HDFS Commands

 

  • Introduction to HDFS Commands
  • Discussion on scenarios where specific commands are applicable
  • Introduction to Advanced HDFS Commands, including fine-tuning of the cluster

Detailed documentation on all the HDFS Commands

Custom Script building using HDFS & Unix commands

Quiz on HDFS Commands




Map Reduce - MR

 

 

Map Reduce using Java


  • Introduction to Map Reduce Architecture
  • Detailed discussion on different phases of MR
    • Mapper
    • Reducer
    • Splitting
    • Sorting
    • Shuffling
    • Combiner
    • Partitioning
  • Developing Map Reduce Application from Scratch using different use cases
  • Discussion of the differences between the Old MR API & the New MR API
  • Introduction to different file formats and their internal features (Sequence files, Binary, etc.)
  • Analytics using MR to derive a Banking Solution
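
The phases listed above can be pictured with a small in-process word count. This is only a teaching sketch in plain Python, not Hadoop's Java API: the function names, the two-line split size and the toy partition hash are all assumptions made for illustration.

```python
from collections import defaultdict

def split_input(text, lines_per_split=2):
    """Splitting: cut the input into chunks, one per (simulated) map task."""
    lines = text.splitlines()
    return [lines[i:i + lines_per_split]
            for i in range(0, len(lines), lines_per_split)]

def mapper(line):
    """Mapper: emit a (word, 1) pair for every word in a line."""
    for word in line.lower().split():
        yield word, 1

def combiner(pairs):
    """Combiner: pre-aggregate one map task's output before the shuffle."""
    local = defaultdict(int)
    for key, value in pairs:
        local[key] += value
    return list(local.items())

def partitioner(key, num_reducers):
    """Partitioning: choose a reducer for a key (toy hash, not Hadoop's)."""
    return sum(key.encode("utf-8")) % num_reducers

def reducer(key, values):
    """Reducer: sum all counts that arrived for one key."""
    return key, sum(values)

def run_job(text, num_reducers=2):
    # Map and combine, one pass per split.
    map_outputs = []
    for split in split_input(text):
        pairs = [pair for line in split for pair in mapper(line)]
        map_outputs.append(combiner(pairs))

    # Shuffling: route each pair to its partition and group values by key.
    partitions = [defaultdict(list) for _ in range(num_reducers)]
    for output in map_outputs:
        for key, value in output:
            partitions[partitioner(key, num_reducers)][key].append(value)

    # Sorting: visit keys in order within each partition, then reduce.
    results = {}
    for partition in partitions:
        for key in sorted(partition):
            word, total = reducer(key, partition[key])
            results[word] = total
    return results

if __name__ == "__main__":
    sample = "big data big cluster\nhadoop cluster\nbig hadoop"
    print(run_job(sample))
```

In a real cluster each phase runs on different machines and the shuffle moves data over the network. The sketch keeps everything in one process so the order of the phases is easy to follow.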

Case Study on Map Reduce (Customer Sentiment Analyser)


Map Reduce using Python – Streaming

 

  • Developing Map Reduce Application using Python
  • Discussion of different features available in Streaming 
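
As a rough illustration of the streaming model, the sketch below writes a mapper and a reducer as plain Python functions over lines of text. In a streaming job each would normally live in its own script that reads standard input and writes tab-separated key/value lines to standard output. The `year,temperature` record layout and all function names here are assumptions for the example, not part of the course material.

```python
from itertools import groupby

def map_lines(lines):
    """Mapper: turn 'year,temperature' records into 'year<TAB>temperature'."""
    for line in lines:
        parts = line.strip().split(",")
        if len(parts) != 2:
            continue  # skip malformed records
        year, temp = parts
        try:
            float(temp)
        except ValueError:
            continue  # skip records whose temperature is not numeric
        yield f"{year}\t{temp}"

def reduce_lines(sorted_lines):
    """Reducer: input arrives sorted by key; emit the max temperature per year."""
    pairs = (line.rstrip("\n").split("\t", 1) for line in sorted_lines)
    for year, group in groupby(pairs, key=lambda kv: kv[0]):
        hottest = max(float(temp) for _, temp in group)
        yield f"{year}\t{hottest}"

def simulate(records):
    """Local stand-in for the framework: map, sort by line, then reduce."""
    mapped = list(map_lines(records))
    return list(reduce_lines(sorted(mapped)))

if __name__ == "__main__":
    sample = ["1950,22.0", "1951,18.5", "1950,31.5", "bad line", "1951,-3"]
    for line in simulate(sample):
        print(line)
```

The `simulate` helper only mimics the sort that the framework performs between the two steps, which makes it convenient for testing the logic locally before submitting a job.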

Case Study on Map Reduce Streaming (Analytics on Temperature Datasets)

Quiz on Map Reduce


Hadoop Eco System Components

 

 

Hive (Data Warehouse on top of HDFS)

 

  • Introduction to Hive Architecture
  • Configuring the Hive Metastore in different ways
  • Basic Queries in Hive (DDL, DML)
  • Advanced features of Hive
    • Partitioning
    • Bucketing
    • Sampling
    • Multi Table Load Queries
    • Serialization & Deserialization (SerDe)
  • Dealing with different formats of data (Flat file, JSON, CSV, etc.)
  • Query optimization in Hive
  • Developing User Defined Functions (UDFs) in Java & Python

 

Case Study (Analytics on Telecom Datasets)

Quiz on Hive



PIG (Data Flow Language)


  • Introduction to Pig Latin
  • Basic Commands in Pig
  • Explanation of advanced features of Pig with real-world scenarios
  • Different ways of using PigStorage
  • Dealing with Unstructured data
  • Developing Regular Expressions
  • Developing User Defined Functions (UDFs) in Java & Python

 

Case Study (Analytics on Books Datasets)

Quiz on Pig



SQOOP (Import – Export utility)

 

  • Introduction to Sqoop
  • Basic Sqoop Commands
  • Advanced Import Features
  • Advanced Export Features
  • Upsert Calls
  • EVAL
  • Compressed Formats

Case Study (Analytics on Telecom Datasets)

Quiz on Sqoop



HBASE (Versioned Database)



  • Introduction to HBASE & NoSQL
  • Basic differences between Row Oriented and Column Oriented storage
  • Basic HBASE Commands
  • Advanced HBASE Features
    • Versions
    • Compression Techniques
    • Bloom Filters
    • Sequential Scans
  • Bulk Loads into HBASE

Case Study on HBASE

Quiz on HBASE



Flume



  • Flume Architecture
  • Configuring Flume Components
    • Source
    • Sink
    • Channel
    • Agents
  • Building Flume Config files for different scenarios
    • Basic Config File building
    • Config file for connecting to different File Servers
    • Config file for connecting to Web Servers

Quiz on Flume



Spark


 

  • Introduction to Spark and In-memory applications
  • Understanding RDD (Resilient Distributed Dataset)
  • Spark Context and Spark SQL Context
  • Introduction to MLlib and Spark Streaming

Quiz on Spark



Kafka


 

  • Introduction to Kafka architecture
  • Single and Multi-Broker configuration
  • Java Sample Producer
  • Integrating Kafka with Hadoop (via Flume)

Quiz on Kafka

 


Finally, this series of practical sessions ends with a quiz on the entire course.

 

 


