HADOOP Course Details
 

Batch Date: July 4th @ 10:30AM

Faculty: Mr. Pavan

Duration: FREE Workshop

Venue:
DURGA SOFTWARE SOLUTIONS at Maitrivanam
Plot No: 202, IInd Floor,
HUDA Maitrivanam,
Ameerpet, Hyderabad-500038.

Ph. No: +91 - 9246212143, 80 96 96 96 96



Syllabus:

HADOOP

BigData

  • What is BigData
  • Characteristics of BigData
  • Problems with BigData
  • Handling BigData

Distributed Systems

  • Introduction to Distributed Systems
  • Problems with Existing Distributed Systems in Dealing with BigData
  • Requirements of a New Approach
  • HADOOP history

HADOOP Core Concepts

  • HDFS
  • MapReduce

HADOOP Cluster

  • Install Pseudo cluster
  • Install Multi node cluster
  • Introduction to HADOOP Cluster Configuration
  • How the Five Daemons Work
    • NameNode
    • JobTracker
    • SecondaryNameNode
    • TaskTracker
    • DataNode
  • Introduction to HADOOP EcoSystem projects

Writing MapReduce programs

  • Understanding HADOOP API
  • Basic form of a HADOOP MapReduce application
    • Driver Code
    • Mapper Code
    • Reducer Code
  • Eclipse integration with HADOOP for Rapid Application Development

Understanding ToolRunner

  • More about ToolRunner
  • Combiner
  • Reducer
  • configure and close methods

Common MapReduce Algorithms

  • Sorting
  • Searching
  • Indexing
  • TF-IDF
  • Word Co-Occurrence

HADOOP EcoSystem

  • Flume
  • Sqoop
    • Importing data from RDBMS using Sqoop
  • Hive
    • Introduction to Hive
    • Creating tables in Hive
    • Running queries
  • Pig
    • Introduction to Pig
    • Different modes of Pig
    • When to use Hive and when to use Pig
  • HBASE
    • Basics of HBASE

Advanced MapReduce Programming

  • Developing custom Writable
  • Developing custom WritableComparable
  • Understanding Input Output formats

Introduction to Oozie

Hands-on Exercises for each concept

 

SPARK & SCALA

SCALA

  • Why Scala
  • Scala vs Java
  • Scala Basics
  • Scala data types
  • Variable Declarations
  • Control Structures
  • Interactive Scala: the Scala Shell
  • Scala Scripts
  • Functions
  • Scala IDE

Object Oriented Programming in Scala

  • Traits
  • When to use traits in Scala
  • Classes and Objects Basics
  • Case Class
  • Exception Handling in Scala

Functional Programming in Scala:

  • What is Functional Programming
  • Difference between OOP and Functional Programming
  • Closures in Scala
  • Primary, Auxiliary Constructors
  • Singleton Objects
  • Companion Objects
  • Abstract Classes
  • Currying
  • Expressions
  • Anonymous Functions

SPARK:

  • Spark vs MapReduce
  • Architecture of Spark
  • Spark Shell introduction
  • Creating Spark Context
  • Spark Project with Maven in Eclipse
  • Cache and Persist in Spark
  • File Operations in Spark

RDD:

  • What is RDD
  • Transformations and Actions
  • Loading data through RDD
  • Key-value pair RDDs
  • Pair RDD operations
  • Running a Spark application with spark-shell
  • Deploying an application with spark-submit

Spark-SQL:

  • Introduction to Spark SQL
  • Hive vs SparkSQL
  • Processing different file formats using Spark SQL
  • DataFrames
  • DAG
  • Lineage Graph
  • Cluster types
  • Optimizers
  • Structured Streaming
  • RDDs to Relations

Spark Streaming:

  • Introduction to Spark Streaming
  • Architecture of Spark Streaming
  • Spark Streaming vs Flume
  • Introduction to Kafka
  • Kafka Architecture
  • Spark Streaming integration with Kafka Overview
  • Real Time Examples