DATA ANALYST Course Details
 


Batch Date: Nov 24th @ 7:00AM

Faculty: Mrs. Sasmitha

Duration: 45 Days

Venue:
DURGA SOFTWARE SOLUTIONS at Maitrivanam
Plot No: 202, IInd Floor,
HUDA Maitrivanam,
Ameerpet, Hyderabad-500038.

Ph.No: +91 - 9246212143, 80 96 96 96 96


Syllabus:

DATA ANALYST with Python, Pandas, Plotly, MongoDB, Tableau and Apache Spark

Module-1: Core Python

  • Anaconda Python Installation
  • Jupyter Notebook Tutorial
  • Introduction to Python
  • Variables, Comments
  • Function
  • Lambda Expression
  • If Else, While, For Loop
  • Taking Input from Keyboard
  • List
  • Tuple
  • Set
  • Dictionary
  • Coding Test-2
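
As a taste of the Core Python topics above, here is a minimal sketch that touches variables, a function, a lambda expression, the four collection types and the loop constructs. All names and values (`average`, `scores`, `student` and so on) are made up for illustration and are not taken from the course material. Keyboard input via `input()` is left out so the snippet runs without prompting.

```python
# Variables and comments
course = "Data Analyst"
duration_days = 45

# Function: returns the arithmetic mean of a non-empty sequence
def average(values):
    return sum(values) / len(values)

# Lambda expression: a small anonymous function
square = lambda x: x * x

# List, tuple, set, dictionary (illustrative values only)
scores = [70, 85, 90, 85]
point = (3, 4)
unique_scores = set(scores)          # duplicates are removed
student = {"name": "Asha", "scores": scores}

# If/else inside a for loop
labels = []
for s in scores:
    if s >= 80:
        labels.append("pass")
    else:
        labels.append("retry")

# While loop
countdown = []
n = 3
while n > 0:
    countdown.append(n)
    n -= 1

print(average(scores), square(4), unique_scores, labels, countdown)
```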

Module-2: Pandas

  • Creating Data Frames
  • Reading and Writing Semi-Structured Data (CSV, Excel, JSON, XML) using DataFrames
  • Selection
  • Conditional Selection
  • Groupby
  • Sort values
  • Pivot Table
  • Merging (left, right, inner, outer)
  • Joining
  • Concat
  • Renaming, dropping values
  • Series
  • Applying Statistical Functions on data
  • Assignment-3
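
The Pandas topics above can be sketched on two tiny DataFrames. The tables, column names and values below are invented for illustration; the calls themselves (`groupby`, `sort_values`, `merge`, `pivot_table`, `rename`, `drop`) are standard pandas methods.

```python
import pandas as pd

# Hypothetical sample data, invented for this example
orders = pd.DataFrame({
    "order_id": [1, 2, 3, 4],
    "customer_id": [10, 11, 10, 12],
    "amount": [250.0, 100.0, 150.0, 300.0],
})
customers = pd.DataFrame({
    "customer_id": [10, 11, 13],
    "city": ["Hyderabad", "Chennai", "Pune"],
})

# Conditional selection: keep rows where amount exceeds 120
large_orders = orders[orders["amount"] > 120]

# Groupby: total amount per customer
total_by_customer = orders.groupby("customer_id")["amount"].sum()

# Sort values: largest order first
sorted_orders = orders.sort_values("amount", ascending=False)

# Merging: the `how` argument selects inner, left, right or outer
inner = orders.merge(customers, on="customer_id", how="inner")
left = orders.merge(customers, on="customer_id", how="left")
outer = orders.merge(customers, on="customer_id", how="outer")

# Pivot table: total amount per city
pivot = inner.pivot_table(index="city", values="amount", aggfunc="sum")

# Renaming a column and dropping another
renamed = orders.rename(columns={"amount": "amount_inr"}).drop(columns=["order_id"])

print(total_by_customer)
print(pivot)
```

Comparing the row counts of `inner`, `left` and `outer` is a quick way to see how each merge type treats customers that appear in only one of the two tables.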

Module-3: Data Visualization Using Plotly

  • Scatter plot
  • Line Plot
  • Spread Chart
  • Box plot
  • Violin Plot
  • Swarm Plot
  • Strip Plot
  • Maps
  • Sunburst Charts
  • Bar chart
  • Histogram
  • Rug plot
  • Customizing Plots
  • Hover Mode, Size, Color
  • Setting Range of Axis
  • Rotating axis

Project-1: Creating a Covid-19 Dashboard

Module-4: Time Series

  • Resampling
  • Time shifting
  • Rolling, Expanding
  • Trend
  • Seasonality
  • Time Series Decomposition
  • Simple Moving Average
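
A minimal sketch of resampling, time shifting, rolling and expanding windows with pandas is shown here. The daily series is hypothetical (values 1 to 14 on consecutive days), and a 3-day window is an arbitrary choice for the simple moving average. Decomposition into trend and seasonality needs an extra library and is not shown.

```python
import pandas as pd

# Hypothetical daily series: values 1..14 on 14 consecutive days
idx = pd.date_range("2021-01-01", periods=14, freq="D")
sales = pd.Series(range(1, 15), index=idx, dtype="float64")

# Resampling: mean over consecutive 7-day bins
seven_day_mean = sales.resample("7D").mean()

# Time shifting: move every value one day later
shifted = sales.shift(1)

# Rolling window: 3-day simple moving average
sma3 = sales.rolling(window=3).mean()

# Expanding window: running mean of all values seen so far
expanding_mean = sales.expanding().mean()

print(seven_day_mean)
print(sma3.head())
```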

Module-5: Tableau

  • Getting Started with Tableau Software
  • Connecting your data to Tableau
  • Creating line, bar, area, scatter charts etc.
  • Tableau Calculation - SUM, AVG and Aggregated features
  • Applying new Data Calculation to your Visualization
  • Formatting tools and menus
  • Sorting Tableau data
  • Filtering Tableau data Using Size Panel

Project-2: Forecasting the Airline Traffic

Module-6: MongoDB

  • What is NoSQL
  • Why NoSQL
  • Benefits over RDBMS
  • Types of NoSQL Databases
  • Key/Values
  • DataTypes
  • Create and Drop Database
  • Create and Drop Collection
  • Create and Drop Documents
  • Insert Documents
  • Query Document
  • Update Document
  • Limit Records
  • Sort Records

Module-7: Apache Spark

  • Introduction to Spark
  • Spark Advantages over Hadoop
  • Apache Spark Architecture
    • What is worker node
    • What is Driver Program
    • What is Cluster Manager
    • Master Node
    • DAG
  • How Spark is Fault Tolerant
  • What is an RDD
  • Lazy Evaluation
  • Actions, Transformation
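
Lazy evaluation and the split between transformations and actions can be illustrated without a Spark cluster. The sketch below is a plain-Python analogy using the built-in lazy `map` and `filter`, not the Spark API; in PySpark the corresponding ideas are RDD transformations such as `map` and `filter` and actions such as `collect`. The helper `traced_double` and the `log` list are hypothetical names added only to show when work actually happens.

```python
log = []  # records each element as it is actually processed

def traced_double(x):
    log.append(x)
    return x * 2

numbers = [1, 2, 3, 4]

# "Transformations": these only describe the pipeline, nothing runs yet
doubled = map(traced_double, numbers)
over_four = filter(lambda v: v > 4, doubled)
calls_before_action = len(log)

# "Action": consuming the pipeline forces the computation
result = list(over_four)
calls_after_action = len(log)

print(calls_before_action, calls_after_action, result)
```

Nothing is logged until the final `list(...)` call, which mirrors how Spark defers work until an action is requested.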

Module-8: Spark MLlib

  • Introduction to Spark MLlib
  • What is Linear Regression
  • Understand Linear Regression
  • Build a Machine Learning Regression Model with Spark MLlib using PySpark
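
To make the idea of linear regression concrete before moving to Spark MLlib, here is a small pure-Python sketch of a least-squares line fit for one input variable. It does not use MLlib; the function `fit_line` and the sample points are hypothetical and chosen so that the data lies exactly on y = 2x + 1.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept for one feature."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope = covariance of x and y divided by variance of x
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Illustrative points that lie on the line y = 2x + 1
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]

slope, intercept = fit_line(xs, ys)
prediction = slope * 5.0 + intercept  # predict y for x = 5

print(slope, intercept, prediction)
```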