Subscribe and Access : 5200+ FREE Videos and 21+ Subjects Like CRT, SoftSkills, JAVA, Hadoop, Microsoft .NET, Testing Tools etc..
Batch
Date: Nov
24th @ 7:00AM
Faculty: Mrs. Sasmitha
Duration: 45 Days
Venue
:
DURGA SOFTWARE SOLUTIONS at Maitrivanam
Plot No : 202,
IInd Floor ,
HUDA Maitrivanam,
Ameerpet, Hyderabad-500038.
Ph.No: +91 - 9246212143, 80 96 96 96 96
Syllabus:
DATA ANALYST with Python, Pandas, Plotly Mongodb, Tableau and Apache Spark
Module-1: Core Python
- Anaconda Python Installation
- Jupyter Notebook Tutorial
- Introduction to Python
- Variable,Comments
- Function
- Lambda Expression
- If Else, While, For Loop
- Taking Input from Keyboard
- List
- Tuple
- Set
- Dictionary
- Coding Test-2
Module-2: Pandas
- Creating Data Frames
- Reading and Writing into Semi Structured Data(CSV,EXCEL,JSON,XML) using DataFrames
- Selection
- Conditional Selection
- Groupby
- Sort values
- Pivot Table
- Merging (left, right, inner, outer)
- Joining
- Concat
- Renaming, droping values
- Series
- Applying Statistical Functions on data
- Assignment-3
Module-3: Data Visualization Using-Plotly
- Scatter plot
- Line Plot
- Spread Chart
- Box plot
- Violin Plot
- Swarm Plot
- Strip Plot
- Maps
- Sunburst Charts
- Bar chart
- Histogram
- Rug plot
- Customizing Plots
- Hover Mode, Size, Color
- Setting Range of Axis
- Rotating axis
Project -1: Creating Covid-19 Dash Board
Module-4: Time Series
- Resampling
- Time shifting
- Rolling,expand
- Trend
- Seasonality
- Time Series Decomposition
- Simple Moving Average
Module-5: Tableau
- Getting Started with Tableau Software
- Connecting your data to Tableau
- Creating line,bar,area,scatter charts etc.
- Tableau Calculation - Sum,AVG and Aggregated features
- Applying new Data Calculation to your Visualization
- Formatting tools and menus
- Sorting Tableau data
- Filtering Tableau data Using Size Panel
Project-2: Forecasting the Airline Traffic
Module-6: Mongodb
- What is NoSQL
- Why NoSql
- Benefits over RDBMS
- Types of NoSQL Databases
- Key/Values
- DataTypes
- Create and Drop DataBase
- Create and Drop Collection
- Create and Drop Documents
- Insert Documents
- Query Document
- Update Document
- Limit Records
- Sort Records
Module-7: Apache Spark
- Introduction to Spark
- Spark Advantages over hadoop
- Apache Spark Architecture
- What is worker node
- What is Driver Program
- What is Cluster Manager
- Master Node
- DAG
- How Spark is Fault Tolerant
- What is a RDD
- Lazy Evaluation
- Actions, Transformation
Module-8: Spark MLlib
- Introduction to Spark MLlib
- What is Linear Regression
- Understand Linear Regression
- Build Machine learning Regression model using Spark MLlib using PySpark