ADB with PYSPARK Course Details
 

Batch Date: Nov 6th @6:30AM

Faculty: Mr. Sameer (10+ Yrs of Exp. & Real Time Expert)

(Leading Faculty in Twin Cities)

Duration : 2 Months

Venue :
DURGA SOFTWARE SOLUTIONS,
Flat No : 202, 2nd Floor,
HUDA Maitrivanam,
Ameerpet, Hyderabad - 500038

Ph.No: +91 - 9246212143, 80 96 96 96 96

Syllabus:

ADB with PYSPARK

Module 1: Cloud Computing Concepts

  • What is the "Cloud"?
  • Why cloud services?
  • Types of cloud models
    • Deployment models
      • Private cloud deployment model
      • Public cloud deployment model
      • Hybrid cloud deployment model
    • Microsoft Azure
    • Amazon Web Services
    • Google Cloud Platform
  • Characteristics of cloud computing
    • On-demand self-service
    • Broad network access
    • Multi-tenancy and resource pooling
    • Rapid elasticity and scalability
    • Measured service
  • Cloud Data Warehouse Architecture
    • Shared Memory architecture
    • Shared Disk architecture
    • Shared Nothing architecture

Module 2: Core Azure Services

  • Core Azure Architectural components
  • Core Azure Services and Products
  • Azure solutions
  • Azure management tools

Module 3: Security, Privacy, Compliance

  • Securing network connectivity
  • Core Azure identity services
  • Security tools and features
  • Azure Governance methodologies
  • Monitoring and reporting
  • Privacy, compliance, and data protection standards

Module 4: Azure Pricing and Support

  • Azure subscriptions
  • Planning and managing costs
  • Azure support options
  • Azure Service Level Agreements (SLAs)
  • Service Lifecycle in Azure

Module 5: Introduction to Azure Databricks

  • Introduction to Databricks
  • Azure Databricks Architecture
  • Azure Databricks Main Concepts

Module 6: Azure Databricks Account Creation

  • Azure Free Account
  • Free Subscription for Azure Databricks
  • Create Databricks Community Edition Account

Module 7: Databricks Cluster Types and Notebook Options

  • Creating and configuring clusters
  • Creating a notebook
  • Quick tour of notebook options

Module 8: Databricks Utilities and Notebook Parameters

  • dbutils commands for files and directories
  • Notebooks and libraries
  • Databricks Variables
  • Widget Types
  • Databricks notebook parameters

Module 9: Databricks CLI

  • Azure Databricks CLI Installation
  • Databricks CLI - DBFS, Libraries and Jobs

Module 10: Databricks Integration with Azure Blob Storage

  • Reading data from Blob Storage and creating a Blob mount point

Module 11: Databricks Integration with Azure Data Lake Storage Gen2

  • Reading files from Azure Data Lake Storage Gen2

Module 12: Databricks Integration with Azure Data Lake Storage Gen1

  • Reading files from Azure Data Lake Storage Gen1

Module 13: Reading and Writing CSV files in Databricks

  • Read CSV Files
  • Read TSV Files and Pipe-Separated CSV Files
  • Read CSV Files with multiple delimiters in Spark 2 and Spark 3
  • Reading CSV files with multiple delimiters at different positions

Module 14: Reading and Writing Parquet files in Databricks

  • Read Parquet files from Data Lake Storage Gen2
  • Reading and Creating Partition files in Spark

Module 16: Parsing Complex JSON Files

  • Reading and Writing JSON Files
  • Reading, Transforming and Writing Complex JSON files

Module 17: Reading and Writing ORC and Avro Files

  • Reading and Writing ORC and Avro Files

Module 19: Databricks Integration with Azure Synapse

  • Reading and Writing Azure Synapse data from Azure Databricks

Module 20: Databricks Integration with Amazon Redshift

  • Reading and Writing data from Redshift using Databricks

Module 21: Databricks Integration with Snowflake

  • Reading and Writing data from Snowflake

Module 22: Databricks Integration with Azure Cosmos DB SQL API

  • Reading and Writing data from an Azure Cosmos DB account

Module 23: Python Introduction

  • Python Introduction
  • Installation and setup
  • Python Data Types for Azure Databricks

Module 24: Python Data Types

  • Deep dive into String Data Types in Python for Azure Databricks
  • Deep dive into Python collections: list and tuple
  • Deep dive into set and dict data types in Python

Module 25: Python Functions and Arguments

  • Python Functions and Arguments
  • Lambda Functions
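
As a taste of the two topics in this module, here is a minimal sketch of a function with a default argument and a lambda passed to map. The function name apply_discount and the sample prices are hypothetical, invented only for illustration; they are not taken from the course material.

```python
# Hypothetical example: the function name and the prices are made up.
def apply_discount(price, percent=10):
    """Return the price reduced by `percent` (a default argument)."""
    return price * (100 - percent) / 100

prices = [200, 50, 120]

# Positional call that relies on the default percent=10.
discounted = [apply_discount(p) for p in prices]

# A lambda is an anonymous one-expression function; here it overrides
# the default with a keyword argument.
half_price = list(map(lambda p: apply_discount(p, percent=50), prices))

# Lambdas are often used as sort keys; this one sorts by descending price.
descending = sorted(prices, key=lambda p: -p)

print(discounted)
print(half_price)
print(descending)
```

The lambda form suits short, throwaway logic; anything longer usually reads better as a named function.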

Module 26: Python Modules and Packages

  • Python Modules and Packages

Module 27: Python Flow Control

  • Python Flow Control
  • For-Each
  • While

Module 28: Python File Handling

  • Python File Handling

Module 29: Python Logging Module

  • Python Logging Module

Module 30: Python Exception Handling

  • Python Exception Handling
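
A minimal sketch of the try / except / else pattern this module covers. The function safe_divide is a hypothetical helper written for illustration, not part of the course material.

```python
# Hypothetical helper: illustrates try / except / else with two handlers.
def safe_divide(numerator, denominator):
    """Divide two numbers, returning None when the denominator is zero."""
    try:
        result = numerator / denominator
    except ZeroDivisionError:
        # Recoverable case: signal "no result" instead of crashing.
        return None
    except TypeError as exc:
        # Re-raise as a clearer error, keeping the original as the cause.
        raise ValueError("both arguments must be numbers") from exc
    else:
        # Runs only when the try block raised nothing.
        return result

print(safe_divide(10, 4))
print(safe_divide(1, 0))
```

Catching specific exception classes, rather than a bare except, keeps unexpected errors visible.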

Module 31: PySpark Introduction

  • PySpark Introduction
  • PySpark Components and Features

Module 32: Spark Architecture and Internals

  • Apache Spark Internal architecture
  • Jobs, stages, and tasks
  • Spark Cluster Architecture Explained

Module 33: Spark RDD

  • Different Ways to create RDD in Databricks
  • Spark Lazy Evaluation Internals & Word Count Program
  • RDD Transformations in Databricks & coalesce vs repartition
  • RDD Transformation and Use Cases

Module 34: Spark SQL

  • Spark SQL Introduction
  • Different ways to create DataFrames

Module 35: Spark SQL Internals

  • Catalyst Optimizer and Spark SQL Execution Plan
  • Deep dive on SparkSession vs SparkContext
  • Spark SQL Basics Part-1
  • RDD Transformation and Use Cases

Module 36: Spark SQL Basics

  • Spark SQL Basics Part-2
  • Joins in Spark SQL

Module 37: Spark SQL Functions and UDFs

  • Spark SQL Functions Part-1
  • Spark SQL Functions Part-2
  • Spark SQL Functions Part-3
  • Spark SQL UDFs
  • Spark SQL Temp tables and Joins

Module 38: Databricks Delta and Implementing Dimensions SCD1 and SCD2

  • Implementing SCD Type 1 with Apache Spark and Databricks Delta
  • Delta Lake in Azure Databricks
  • Implementing SCD Type 2 with and without Databricks Delta

Module 39: Databricks Integration with Azure Data Factory

  • Azure Data Factory Integration with Azure Databricks

Module 40: Databricks Streaming

  • Delta Streaming in Azure Databricks
  • Data Ingestion with Auto Loader in Azure Databricks

Module 41: Azure Databricks Projects

  • Azure Databricks Project-1
  • Azure Databricks Project-2

Module 42: Databricks Integration with Azure DevOps

  • Azure Databricks CI/CD Pipelines