________________________________________________________________
Do you want to take this course remotely or in person?
Contact us by email: info@nanforiberica.com, phone: +34 91 031 66 78, WhatsApp: +34 685 60 05 91, or contact our offices.
________________________________________________________________
Course DP-3012: Implementing a Data Analytics Solution with Azure Synapse Analytics
This course trains students to work with Apache Spark pools and serverless SQL pools in Azure Synapse Analytics. It also covers data cleansing and ELT processes using Synapse pipelines, which will feel familiar to anyone who has used Azure Data Factory (ADF), to move data into a Synapse dedicated SQL pool database.
Intermediate - Azure, Microsoft Fabric - Data Engineer, Administrator
Target audience
The audience should be familiar with notebooks that use different languages and a Spark engine, such as Databricks, Jupyter Notebooks, Zeppelin Notebooks, and more. They should also have some experience with SQL, Python, and Azure tools, such as Data Factory.
DP-3012 Training Objectives
- Identify the business problems that Azure Synapse Analytics solves.
- Describe the main capabilities of Azure Synapse Analytics.
- Determine when to use Azure Synapse Analytics.
Course content DP-3012
Module 1: Introduction to Azure Synapse Analytics
- What is Azure Synapse Analytics
- How Azure Synapse Analytics works
- When to use Azure Synapse Analytics
- Exercise: Exploring Azure Synapse Analytics
Module 2: Using an Azure Synapse Serverless SQL Pool to Query Files in a Data Lake
- Understanding the capabilities and use cases for Azure Synapse serverless SQL pools
- Querying files using a serverless SQL pool
- Creating external database objects
- Exercise: Querying files using a serverless SQL pool
Module 3: Data Analysis with Apache Spark in Azure Synapse Analytics
- Introduction to Apache Spark
- Using Spark in Azure Synapse Analytics
- Data analysis with Spark
- Data Visualization with Spark
- Exercise: Data Analysis with Spark
Module 4: Using Delta Lake in Azure Synapse Analytics
- Understanding Delta Lake
- Creating Delta Lake Tables
- Creating catalog tables
- Using Delta Lake with Streaming Data
- Using Delta Lake in a SQL pool
- Exercise: Using Delta Lake in Azure Synapse Analytics
Module 5: Data Analysis in a Relational Data Warehouse
- Designing a data warehouse schema
- Creating data warehouse tables
- Loading data warehouse tables
- Querying a data warehouse
- Exercise: Exploring a Data Warehouse
Module 6: Creating a Data Pipeline in Azure Synapse Analytics
- Understanding pipelines in Azure Synapse Analytics
- Creating a pipeline in Azure Synapse Studio
- Defining data flows
- Running a pipeline
- Exercise: Creating a data pipeline in Azure Synapse Analytics
Prerequisites
Familiarity with Azure services and experience with Azure Machine Learning and MLflow are recommended. Additionally, you should have experience performing machine learning-related tasks using Python.
Language
- Course: English / Spanish
- Labs: English / Spanish