Microsoft - Data Engineering with MS Azure Synapse Apache Spark Pools
- Offered byCoursera
Data Engineering with MS Azure Synapse Apache Spark Pools at Coursera Overview
Duration | 7 hours |
Start from | Start Now |
Total fee | Free |
Mode of learning | Online |
Difficulty level | Intermediate |
Official Website | Explore Free Course |
Credential | Certificate |
Data Engineering with MS Azure Synapse Apache Spark Pools at Coursera Highlights
- Flexible deadlines Reset deadlines in accordance to your schedule.
- Shareable Certificate Earn a Certificate upon completion
- 100% online Start instantly and learn at your own schedule.
- Course 6 of 10 in the Microsoft Azure Data Engineering Associate DP-203 Exam Prep Specialization
Data Engineering with MS Azure Synapse Apache Spark Pools at Coursera Course details
- In this course, you will learn how to perform data engineering with Azure Synapse Apache Spark Pools, which enable you to boost the performance of big-data analytic applications by in-memory cluster computing.
- You will learn how to differentiate between Apache Spark, Azure Databricks, HDInsight, and SQL Pools and understand the use-cases of data-engineering with Apache Spark in Azure Synapse Analytics. You will also learn how to ingest data using Apache Spark Notebooks in Azure Synapse Analytics and transform data using DataFrames in Apache Spark Pools in Azure Synapse Analytics. You will integrate SQL and Apache Spark pools in Azure Synapse Analytics. You will also learn how to monitor and manage data engineering workloads with Apache Spark in Azure Synapse Analytics.
Data Engineering with MS Azure Synapse Apache Spark Pools at Coursera Curriculum
Big Data Engineering
Introduction to the course
What is an Apache Spark pool in Azure Synapse Analytics?
How do Apache Spark pools in Azure Synapse Analytics?
Lesson summary
Introduction to spark notebooks
Understand the use-cases for spark notebooks
Run spark notebooks
Lesson summary
Introduction to DataFrames in Spark pools in Azure Synapse Analytics
Load data in a Spark DataFrame
Flatten nested structures and explode arrays with Apache Spark
Lesson summary
Course syllabus
How to be successful in this course
When do you use Apache Spark pools in Azure Synapse Analytics?
Create a spark notebook in Azure Synapse Analytics
Discover supported languages in spark notebooks
Develop spark notebooks
Develop spark notebooks
Run spark notebooks
Load data in Spark notebooks
Load data in Spark notebooks
Save Spark notebooks
Load data into a Spark DataFrame
Create an Apache Spark table
Flatten nested structures and explode arrays with Apache Spark in synapse
Knowledge check
Knowledge check
Knowledge check
Test prep
Query pools and manage workloads in Azure Synapse Analytics
Describe the integration methods between SQL and Spark pools in Azure Synapse Analytics
Understand the use-cases for SQL and Spark pools integration
Authenticate in Azure Synapse Analytics
Externalize the use of Spark pools within Azure Synapse Workspace
Transfer data outside the Synapse workspace using the PySpark connector
Lesson summary
Monitor Spark pools in Azure Synapse Analytics
Optimize Apache Spark jobs in Azure Synapse Analytics
Lesson summary
Transfer data between SQL and Spark pool in Azure Synapse Analytics
Authenticate between Spark and SQL pool in Azure Synapse Analytics
Integrate SQL and Spark pools in Azure Synapse Analytics
Transfer data outside the Synapse workspace using the PySpark connector
Base-line Apache Spark performance with Apache Spark history server in Azure Synapse Analytics
Automate scaling of Apache Spark pools in Azure Synapse Analytics
Knowledge check
Knowledge check
Test prep
Practice Exam on Perform data engineering with Azure Synapse Apache Spark Pools
Course 6 recap
Course wrap up
About the practice exam
Next steps
Course practice exam