Advanced Data Engineering with Databricks
- Offered byDatabricks
Advanced Data Engineering with Databricks at Databricks Overview
Duration | 16 hours |
Start from | Start Now |
Total fee | ₹1.26 Lakh |
Mode of learning | Online |
Difficulty level | Advanced |
Official Website | Go to Website |
Credential | Certificate |
Advanced Data Engineering with Databricks at Databricks Highlights
- Earn a certificate after completion of course from Databricks
Advanced Data Engineering with Databricks at Databricks Course details
The "Advanced Data Engineering" course is designed for professionals who are looking to deepen their expertise in the field of data engineering and tackle sophisticated data challenges
In this course, students will build upon their existing knowledge of Apache Spark, Structured Streaming, and Delta Lake to unlock the full potential of the data lakehouse by utilizing the suite of tools provided by Databricks
This course places a heavy emphasis on designs favoring incremental data processing, enabling systems optimized to continuously ingest and analyze ever-growing data
Advanced Data Engineering with Databricks at Databricks Curriculum
Day 1
The Lakehouse Architecture
Optimizing Data Storage
Understanding Delta Lake Transactions
Delta Lake Isolation with Optimistic Concurrency
Streaming Design Patterns
Clone for Development and Data Backup
Auto Loader and Bronze Ingestion Patterns
Streaming Deduplication and Quality Enforcement
Slowly Changing Dimensions
Streaming Joins and Statefulness
Day 2
Stored and Materialized Views
Storing Data Securely
Granting Privileged Access to PII
Deleting Data in the Lakehouse
Orchestration and Scheduling with Multi-Task Jobs
Monitoring, Logging, and Handling Errors
Promoting Code with Databricks Repos
Programmatic Platform Interactions (Databricks CLI and REST API)
Managing Costs and Latency with Streaming Workloads