Databricks Data Engineer Associate
- Offered byDatabricks
Databricks Data Engineer Associate at Databricks Overview
Duration | 2 hours |
Total fee | ₹16,432 |
Mode of learning | Online |
Credential | Certificate |
Databricks Data Engineer Associate at Databricks Highlights
- This certification is part of the Data Engineer learning pathway
Databricks Data Engineer Associate at Databricks Course details
- Databricks Lakehouse Platform ? 24% (11/45)
- ELT with Spark SQL and Python ? 29% (13/45)
- Incremental Data Processing ? 22% (10/45)
- Production Pipelines ? 16% (7/45)
- Data Governance ? 9% (4/45)
- The Databricks Certified Data Engineer Associate certification exam assesses an individual?s ability to use the Databricks Lakehouse Platform to complete introductory data engineering tasks
- This includes an understanding of the Lakehouse Platform and its workspace, its architecture, and its capabilities
- It also assesses the ability to perform multi-hop architecture ETL tasks using Apache Spark SQL and Python in both batch and incrementally processed paradigms
Databricks Data Engineer Associate at Databricks Curriculum
Understand how to use and the benefits of using the Databricks Lakehouse Platform and its tools, including:
Data Lakehouse (architecture, descriptions, benefits)
Data Science and Engineering workspace (clusters, notebooks, data storage)
Delta Lake (general concepts, table management and manipulation, optimizations)
Build ETL pipelines using Apache Spark SQL and Python, including:
Relational entities (databases, tables, views)
ELT (creating tables, writing data to tables, cleaning data, combining and reshaping tables, SQL UDFs)
Python (facilitating Spark SQL with string manipulation and control flow, passing data between PySpark and Spark SQL)
Incrementally process data, including:
Structured Streaming (general concepts, triggers, watermarks)
Auto Loader (streaming reads)
Multi-hop Architecture (bronze-silver-gold, streaming applications)
Delta Live Tables (benefits and features)
Build production pipelines for data engineering applications and Databricks SQL queries and dashboards, including:
Jobs (scheduling, task orchestration, UI)
Dashboards (endpoints, scheduling, alerting, refreshing)
Understand and follow best security practices, including:
Unity Catalog (benefits and features)
Entity Permissions (team-based permissions, user-based permissions)