Duke University - Cloud Data Engineering
- Offered by Coursera
Cloud Data Engineering at Coursera Overview
Duration | 17 hours |
Start from | Start Now |
Total fee | Free |
Mode of learning | Online |
Difficulty level | Intermediate |
Credential | Certificate |
Cloud Data Engineering at Coursera Highlights
- Shareable Certificate: Earn a certificate upon completion.
- 100% online: Start instantly and learn on your own schedule.
- Course 3 of 4 in the Building Cloud Computing Solutions at Scale Specialization
- Flexible deadlines: Reset deadlines in accordance with your schedule.
- Intermediate Level: Beginner-level Linux and Python skills
- Approx. 17 hours to complete
- English (Subtitles: English)
Cloud Data Engineering at Coursera Course details
- Welcome to the third course in the Building Cloud Computing Solutions at Scale Specialization! In this course, you will learn how to apply data engineering to real-world projects using the cloud computing concepts introduced in the first two courses of this series. By the end of this course, you will be able to develop data engineering applications using software development best practices, including continuous deployment, code quality tools, logging, instrumentation and monitoring. Finally, you will use cloud-native technologies to build complex data engineering solutions.
- This course is ideal for beginners as well as intermediate students interested in applying cloud computing to data science, machine learning and data engineering. Students should have beginner-level Linux and intermediate-level Python skills. For your project in this course, you will build a serverless data engineering pipeline on a cloud platform: Amazon Web Services (AWS), Azure or Google Cloud Platform (GCP).
Cloud Data Engineering at Coursera Curriculum
Getting Started with Cloud Data Engineering
Instructor Introduction
Course Introduction
Lab Onboarding
Course 3 Project Overview
Introduction to the End of Moore's Law
The Problem with Concurrency in Python
Exploring the End of Moore's Law
Using CUDA and Numba
What is an ASIC?
Taking Advantage of Colab Pro
Introduction to Distributed Systems
Logging and Instrumentation Distributed Systems
CAP Theorem
Amdahl's Law
Elasticity
Highly Available Nine Nine's
Debugging Python Code
Introduction to Big Data
What is Big Data?
The Three V's of Big Data
Data Lakes
Big Data Processing
Data Feedback Loop
Using EMR Spark to Run Parallel Jobs
Specialization Project Roadmap: Course 3
Course Structure and Discussion Etiquette
The End of Moore's Law
Challenges and Opportunities in Distributed Systems
Challenges in Big Data
Creating high-performance code
Week 1 Quiz
Examining Principles of Data Engineering
Introduction to Data Engineering
What is Data Engineering?
Batch vs. Streaming vs. Events
Building CLI Tools with Click
Building Containerized Command-line Tools
Advanced Testing with Amazon CodeGuru
Advanced Testing with AWS CodeBuild
Mapping Functions to CLI: Part 1
Mapping Functions to CLI: Part 2
Extending Command-Line Data Processing Tool
Week 2 Quiz
Building Data Engineering Pipelines
Introduction to Serverless Data Engineering
What is Serverless?
Serverless Concepts: Service Model
Serverless Concepts: Functions
Serverless Concepts: Ecosystem
Build a Serverless Data Pipeline
Serverless Cookbook with AWS
Serverless Cookbook with GCP
Introduction to Data Governance
What is Data Governance?
The Principle of Least Privilege
Cloud Security with IAM on AWS
AWS Shared Security Model
AWS IAM Service
AWS Cloud Security Operations
Encrypt at Rest and Transit
Using Amazon EFS for AWS Lambda
Building Computer Vision Label Trigger for S3
Week 3 Quiz
Applying Key Data Engineering Tasks
Introduction to Extract, Transform, Load (ETL)
What is ETL?
Using Amazon Athena with AWS Glue
Real-World Problems in ETL
Introduction to Cloud Databases
One Size Does Not Fit All in the Cloud?
Using Google BigQuery
Using AWS Aurora Serverless
Using AWS DynamoDB
Using AWS Redshift
Introduction to Cloud Storage
Why Cloud Storage?
Cloud Storage Deep Dive
Using Amazon S3
Cloud Databases
Cloud Storage Solutions
Create a serverless Data Engineering Pipeline
Week 4 Quiz