Coursera
Coursera Logo

Duke University - Cloud Data Engineering 

  • Offered byCoursera

Cloud Data Engineering
 at 
Coursera 
Overview

Duration

17 hours

Start from

Start Now

Total fee

Free

Mode of learning

Online

Difficulty level

Intermediate

Official Website

Explore Free Course External Link Icon

Credential

Certificate

Cloud Data Engineering
 at 
Coursera 
Highlights

  • Shareable Certificate Earn a Certificate upon completion
  • 100% online Start instantly and learn at your own schedule.
  • Course 3 of 4 in the Building Cloud Computing Solutions at Scale Specialization
  • Flexible deadlines Reset deadlines in accordance to your schedule.
  • Intermediate Level Beginner level Linux and Python skills
  • Approx. 17 hours to complete
  • English Subtitles: English
Read more
Details Icon

Cloud Data Engineering
 at 
Coursera 
Course details

More about this course
  • Welcome to the third course in the Building Cloud Computing Solutions at Scale Specialization! In this course, you will learn how to apply Data Engineering to real-world projects using the Cloud computing concepts introduced in the first two courses of this series. By the end of this course, you will be able to develop Data Engineering applications and use software development best practices to create data engineering applications. These will include continuous deployment, code quality tools, logging, instrumentation and monitoring. Finally, you will use Cloud-native technologies to tackle complex data engineering solutions.
  • This course is ideal for beginners as well as intermediate students interested in applying Cloud computing to data science, machine learning and data engineering. Students should have beginner level Linux and intermediate level Python skills. For your project in this course, you will build a serverless data engineering pipeline in a Cloud platform: Amazon Web Services (AWS), Azure or Google Cloud Platform (GCP).
Read more

Cloud Data Engineering
 at 
Coursera 
Curriculum

Getting Started with Cloud Data Engineering

Instructor Introduction

Course Introduction

Lab Onboarding

Course 3 Project Overview

Introduction to the End of Moore's Law

The Problem with Concurrency in Python

Exploring the End of Moore's Law

Using CUDA and Numba

What is an ASIC?

Taking Advantage of Colab Pro

Introduction to Distributed Systems

Logging and Instrumentation Distributed Systems

CAP Theorem

Amdahl's Law

Elasticity

Highly Available Nine Nine's

Debugging Python Code

Introduction to Big Data

What is Big Data?

The Three V's of Big Data

Data Lakes

Big Data Processing

Data Feedback Loop

Using EMR Spark to Run Parallel Jobs

Specialization Project Roadmap: Course 3

Course Structure and Discussion Etiquette

The End of Moore's Law

Challenges and Opportunities in Distributed Systems

Challenges in Big Data

Creating high-performance code

Week 1 Quiz

Examining Principles of Data Engineering

Introduction to Data Engineering

What is Data Engineering?

Batch vs. Streaming vs. Events

Building CLI Tools with Click

Building Containerized Command-line Tools

Advanced Testing with Amazon CodeGuru

Advanced Testing with AWS CodeBuild

Mapping Functions to CLI: Part 1

Mapping Functions to CLI: Part 2

Extending Command-Line Data Processing Tool

Week 2 Quiz

Building Data Engineering Pipelines

Introduction to Serverless Data Engineering

What is Serverless?

Serverless Concepts: Service Model

Serverless Concepts: Functions

Serverless Concepts: Ecosystem

Build a Serverless Data Pipeline

Serverless Cookbook with AWS

Serverless Cookbook with GCP

Introduction to Data Governance

What is Data Governance?

The Principle of Least Privilege

Cloud Security with IAM on AWS

AWS Shared Security Model

AWS IAM Service

AWS Cloud Security Operations

Encrypt at Rest and Transit

Using Amazon EFS for AWS Lambda

Building Computer Vision Label Trigger for S3

Week 3 Quiz

Applying Key Data Engineering Tasks

Introduction to Extract, Transform, Load (ETL)

What is ETL?

Using Amazon Athena with AWS Glue

Real-World Problems in ETL

Introduction to Cloud Databases

One Size Does Not Fit All in the Cloud?

Using Google BigQuery

Using AWS Aurora Serverless

Using AWS DynamoDB

Using AWS Redshift

Introduction to Cloud Storage

Why Cloud Storage?

Cloud Storage Deep Dive

Using Amazon S3

Cloud Databases

Cloud Storage Solutions

Create a serverless Data Engineering Pipeline

Week 4 Quiz

Cloud Data Engineering
 at 
Coursera 
Admission Process

    Important Dates

    May 25, 2024
    Course Commencement Date

    Other courses offered by Coursera

    – / –
    3 months
    Beginner
    – / –
    20 hours
    Beginner
    – / –
    2 months
    Beginner
    – / –
    3 months
    Beginner
    View Other 6715 CoursesRight Arrow Icon
    qna

    Cloud Data Engineering
     at 
    Coursera 

    Student Forum

    chatAnything you would want to ask experts?
    Write here...