Coursera
Coursera Logo

Data for Machine Learning 

  • Offered byCoursera

Data for Machine Learning
 at 
Coursera 
Overview

Duration

12 hours

Total fee

Free

Mode of learning

Online

Difficulty level

Intermediate

Official Website

Explore Free Course External Link Icon

Credential

Certificate

Data for Machine Learning
 at 
Coursera 
Highlights

  • This Course Plus the Full Specialization.
  • Shareable Certificates.
  • Graded Programming Assignments.
Details Icon

Data for Machine Learning
 at 
Coursera 
Course details

More about this course
  • This course is all about data and how it is critical to the success of your applied machine learning model. Completing this course will give learners the skills to:
  • Understand the critical elements of data in the learning, training and operation phases
  • Understand biases and sources of data
  • Implement techniques to improve the generality of your model
  • Explain the consequences of overfitting and identify mitigation measures
  • Implement appropriate test and validation measures.
  • Demonstrate how the accuracy of your model can be improved with thoughtful feature engineering.
  • Explore the impact of the algorithm parameters on model strength
  • To be successful in this course, you should have at least beginner-level background in Python programming (e.g., be able to read and code trace existing code, be comfortable with conditionals, loops, variables, lists, dictionaries and arrays). You should have a basic understanding of linear algebra (vector notation) and statistics (probability distributions and mean/median/mode).
  • This is the third course of the Applied Machine Learning Specialization brought to you by Coursera and the Alberta Machine Intelligence Institute.
Read more

Data for Machine Learning
 at 
Coursera 
Curriculum

What Does Good Data look like?

Introduction to the Course

Business Understanding and Problem Discovery

No Free Lunch Theorem

Exploring the process of problem definition

Data Acquisition and Understanding

Metadata Matters

Dealing with Multimodal Data

Features and transformations of raw data

Identifying Data from Problem

Case Study: Problem from Data

Weekly Summary What does good data look like?

Machine Learning Process Lifecycle Review

Match Data to the needs of the learning Algorithm

Business Understanding and Problem Discovery (BUPD) Review

Data Acquisition and Understanding Review

Module 1 Quiz

Preparing your Data for Machine Learning Success

Data Warehousing

Converting to Useful Forms

Data Quality

How Much Data Do I Need?

Everything has to be Numbers

Types of Data

Aligning Similar Data

Imputing Missing Values

Data Transformations

Weekly Summary: Preparing your Data for Machine Learning Success

Data Cleaning: Everybody's favourite task

Data Warehousing Review

Everything has to be Numbers Review

Types of Data Review

Module 2 Quiz

Feature Engineering for MORE Fun & Profit

What are the simplest Features to try

Useful/Useless Features

How Many Features?

What is Unsupervised Learning

Feature Selection

Feature Extraction

Transfer Learning

Weekly Summary: Feature Engineering for MORE Fun & Profit

Possibilities for Text Features

Word Embeddings

Understanding Features

Building Good Features

Understanding Transfer Learning

Bad Data

Imbalanced Data

Generalization and how machines actually learn

Bias in Data Sources

Bias and variance tradeoff

Outliers

Skewed Distributions

Badness Multipliers

Live Data Danger

Weekly Summary: Bad Data

Mistakes Computers Make

Data: Skewed Distributions

Live Data Dangers

Module 4 Quiz

Other courses offered by Coursera

– / –
3 months
Beginner
– / –
20 hours
Beginner
– / –
2 months
Beginner
– / –
3 months
Beginner
View Other 6716 CoursesRight Arrow Icon
qna

Data for Machine Learning
 at 
Coursera 

Student Forum

chatAnything you would want to ask experts?
Write here...