Coursera
Coursera Logo

Modeling Data in the Tidyverse 

  • Offered byCoursera

Modeling Data in the Tidyverse
 at 
Coursera 
Overview

Duration

21 hours

Start from

Start Now

Total fee

Free

Mode of learning

Online

Official Website

Explore Free Course External Link Icon

Credential

Certificate

Modeling Data in the Tidyverse
 at 
Coursera 
Highlights

  • Shareable Certificate Earn a Certificate upon completion
  • 100% online Start instantly and learn at your own schedule.
  • Course 5 of 5 in the Tidyverse Skills for Data Science in R Specialization
  • Flexible deadlines Reset deadlines in accordance to your schedule.
  • Approx. 21 hours to complete
  • English Subtitles: English
Read more
Details Icon

Modeling Data in the Tidyverse
 at 
Coursera 
Course details

More about this course
  • Developing insights about your organization, business, or research project depends on effective modeling and analysis of the data you collect. Building effective models requires understanding the different types of questions you can ask and how to map those questions to your data. Different modeling approaches can be chosen to detect interesting patterns in the data and identify hidden relationships.
  • This course covers the types of questions you can ask of data and the various modeling approaches that you can apply. Topics covered include hypothesis testing, linear regression, nonlinear modeling, and machine learning. With this collection of tools at your disposal, as well as the techniques learned in the other courses in this specialization, you will be able to make key discoveries from your data for improving decision-making throughout your organization.
  • In this specialization we assume familiarity with the R programming language. If you are not yet familiar with R, we suggest you first complete R Programming before returning to complete this course.
Read more

Modeling Data in the Tidyverse
 at 
Coursera 
Curriculum

Modeling Data Basics

Course Textbook

The Purpose of Data Science

Types of Data Science Questions

Data Needs

Number of observations is too small

Dataset does not contain the exact variables you are looking for

Variables in the dataset are not collected in the same year

Dataset is not representative of the population that you are interested in

Some variables in the dataset are measured with error

Variables are confounded

Descriptive and Exploratory Data Analysis

Missing Values

Shape

Identifying Outliers

Evaluating Variables

Evaluating Relationships

Modeling Data Basics Quiz

Inference

Inference

Uncertainty

Random Sampling

Inference Quiz

Linear Modeling

Linear Regression

Assumptions

Association

Association Testing in R

Fitting the Model

Model Diagnostics

Tree Girth and Height Example

Interpreting the Model

Variance Explained

Using broom

Correlation Is Not Causation

Confounding

Linear Regression Quiz

Multiple Linear Regression

Multiple Linear Regression

Multiple Linear Regression Quiz

Beyond Linear Regression

Beyond Linear Regression

Mean Different From Expectation?

Testing Mean Difference From Expectation in R

Hypothesis Testing

More Statistical Tests

Hypothesis Testing

The infer Package

Hypothesis Testing Quiz

Prediction Modeling

Prediction Modeling

What is Machine Learning?

Machine Learning Steps

Data Splitting

Train, Test, Validate

Train

Test

Validate

Variable Selection

Model Selection

Regression vs. Classification

Model Accuracy

Prediction and Machine Learning Quiz

The tidymodels Ecosystem

The tidymodels Ecosystem

Benefits of tidymodels

Packages of tidymodels

Example of Continuous Variable Prediction

Example of Categorical Variable Prediction

tidymodels Quiz

Case Studies

Case Study #1: Predicting Annual Air Pollution

The Data

Data Import

Data Exploration and Wrangling

Evaluate Correlation

Splitting the Data

Making a Recipe

Running Preprocessing

Specifying the Model

Assessing the Model Fit

Model Performance: Getting Predicted Values

Visualizing Model Performance

Quantifying Model Performance

Assessing Model Performance on v -folds Using tune

Random Forest

Model Tuning

Final model performance evaluation

Summary of tidymodels

Summary of tidymodels

Project: Modeling Data in the Tidyverse

Important information before you start the quiz

C?ourse Project Prediction Quiz

Modeling Data in the Tidyverse
 at 
Coursera 
Admission Process

    Important Dates

    May 25, 2024
    Course Commencement Date

    Other courses offered by Coursera

    – / –
    3 months
    Beginner
    – / –
    20 hours
    Beginner
    – / –
    2 months
    Beginner
    – / –
    3 months
    Beginner
    View Other 6715 CoursesRight Arrow Icon
    qna

    Modeling Data in the Tidyverse
     at 
    Coursera 

    Student Forum

    chatAnything you would want to ask experts?
    Write here...