Coursera
Coursera Logo

CSP 571- Data Preparation and Analysis 

  • Offered byCoursera

CSP 571- Data Preparation and Analysis
 at 
Coursera 
Overview

Duration

79 hours

Start from

Start Now

Total fee

Free

Mode of learning

Online

Official Website

Explore Free Course External Link Icon

Credential

Certificate

CSP 571- Data Preparation and Analysis
 at 
Coursera 
Highlights

  • Earn a certificate from Illinois Tech
  • Add to your LinkedIn profile
Details Icon

CSP 571- Data Preparation and Analysis
 at 
Coursera 
Course details

What are the course deliverables?
  • What you'll learn
  • 1. Apply appropriate techniques for generating insights from data.
  • 2. Present actionable solutions with confidence to the business stakeholders.
More about this course
  • This course introduces the necessary concepts and common techniques for analyzing data. The primary emphasis is on the process of data analysis, including data preparation, descriptive analytics, model training, and result interpretation. The process starts with removing distractions and anomalies, followed by discovering insights, formulating propositions, validating evidence, and finally building professional-grade solutions. Following the process properly, regularly, and transparently brings credibility and increases the impact of the results. This course will cover topics including Exploratory Data Analysis, Feature Screening, Segmentation, Association Rules, Nearest Neighbors, Clustering, Decision Tree, Linear Regression, Logistic Regression, and Performance Evaluation. Besides, this course will review statistical theory, matrix algebra, and computational techniques as necessary. This course prepares students ready for and capable of the data preparation and analysis process. Besides developing Python codes for carrying out the process, students will learn to tune the software tools for the most efficient implementation and optimal performance. At the end of this course, students will have built their inventory of data analysis codes and their confidence in advocating their propositions to the business stakeholders.
Read more

CSP 571- Data Preparation and Analysis
 at 
Coursera 
Curriculum

Module 1: Process of Preparing and Analyzing Data

Course Overview

Instructor Introduction

Module 1 Introduction

Why Do We Analyze Data

The Process of Data Analysis - Part 1

The Process of Data Analysis - Part 2

The First Step of Knowing Your Data - Part 1

The First Step of Knowing Your Data - Part 2

The First Step of Knowing Your Data - Part 3

The First Step of Knowing Your Data - Part 4

Syllabus

Data Files

Module 1 Introduction

Big Data and IEEE 754

CRISP-DM2

Selecting the Bin Size of a Time Histogram

Module 1 Summary

Why Do We Analyze Data Quiz

The Process of Data Analysis Quiz

Knowing Your Data Quiz

Module 1 Summative Assessment

Meet and Greet Discussion

Module 1 Python Lab

Module 2: Measure and Visualize Correlation

Module 2 Introduction

Discover and Measure Associations - Part 1

Discover and Measure Associations - Part 2

Measure Associations - Part 1

Measure Associations - Part 1 (Continued)

Measure Associations - Part 2

Measure Associations - Part 2 (Continued)

Module 2 Introduction

Chicago Taxi Trip Data

Correlation with Python

Eta-squared

Module 2 Summary

Correlation of Continuous Features Quiz

Correlation of Mixed Types Features

Means to an End for Feature Screening Quiz

Module 2 Summative Assessment

Module 2 Python Lab

Module 3: Market Basket Analysis

Module 3 Introduction

What is in Your Basket - Part 1

What is in Your Basket - Part 2

How Are Association Rules Discovered - Part 1

How Are Association Rules Discovered - Part 2

What Can Association Rules Tell Me - Part 1

What Can Association Rules Tell Me - Part 2

PGML Chapter 3

Cross-Selling

Apriori Algorithm and Association Rules

Module 3 Summary

Market Basket Analysis Quiz

Association Rules Discovery Quiz

Module 3 Summative Assessment

Module 3 Python Lab

Module 4: Partitioning, Segmenting, and Clustering of Observations

Module 4 Introduction

Partition Observations for Training Models - Part 1

Partition Observations for Training Models - Part 2

Create Segments of Observations for Business Reasons - Part 1

Create Segments of Observations for Business Reasons - Part 2

Put Observations with Similar Feature Values in Clusters - Part 1

Put Observations with Similar Feature Values in Clusters - Part 2

Put Observations with Similar Feature Values in Clusters - Part 3

PGML Chapter 4

Sampling Techniques

RFM

Clustering

Module 4 Summary

Partition Observations for Training Models Quiz

Segments of Observations Quiz

Clustering Quiz

Module 4 Summative Assessment

Module 4 Python Lab

Module 5: Linear Regression

Module 5 Introduction

Linear Regression Model​ - Part 1

Linear Regression Model​ - Part 2

Forward Selection - Part 1

Forward Selection - Part 2

Feature Importance -​ Part 1

Feature Importance -​ Part 2

Feature Importance -​ Part 3

Linear Regression Analysis

Least Squares Regression

Forward and Backward Stepwise Regression

Shapley Values

Module 5 Summary

Linear Regression Model Quiz

Feature Selection Quiz

Feature Importance Quiz

Module 5 Summative Assessment

Module 5 Python Lab

Module 6: Binary Logistic Regression

Module 6 Introduction

Logistic Regression -​ Part 1

Logistic Regression -​ Part 2

Forward Selection

Interpret Model and Assess Performance -​ Part 1

Interpret Model and Assess Performance -​ Part 2

PGML Chapter 6

Predictive Analytics

Forward Selection

Best R-squared for Logistic Regression

Module 6 Summary

Logistic Regression Quiz

Foward Selection Quiz

Blessing and Curse of Too Many Predictors Quiz

Module 6 Summative Assessment

Module 6 Python Lab

Module 7: Decision Trees - The CART Algorithm

Module 7 Introduction

Motivation of Decision Trees -​ Part 1

Motivation of Decision Trees -​ Part 2

The CART Algorithm -​ Part 1

The CART Algorithm -​ Part 2

Cluster Profiling -​ Part 1

Cluster Profiling​ - Part 2

PGML Chapter 5

CART

CART as an Equation

Decision Trees for Clustering

Module 7 Summary

Motivation of Decision Trees Quiz

The CART Algorithm Quiz

Cluster Profiling Quiz

Module 7 Summative Assessment

Module 7 Python Lab

Module 8: Evaluating the Performance of Models

Module 8 Introduction

Prediction Models

Nominal Classification Models

Binary Classification Models -​ Part 1

Binary Classification Models -​ Part 2

Binary Classification Models -​ Part 3

Binary Classification Models -​ Part 4

Binary Classification Models -​ Part 5

PGML Chapter 7, 8

Outliers

ROC Curve

Using Life Analysis

Module 8 Summary

Metrics for Prediction Models Quiz

Metrics for Classification Models Quiz

Charts for Classification Models Quiz

Module 8 Summative Assessment

Module 8 Python Lab

Summative Course Assessment

CSP 571- Data Preparation and Analysis
 at 
Coursera 
Admission Process

    Important Dates

    May 25, 2024
    Course Commencement Date

    Other courses offered by Coursera

    – / –
    3 months
    Beginner
    – / –
    20 hours
    Beginner
    – / –
    2 months
    Beginner
    – / –
    3 months
    Beginner
    View Other 6715 CoursesRight Arrow Icon
    qna

    CSP 571- Data Preparation and Analysis
     at 
    Coursera 

    Student Forum

    chatAnything you would want to ask experts?
    Write here...