CSP 571- Data Preparation and Analysis
- Offered byCoursera
CSP 571- Data Preparation and Analysis at Coursera Overview
Duration | 79 hours |
Start from | Start Now |
Total fee | Free |
Mode of learning | Online |
Official Website | Explore Free Course |
Credential | Certificate |
CSP 571- Data Preparation and Analysis at Coursera Highlights
- Earn a certificate from Illinois Tech
- Add to your LinkedIn profile
CSP 571- Data Preparation and Analysis at Coursera Course details
- What you'll learn
- 1. Apply appropriate techniques for generating insights from data.
- 2. Present actionable solutions with confidence to the business stakeholders.
- This course introduces the necessary concepts and common techniques for analyzing data. The primary emphasis is on the process of data analysis, including data preparation, descriptive analytics, model training, and result interpretation. The process starts with removing distractions and anomalies, followed by discovering insights, formulating propositions, validating evidence, and finally building professional-grade solutions. Following the process properly, regularly, and transparently brings credibility and increases the impact of the results. This course will cover topics including Exploratory Data Analysis, Feature Screening, Segmentation, Association Rules, Nearest Neighbors, Clustering, Decision Tree, Linear Regression, Logistic Regression, and Performance Evaluation. Besides, this course will review statistical theory, matrix algebra, and computational techniques as necessary. This course prepares students ready for and capable of the data preparation and analysis process. Besides developing Python codes for carrying out the process, students will learn to tune the software tools for the most efficient implementation and optimal performance. At the end of this course, students will have built their inventory of data analysis codes and their confidence in advocating their propositions to the business stakeholders.
CSP 571- Data Preparation and Analysis at Coursera Curriculum
Module 1: Process of Preparing and Analyzing Data
Course Overview
Instructor Introduction
Module 1 Introduction
Why Do We Analyze Data
The Process of Data Analysis - Part 1
The Process of Data Analysis - Part 2
The First Step of Knowing Your Data - Part 1
The First Step of Knowing Your Data - Part 2
The First Step of Knowing Your Data - Part 3
The First Step of Knowing Your Data - Part 4
Syllabus
Data Files
Module 1 Introduction
Big Data and IEEE 754
CRISP-DM2
Selecting the Bin Size of a Time Histogram
Module 1 Summary
Why Do We Analyze Data Quiz
The Process of Data Analysis Quiz
Knowing Your Data Quiz
Module 1 Summative Assessment
Meet and Greet Discussion
Module 1 Python Lab
Module 2: Measure and Visualize Correlation
Module 2 Introduction
Discover and Measure Associations - Part 1
Discover and Measure Associations - Part 2
Measure Associations - Part 1
Measure Associations - Part 1 (Continued)
Measure Associations - Part 2
Measure Associations - Part 2 (Continued)
Module 2 Introduction
Chicago Taxi Trip Data
Correlation with Python
Eta-squared
Module 2 Summary
Correlation of Continuous Features Quiz
Correlation of Mixed Types Features
Means to an End for Feature Screening Quiz
Module 2 Summative Assessment
Module 2 Python Lab
Module 3: Market Basket Analysis
Module 3 Introduction
What is in Your Basket - Part 1
What is in Your Basket - Part 2
How Are Association Rules Discovered - Part 1
How Are Association Rules Discovered - Part 2
What Can Association Rules Tell Me - Part 1
What Can Association Rules Tell Me - Part 2
PGML Chapter 3
Cross-Selling
Apriori Algorithm and Association Rules
Module 3 Summary
Market Basket Analysis Quiz
Association Rules Discovery Quiz
Module 3 Summative Assessment
Module 3 Python Lab
Module 4: Partitioning, Segmenting, and Clustering of Observations
Module 4 Introduction
Partition Observations for Training Models - Part 1
Partition Observations for Training Models - Part 2
Create Segments of Observations for Business Reasons - Part 1
Create Segments of Observations for Business Reasons - Part 2
Put Observations with Similar Feature Values in Clusters - Part 1
Put Observations with Similar Feature Values in Clusters - Part 2
Put Observations with Similar Feature Values in Clusters - Part 3
PGML Chapter 4
Sampling Techniques
RFM
Clustering
Module 4 Summary
Partition Observations for Training Models Quiz
Segments of Observations Quiz
Clustering Quiz
Module 4 Summative Assessment
Module 4 Python Lab
Module 5: Linear Regression
Module 5 Introduction
Linear Regression Model - Part 1
Linear Regression Model - Part 2
Forward Selection - Part 1
Forward Selection - Part 2
Feature Importance - Part 1
Feature Importance - Part 2
Feature Importance - Part 3
Linear Regression Analysis
Least Squares Regression
Forward and Backward Stepwise Regression
Shapley Values
Module 5 Summary
Linear Regression Model Quiz
Feature Selection Quiz
Feature Importance Quiz
Module 5 Summative Assessment
Module 5 Python Lab
Module 6: Binary Logistic Regression
Module 6 Introduction
Logistic Regression - Part 1
Logistic Regression - Part 2
Forward Selection
Interpret Model and Assess Performance - Part 1
Interpret Model and Assess Performance - Part 2
PGML Chapter 6
Predictive Analytics
Forward Selection
Best R-squared for Logistic Regression
Module 6 Summary
Logistic Regression Quiz
Foward Selection Quiz
Blessing and Curse of Too Many Predictors Quiz
Module 6 Summative Assessment
Module 6 Python Lab
Module 7: Decision Trees - The CART Algorithm
Module 7 Introduction
Motivation of Decision Trees - Part 1
Motivation of Decision Trees - Part 2
The CART Algorithm - Part 1
The CART Algorithm - Part 2
Cluster Profiling - Part 1
Cluster Profiling - Part 2
PGML Chapter 5
CART
CART as an Equation
Decision Trees for Clustering
Module 7 Summary
Motivation of Decision Trees Quiz
The CART Algorithm Quiz
Cluster Profiling Quiz
Module 7 Summative Assessment
Module 7 Python Lab
Module 8: Evaluating the Performance of Models
Module 8 Introduction
Prediction Models
Nominal Classification Models
Binary Classification Models - Part 1
Binary Classification Models - Part 2
Binary Classification Models - Part 3
Binary Classification Models - Part 4
Binary Classification Models - Part 5
PGML Chapter 7, 8
Outliers
ROC Curve
Using Life Analysis
Module 8 Summary
Metrics for Prediction Models Quiz
Metrics for Classification Models Quiz
Charts for Classification Models Quiz
Module 8 Summative Assessment
Module 8 Python Lab
Summative Course Assessment