Coursera
Coursera Logo

SAS Institute Of Management Studies - Predictive Modeling with Logistic Regression using SAS 

  • Offered byCoursera

Predictive Modeling with Logistic Regression using SAS
 at 
Coursera 
Overview

Duration

17 hours

Start from

Start Now

Total fee

Free

Mode of learning

Online

Difficulty level

Intermediate

Official Website

Explore Free Course External Link Icon

Credential

Certificate

Predictive Modeling with Logistic Regression using SAS
 at 
Coursera 
Highlights

  • Taught by top companies and universities.
  • Affordable programs and 7 day free trial.
  • Shareable Certificate upon completion.
Details Icon

Predictive Modeling with Logistic Regression using SAS
 at 
Coursera 
Course details

More about this course
  • This course covers predictive modeling using SAS/STAT software with emphasis on the LOGISTIC procedure. This course also discusses selecting variables and interactions, recoding categorical variables based on the smooth weight of evidence, assessing models, treating missing values, and using efficiency techniques for massive data sets. You learn to use logistic regression to model an individual's behavior as a function of known inputs, create effect plots and odds ratio plots, handle missing data values, and tackle multicollinearity in your predictors. You also learn to assess model performance and compare models.

Predictive Modeling with Logistic Regression using SAS
 at 
Coursera 
Curriculum

Course Overview and Logistics

Meet the Instructor

What You Learn in This Course

Learner Prerequisites

Using Forums and Getting Help

Access SAS Software for this Course

Set Up Data for This Course (REQUIRED)

About the Demos and Practices in this Course

Overview

Introduction

Goals of Predictive Modeling

Terms for Elements in Predictive Modeling

Basic Steps of Predictive Modeling

Applications of Predictive Modeling

Demonstration Scenario: Target Marketing for a Bank

Demo: Examining the Code for Generating Descriptive Statistics and Frequency Tables

Introduction

Data Challenges

Analytical Challenges

Separate Sampling

Avoiding the Optimism Bias: Honest Assessment

Splitting the Data for Model Training and Assessment

Demo: Splitting the Data

Summary

Practice: Exploring the Bank Data for the Target Marketing Project

Practice: Exploring the Veterans' Organization Data Used in the Practices

Question 1.01

Question 1.02

Question 1.03

Practice: Splitting the Data

Fitting the Model

Overview

Introduction

Understanding the Logistic Regression Model

Constraining the Posterior Probability Using the Logit Transformation

Understanding the Fitted Surface

Interpreting the Model by Calculating the Odds Ratio

Understanding Logistic Discrimination

Estimating Unknown Parameters Using Maximum Likelihood Estimation

Interpreting Concordant, Discordant, and Tied Pairs

Using PROC LOGISTIC to Fit Logistic Regression Models

Demo: Fitting a Basic Logistic Regression Model, Part 1

Demo: Fitting a Basic Logistic Regression Model, Part 2

Scoring New Cases

Demo: Scoring New Cases

Introduction

Understanding the Effect of Oversampling

Understanding the Offset

Demo: Correcting for Oversampling

Summary

Question 2.01

Question 2.02

Practice: Fitting a Logistic Regression Model

Fitting the Model Review

Preparing the Input Variables, Part 1

Overview

Introduction

Reasons for Missing Data

Complete Case Analysis

Methods for Imputing Missing Values

Missing Value Imputation with Missing Value Indicator Variables

Demo: Imputing Missing Values

Cluster Imputation

Introduction

Problems Caused by Categorical Inputs

Solutions to Problems Caused by Categorical Inputs

Linking to Other Data Sets

Collapsing Categories by Thresholding

Collapsing Categories by Using Greenacre's Method

Demo: Collapsing the Levels of a Nominal Input, Part 1

Demo: Collapsing the Levels of a Nominal Input, Part 2

Replacing Categorical Levels by Using Smoothed Weight-of-Evidence Coding

Demo: Computing the Smoothed Weight of Evidence

Introduction

Problem of Redundancy

Variable Clustering Method

Understanding Principal Components

Divisive Clustering

PROC VARCLUS Syntax

Selecting a Representative Variable from Each Cluster

Demo: Reducing Redundancy by Clustering Variables

Question 3.01

Practice: Imputing Missing Values

Question 3.02

Question 3.03

Question 3.04

Practice: Collapsing the Levels of a Nominal Input

Practice: Computing the Smoothed Weight of Evidence

Question 3.05

Practice: Reducing Redundancy by Clustering Variables

Preparing the Input Variables, Part 2

Introduction

Detecting Nonlinear Relationships

Demo: Performing Variable Screening, Part 1

Demo: Performing Variable Screening, Part 2

Univariate Binning and Smoothing

Demo: Creating Empirical Logit Plots

Remedies for Nonlinear Relationships

Demo: Accommodating a Nonlinear Relationship, Part 1

Demo: Accommodating a Nonlinear Relationship, Part 2

Introduction

Specifying a Subset Selection Method in PROC LOGISTIC

Best-Subsets Selection

Stepwise Selection

Backward Elimination

Scalability of the Subset Selection Methods in PROC LOGISTIC

Detecting Interactions

BIC-based Significance Level

Demo: Detecting Interactions

Demo: Using Backward Elimination to Subset the Variables

Demo: Displaying Odds Ratios for Variables Involved in Interactions

Demo: Creating an Interaction Plot

Demo: Using the Best-Subsets Selection Method

Demo: Using Fit Statistics to Select a Model

Summary of Preparing the Input Variables, Parts 1 and 2

Question 3.06

Practice: Performing Variable Screening

Practice: Creating Empirical Logit Plots

Question 3.07

Question 3.08

Question 3.09

Practice: Using Forward Selection to Detect Interactions

Question 3.10

Practice: Using Backward Elimination to Subset the Variables

Question 3.11

Practice: Using Fit Statistics to Select a Model

Preparing the Input Variables Review

Measuring Model Performance

Overview

Introduction

Fit versus Complexity

Assessing Models when Target Event Data Is Rare

Demo: Preparing the Validation Data

Introduction

Understanding the Confusion Matrix

Measuring Performance across Cutoffs by Using the ROC Curve

Choosing Depth by Using the Gains Chart

Effects of Oversampled Data on Performance Measures

Adjusting a Confusion Matrix for Oversampling

Demo: Measuring Model Performance Based on Commonly-Used Metrics

Introduction

Understanding the Effect of Cutoffs on Confusion Matrices

Understanding the Profit Matrix

Choosing the Optimal Cutoff by Using the Profit Matrix

Using the Central Cutoff

Using Profit to Assess Fit

Calculating Sampling Weights

Demo: Using a Profit Matrix to Measure Model Performance

Introduction

Plotting Class Separation

Assessing Overall Predictive Power

Demo: Using the K-S Statistic to Measure Model Performance

Introduction

Comparing ROC Curves of Several Models"

Demo: Comparing ROC Curves to Measure Model Performance

Using Macros to Compare Many Models

Demo: Comparing and Evaluating Many Models, Part 1

Demo: Comparing and Evaluating Many Models, Part 2

Summary

Question 4.01

Question 4.02

Question 4.03

Practice: Assessing Model Performance

Question 4.04

Question 4.05

Question 4.06

Question 4.07

Measuring Model Performance Review

SAS Certification Practice Exam - Statistical Business Analysis Using SAS®9: Regression and Modeling

About the Certification Exam

Predictive Modeling with Logistic Regression using SAS
 at 
Coursera 
Admission Process

    Important Dates

    May 25, 2024
    Course Commencement Date

    Other courses offered by Coursera

    – / –
    3 months
    Beginner
    – / –
    20 hours
    Beginner
    – / –
    2 months
    Beginner
    – / –
    3 months
    Beginner
    View Other 6715 CoursesRight Arrow Icon
    qna

    Predictive Modeling with Logistic Regression using SAS
     at 
    Coursera 

    Student Forum

    chatAnything you would want to ask experts?
    Write here...