Coursera
Coursera Logo

John Hopkins University - Exploratory Data Analysis 

  • Offered byCoursera

Exploratory Data Analysis
 at 
Coursera 
Overview

Duration

4 hours

Start from

Start Now

Total fee

Free

Mode of learning

Online

Difficulty level

Beginner

Official Website

Explore Free Course External Link Icon

Credential

Certificate

Exploratory Data Analysis
 at 
Coursera 
Highlights

  • Earn a Certificate of completion from Johns Hopkins University on successful course completion
  • Instructors - Roger D. Peng, Jeff Leek, and Brian Caffo
  • Shareable Certificates
  • Self-Paced Learning Option
Read more
Details Icon

Exploratory Data Analysis
 at 
Coursera 
Course details

Skills you will learn
Who should do this course?
  • The course is desigend for those who want to learn exploratory techniques for summarizing data.
More about this course
  • This course covers the essential exploratory techniques for summarizing data. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data. We will cover in detail the plotting systems in R as well as some of the basic principles of constructing data graphics. We will also cover some of the common multivariate statistical techniques used to visualize high-dimensional data.

Exploratory Data Analysis
 at 
Coursera 
Curriculum

Week 1 - This week covers the basics of analytic graphics and the base plotting system in R. We've also included some background material to help you install R if you haven't done so already.

Introduction

Installing R on Windows (3.2.1)

Installing R on a Mac (3.2.1)

Installing R Studio (Mac)

Setting Your Working Directory (Windows)

Setting Your Working Directory (Mac)

Principles of Analytic Graphics

Exploratory Graphs (part 1)

Exploratory Graphs (part 2)

Plotting Systems in R

Base Plotting System (part 1)

Base Plotting System (part 2)

Base Plotting Demonstration

Graphics Devices in R (part 1)

Graphics Devices in R (part 2)

1 practice exercise

Week 2 - Welcome to Week 2 of Exploratory Data Analysis. This week covers some of the more advanced graphing systems available in R: the Lattice system and the ggplot2 system. While the base graphics system provides many important tools for visualizing data, it was part of the original R system and lacks many features that may be desirable in a plotting system, particularly when visualizing high dimensional data. The Lattice and ggplot2 systems also simplify the laying out of plots making it a much less tedious process.

Lattice Plotting System (part 1)

Lattice Plotting System (part 2)

ggplot2 (part 1)

ggplot2 (part 2)

ggplot2 (part 3)

ggplot2 (part 4)

ggplot2 (part 5)

1 practice exercise

Week 3 - Welcome to Week 3 of Exploratory Data Analysis. This week covers some of the workhorse statistical methods for exploratory analysis. These methods include clustering and dimension reduction techniques that allow you to make graphical displays of very high dimensional data (many many variables). We also cover novel ways to specify colors in R so that you can use color as an important and useful dimension when making data graphics. All of this material is covered in chapters 9-12 of my book Exploratory Data Analysis with R.

Hierarchical Clustering (part 1)

Hierarchical Clustering (part 2)

Hierarchical Clustering (part 3)

K-Means Clustering (part 1)

K-Means Clustering (part 2)

Dimension Reduction (part 1)

Dimension Reduction (part 2)

Dimension Reduction (part 3)

Working with Color in R Plots (part 1)

Working with Color in R Plots (part 2)

Working with Color in R Plots (part 3)

Working with Color in R Plots (part 4)

Week 4 - This week, we'll look at two case studies in exploratory data analysis. The first involves the use of cluster analysis techniques, and the second is a more involved analysis of some air pollution data. How one goes about doing EDA is often personal, but I'm providing these videos to give you a sense of how you might proceed with a specific type of dataset.

Clustering Case Study

Air Pollution Case Study

Practical R Exercises in swirl Part 4

Post-Course Survey

Exploratory Data Analysis
 at 
Coursera 
Admission Process

    Important Dates

    May 25, 2024
    Course Commencement Date

    Other courses offered by Coursera

    – / –
    3 months
    Beginner
    – / –
    20 hours
    Beginner
    – / –
    2 months
    Beginner
    – / –
    3 months
    Beginner
    View Other 6715 CoursesRight Arrow Icon

    Exploratory Data Analysis
     at 
    Coursera 
    Students Ratings & Reviews

    5/5
    Verified Icon3 Ratings
    G
    Gautam Dewasi
    Exploratory Data Analysis
    Offered by Coursera
    5
    Learning Experience: eda basics ( how to visualize our data using python ) in ml
    Faculty: their were 1 faulty, he was good speaker updated and structure was :- basics of ml-> eda theory->practical->assisments->test
    Course Support: no
    Reviewed on 17 Sep 2022Read More
    Thumbs Up IconThumbs Down Icon
    S
    Sahil Naik
    Exploratory Data Analysis
    Offered by Coursera
    5
    Learning Experience: The pros are the flexible assignment deadlines and anywhere anytime course material availability. Cons are that some of them allow access to their course material for only a specific duration of time.
    Faculty: The quality of lecture was excellent. The practical knowledge gained from the sessions were really good The resource provided are great and are often more useful
    Course Support: I did not got any carrer support/assistance in this course
    Reviewed on 9 Sep 2022Read More
    Thumbs Up IconThumbs Down Icon
    View All 2 ReviewsRight Arrow Icon
    qna

    Exploratory Data Analysis
     at 
    Coursera 

    Student Forum

    chatAnything you would want to ask experts?
    Write here...