John Hopkins University - Foundations of Data Science: K-Means Clustering in Python

4.0 /5

(1 Rating)

Offered byCoursera

Foundations of Data Science: K-Means Clustering in Python
at
Coursera
Overview

Duration	29 hours
Total fee	Free
Mode of learning	Online
Difficulty level	Beginner
Official Website	Explore Free Course
Credential	Certificate

Foundations of Data Science: K-Means Clustering in Python
at
Coursera
Highlights

Earn a shareable certificate upon completion.
Flexible deadlines according to your schedule.
Earn a certificate from the University of London upon completion of course.

Foundations of Data Science: K-Means Clustering in Python
at
Coursera
Course details

Skills you will learn

Retail Retail marketing Python Data Science Marketing C Data structures Statistics Data analysis Big Data

More about this course

Organisations all around the world are using data to predict behaviours and extract valuable real-world insights to inform decisions. Managing and analysing big data has become an essential part of modern finance, retail, marketing, social science, development and research, medicine and government.
This MOOC, designed by an academic team from Goldsmiths, University of London, will quickly introduce you to the core concepts of Data Science to prepare you for intermediate and advanced Data Science courses. It focuses on the basic mathematics, statistics and programming skills that are necessary for typical data analysis tasks.
You will consider these fundamental concepts on an example data clustering task, and you will use this example to learn basic programming skills that are necessary for mastering Data Science techniques. During the course, you will be asked to do a series of mathematical and programming exercises and a small data clustering project for a given dataset.

Foundations of Data Science: K-Means Clustering in Python
at
Coursera
Curriculum

Week 1: Foundations of Data Science: K-Means Clustering in Python

Welcome and Introduction

Introduction to Data Science

What is Data?

Types of Data

Machine Learning

Supervised vs Unsupervised Learning

K-Means Clustering

Preparing your Data

A Real World Dataset

Types of Data ? Review Information

Supervised vs Unsupervised ? Review Information

K-Means Clustering ? Review Information

Week 1 Summative Assessment

Week 2: Means and Deviations in Mathematics and Python

2.0: Week 2 Introduction

2.1 ? Introduction to Mathematical Concepts of Data Clustering

2.2 ? Mean of One Dimensional Lists

2.3 ? Variance and Standard Deviation

2.4 Jupyter Notebooks

2.5 Variables

2.6 Lists

2.7 Computing the Mean

2.8 Better Lists: NumPy

2.9 Computing the Standard Deviation

Week 2 Conclusion

Population vs Sample, Bias

Variability, Standard Deviation and Bias

Python Style Guide

Numpy and Array Creation

Population vs Sample ? Review Information

Mean of One Dimensional Lists ? Review Information

Variance and Standard Deviation ? Review Information

Jupyter Notebooks ? Review Information

Variables ? Review Information

Lists ? Review Information

Computing the Mean ? Review Information

Better Lists ? Review Information

Computing the Standard Deviation ? Review Information

Week 2 Summative Assessment

Week 3: Moving from One to Two Dimensional Data

Week 3 Introduction

3.1 Multidimensional Data Points and Features

3.2 Multidimensional Mean

3.3 Dispersion: Multidimensional Variables

3.4 Distance Metrics

3.5 Normalisation

3.6 Outliers

3.7 Basic Plotting

3.7a Storing 2D Coordinates in a Single Data Structure

3.8 Multidimensional Mean

3.9 Adding Graphical Overlays

3.10 Calculating the Distance to the Mean

3.11 List Comprehension

3.12 Normalisation in Python

3.13 Outliers and Plotting Normalised Data

Week 3 Conclusion

Multidimensional Data Points and Features Recap

Multidimensional Mean Recap

Multidimensional Variables Recap

Distance Metrics Recap

Normalisation Recap

Note on Matplotlib

Matplotlib Scatter Plot Documentation

Matplotlib Patches Documentation

List Comprehension Documentation

3.12 Errata

Multidimensional Data Points and Features ? Review Information

Multidimensional Mean ? Review Information

Dispersion: Multidimensional Variables ? Review Information

Distance Metrics ? Review Information

Normalisation ? Review Information

Outliers ? Review Information

Basic Plotting ? Review Information

Storing 2D Coordinates ? Review Information

Multidimensional Mean ? Review Information

Adding Graphical Overlays ? Review Information

Calculating Distance ? Review Information

List Comprehension ? Review Information

Normalisation in Python ? Review Information

Outliers ? Review Information

Week 3 Summative Assessment

Week 4: Introducing Pandas and Using K-Means to Analyse Data

Week 4 Introduction

4.1: Using the Pandas Library to Read csv Files

4.1a: Sorting and Filtering Data Using Pandas

4.1b: Labelling Points on a Graph

4.1c: Labelling all the Points on a Graph

4.2: Eyeballing the Data

4.3: Using K-Means to Interpret the Data

Week 4: Conclusion

Week 4 Code Resources

Pandas Read_CSV Function

Other courses offered by Coursera

Databases and SQL for Data Science with Python

IBM - Institute of Business ManagementCertificate

Total Fees

– / –

Duration

3 months

Difficulty level

Beginner

Databases and SQL for Data Science with Python

IBM - Institute of Business ManagementCertificate

Total Fees

– / –

Duration

20 hours

Difficulty level

Beginner

Skills

Python RDBMS

Learn SQL Basics for Data Science Specialization

University of California, DavisCertificate

Total Fees

– / –

Duration

2 months

Difficulty level

Beginner

Skills

Data analysis MySQL Apache

Machine Learning for Marketing Specialization

CourseraCertificate

Total Fees

– / –

Duration

3 months

Difficulty level

Beginner

Skills

Data analysis

View Other 6719 Courses

Foundations of Data Science: K-Means Clustering in Python
at
Coursera
Students Ratings & Reviews

4/5

1 Rating

3-4
1

Harsha Veena

Foundations of Data Science: K-Means Clustering in Python

Offered by Coursera

Learning Experience: Explanation of mathematical version of kmeans clustering

Faculty: Instructors taught well Curriculum was relevant and comprehensive

Course Support: No career support provided

Reviewed on 21 May 2022Read More

View 1 Review

Foundations of Data Science: K-Means Clustering in Python

Coursera

Student Forum

Anything you would want to ask experts?

Write here...

Data ScienceData Science BasicsPython for data scienceFoundations of Data Science: K-Means Clustering in Python

Useful Links

Know more about Coursera

All About Coursera

Courses 2025

Reviews on Placements, Faculty & Facilities

Know more about Programs

Data Science Course, Certification, Degree, Fees, Admission, Career, Syllabus

Data Exploration

Deep Learning and Neural Networks

John Hopkins University - Foundations of Data Science: K-Means Clustering in Python

Foundations of Data Science: K-Means Clustering in Python at Coursera Overview

Foundations of Data Science: K-Means Clustering in Python at Coursera Highlights

Foundations of Data Science: K-Means Clustering in Python at Coursera Course details

Foundations of Data Science: K-Means Clustering in Python at Coursera Curriculum

Other courses offered by Coursera

Databases and SQL for Data Science with Python

Databases and SQL for Data Science with Python

Learn SQL Basics for Data Science Specialization

Machine Learning for Marketing Specialization

Foundations of Data Science: K-Means Clustering in Python at Coursera Students Ratings & Reviews

Student Forum

Useful Links

Know more about Coursera

Know more about Programs

Foundations of Data Science: K-Means Clustering in Python
at
Coursera
Overview

Foundations of Data Science: K-Means Clustering in Python
at
Coursera
Highlights

Foundations of Data Science: K-Means Clustering in Python
at
Coursera
Course details

Foundations of Data Science: K-Means Clustering in Python
at
Coursera
Curriculum

Foundations of Data Science: K-Means Clustering in Python
at
Coursera
Students Ratings & Reviews