Measures of Dispersion: Range, IQR, Variance, Standard Deviation

Measures of Dispersion: Range, IQR, Variance, Standard Deviation

4 mins read16.8K Views 1 Comment
Vikram
Vikram Singh
Assistant Manager - Content
Updated on Feb 3, 2023 16:28 IST

Introduction:

To describe the data, a measure of the central tendency is not just enough as it only gives information about the central values of the dataset. 

2022_01_Measures-of-Dispersion.jpg

It may be possible that the mean of the two datasets is equal but they are entirely different datasets.

In this article, we will discuss Measures of Dispersion.  

Table of Content

Recommended online courses

Best-suited Statistics for Data Science courses for you

Learn Statistics for Data Science with these high-rated online courses

Free
12 weeks
– / –
12 weeks
– / –
10 days
Free
12 weeks
– / –
8 weeks
– / –
10 days
– / –
12 weeks
– / –
10 days

Dispersion

Dispersion or variability describes how items are distributed (scattered) from each other and the center of a distribution.

Example: Height of students

In statistics, dispersion helps to understand the distribution of the data.

Measures of Dispersion

Statistical methods that help to know about the distribution or the spread of the data points in the datasets are known as Measures of Dispersion.

There are 4 methods to measures the dispersion of the data:

  • Range
  • Interquartile Range
  • Variance
  • Standard Deviation

2022_01_Measures-of-Dispersion.jpg

Range

The range is the easiest measure of dispersion. It is simply calculated by subtracting the highest value from the lowest value.

Range = Highest Value – Lowest Value

Let’s understand this by an example:

Problem Statement:

Let there be 5 students in the class having heights of 150cm, 160cm, 175cm, 190cm and 200 cm.

Calculate the range of heights?

Range = 200cm – 150cm 

Hence, Range = 50cm

2022_01_Range.jpg

Before defining the interquartile range, let’s discuss the quartiles and five-number summary

Quartiles: Quartiles divide the set into 4 equal parts.

There are three quartiles Q1, Q2 and Q3, where Q2 is the median of the distribution.

Five number summary:

Every dataset can be described using these 5 numbers

  • Lowest value
  • Q1: 25 percentile
  • Q2: Median
  • Q3: 75 Percentile
  • Highest Value

Interquartile Range 

Interquartile Range: Interquartile range is defined as the range between 75 percentile (Q3) and 25 percentile (Q1).

IQR = Q3 – Q1

Let’s understand Q1, Q2, Q3 and the Interquartile range by an example.

Problem Statement:

Let there are 8 numbers between 10 and 90 which are equally distributed.

Define the five-number summary and find the Interquartile Range?

  • Lowest value : 10
  • Q1 (25 percentile) : 25
  • Q2 (50 percentile) : 50
  • Q3 (75 percentile) : 75
  • Highest value : 90
  • Interquartile Range(IQR) = Q3 – Q1 = 75 – 25 = 50

Interquartile Range = 50

2022_01_Interquartile-range.jpg

Variance

  • defined as the average of squared difference from the mean.
  • measures how far each data point in datasets from the mean. 

To know more about the mean, read the article on Measures of Central Tendency.

Formula:

Population variation:

2022_01_variance-population-formula.jpg

Sample Variation:

2022_01_variance-sample-formula.jpg

To know more about Population and Sample, read the article.

Standard Deviation

Standard deviation is the square root of the variance.

Formula:

Population Standard Deviation

2022_01_standard-deviation-population-formula.jpg

Sample Standard Deviation
2022_01_standard-deviation-sample-formula.jpg

Now, let’s understand both by an example,

Problem Statement:

Let there be 5 students of height 1m, 2m, 3m, 4m and 5m.

Calculate the Standard Deviation.

2022_01_variance-and-Standard-Deviation.jpg

2022_01_variance-and-Standard-Deviation-problem.jpg

Note: A lower Standard Deviation indicates data points in datasets are dense near the mean.

In a normal distribution,  

  • 68% of value lies in 1st Standard Deviation
  • 95% of value lies in 2nd Standard Deviation
  • 99.7% of value lies in 3rd Standard Deviation

2022_01_MicrosoftTeams-image-1.jpg

To read more about Normal Distribution, read the article on Probability Distribution.

Which measure of dispersion is the best?

Standard Deviation is considered as the best measure of dispersion as,

  • Help to make comparison between the distribution of two or more different datasets
  • Based on all values
  • Capable of further algebraic treatment

Conclusion

In this article we briefly mention the Measures of Dispersion: with an example. Hope this article help in understanding the concepts easily.

————————————————————————————————————–
If you have recently completed a professional course/certification, click here to submit a review.

Frequently Ask Question (FAQ)

Ques 1. What is Dispersion?

Ans 1. Dispersion or variability describes how items are distributed (scattered) from each other and the center of a distribution.

Example: Height of students

Ques 2. What is Measure of Dispersion?

Ans 2. Statistical methods that help to know about the distribution or the spread of the data points in the datasets are known as Measures of Dispersion.

There are 4 methods to measures the dispersion of the data:

  • Range
  • Interquartile Range
  • Variance
  • Standard Deviation

Ques 3. Which measure of dispersion is best?

Ans 3. Standard Deviation is considered as the best measure of dispersion as,

  • Help to make comparison between the distribution of two or more different datasets
  • Based on all values
  • Capable of further algebraic treatment

Top Trending Articles in Statistics:

Skewness In Statistics | Statistics Interview Question | Basics Of Statistics | Measure Of Central Tendency | Probability Distribution | Inferential Statistics | Measure Of Dispersion | Introduction To Probability | Bayes Theorem | P-Value | Z-Test | T-Test | Chi-Square Test | Outliers In Python | Sampling and Resampling | Regression Analysis In Machine Learning | Gradient Descent | Normal Distribution | Poisson Distribution | Binomial Distribution | Covariance And Correlation | Conditional Probability | Central Limit Theorem

FAQs

What is Dispersion?

Dispersion or variability describes how items are distributed (scattered) from each other and the center of a distribution. Example: Height of students

What is Measure of Dispersion?

Statistical methods that help to know about the distribution or the spread of the data points in the datasets are known as Measures of Dispersion. There are 4 methods to measures the dispersion of the data: 1. Range 2. Interquartile Range 3. Variance 4. Standard Deviation

Which measure of dispersion is best?

Standard Deviation is considered as the best measure of dispersion as, 1. Help to make comparison between the distribution of two or more different datasets 2. Based on all values 3. Capable of further algebraic treatment

About the Author
author-image
Vikram Singh
Assistant Manager - Content

Vikram has a Postgraduate degree in Applied Mathematics, with a keen interest in Data Science and Machine Learning. He has experience of 2+ years in content creation in Mathematics, Statistics, Data Science, and Mac... Read Full Bio

Comments

(1)

Why is the denominator of variance 5-1? I think it should be 5.

Reply to Lucifer Morning star