Mean Formula: Understanding the Basics of Central Tendency

Mean Formula: Understanding the Basics of Central Tendency

7 mins read324 Views Comment
Vikram
Vikram Singh
Assistant Manager - Content
Updated on May 22, 2024 13:36 IST

In this article, learn how to calculate the mean using the mean formula. The mean is a measure of central tendency that is widely used in statistics to describe the average value of a set of numbers. Discover the steps involved in computing the mean, as well as its strengths and limitations as a statistical measure.

2023_04_Feature-Image-Templates-11.jpg

Let us consider you are a teacher, and you have to find the average height or average score of all the students in a class, or you are a sales manager, and you have to find the average sales by all the employees for the last year. Then how will you calculate it?

Don’t worry; in statistics, there is a Mean statistics formula that will help you to get the answer to your above queries. It can also help you to make decisions or predictions.

This article will explore the concept of mean statistics, formulas to find the mean, its different types, calculation methods, some real-life applications, and its limitations.

Must Check: Free Statistics for Data Science Online Courses & Certifications

Must Check: Free Maths for Data Science Courses Online

So, without further delay, let’s get started.

Table of Content

Let’s start with the formal definition of Mean.

What is a Mean?

The mean in statistics refers to the measure of central tendency representing the average value of a numerical data set. It is a useful tool to summarize (or describe) a dataset and to detect patterns( or trends) in the data. 

Apart from summarizing and detecting patterns, it is used in finance, economics, and healthcare for decision-making.

It is defined as the ratio of the sum of all observations to the total number of observations.

Must Read: What is Maths for Data Science? A Beginner Guide to learn Maths for Data Science in 2023

Statistics Interview Questions for Data Scientists
Measures of Dispersion: Range, IQR, Variance, Standard Deviation
Introduction to Inferential Statistics

Mean Formula

Mean = Sum of all Observations / Total Number of Data Points (Observations)

Let’s take a simple example to calculate the mean of the data:

Example-1: Let the score of five students in a class is 10, 15, 20, 25, and 30. Find the mean score of the class.

Solution: 

Sum of all Observations = 10 + 15 + 20 + 25 + 30 = 100

Total Number of Observations = 5

=> Mean = Sum of all Observations / Number of Observations = 100 / 5 = 20

Hence, the Mean Score = 20

Whenever the mean is referred to in the context of statistics, it always means Arithmetic Mean. But two more types of mean exist, i.e., Geometric and Harmonic Mean.

Difference Between Covariance and Correlation
Standard Error vs. Standard Deviation
Difference between Median and Average

The next section will explore different types of mean in statistics.

Recommended online courses

Best-suited Statistics for Data Science courses for you

Learn Statistics for Data Science with these high-rated online courses

Free
12 weeks
– / –
12 weeks
– / –
10 days
Free
12 weeks
– / –
8 weeks
– / –
10 days
– / –
12 weeks
– / –
10 days

Types of Mean

There are 3 different types of Mean in Statistics: Arithmetic Mean, Geometric Mean, and Harmonic Mean.

We have already discussed arithmetic mean (the above formula and definition of mean), so now, we will explain only geometric and Harmonic Mean.

What is Geometric Mean?

A statistical measure which is used to find the central tendency of a set of values. Mainly it is useful for calculating the average rate of change. It can also provide a more accurate representation of central tendency when the data is skewed or has outliers.

Dissimilar to the arithmetic mean, it is calculated by multiplying all the data points (n-value) and then taking the n-th root of it.

Geometric Mean Formula

If x1, x2 , x3 , …, xn are n-data points, then

G.M = ( x1 * x2 * x3 * …. * xn )1/n

What is Harmonic Mean?

Similar to the arithmetic mean, it is a type of average that is defined as the reciprocal of the average of the reciprocal of the data values.

Confused!!

Let’s get the mathematical formula that will help you out.

Harmonic Mean Formula

If x1, x2 , x3 , …, xn are n-data points, then

HM = n / ( (1/x1) + (1/ x2 ) + (1/ x3 ) + …. + (1/ xn ))

Now, let’s take an example to find the harmonic mean value.

Example: Find the Harmonic Mean for the data: 2, 4, and 8.

Solution: First, take the reciprocal of all three data points and add them, i.e.

1/2 + 1/4 + 1/8 =  (4 + 2 + 1)/8 = ⅞

=> HM = 3 / (⅞) = 3 * (8/7) = 24/7 = 3.42

Hence, the Harmonic Mean for the data: 2, 4, and 8 is 3.42.

Difference between Median and Average
Nominal vs. Ordinal
Difference between Variance and Standard Deviation

Relation Between Arithmetic Mean, Geometric Mean, and Harmonic Mean

Harmonic Mean <= Geometric Mean <= Arithmetic Mean

How to Calculate the Mean

To make it simple and understandable, we divide the data values (or dataset) into Ungrouped and Grouped data.

Mean for Ungrouped Data

Ungrouped data is raw data that is not organized or sorted into categories or intervals.

Example:

  • Height of student in a class: 150cm, 175cm, 130cm, 145cm, 160cm.
  • Marks of students: 10, 14, 15, 13, 12, 8, 7, 10.

To calculate the mean of ungrouped data, you have to follow three simple steps:

  • Add all data values
  • Count the number of data values
  • Divide the sum of data values by the number of data values.

It is as simple as you have done in the very first example.

Isn’t it??

Now, it’s time to take an example to understand better how to calculate the mean of ungrouped data.

Example: Calculate the mean score of 10 students: 

If the marks of 10 students in a math exam are: 85, 78, 92, 89, 93, 87, 81, 90, 95, 88.

Then, calculate the mean score.

Solution: 

Sum of Data Values = 85 + 78 + 92 + 89 + 93 + 87 + 81 + 90 + 95 + 88 = 878

Number of Data Values = 10

Mean = Sum of Data Values / Number of Data Values = 878/10 = 87.8

Hence, the mean score of the 10 students in the mathematics exam is 87.8.

Difference between Correlation and Regression
Correlation vs Causation
Difference between Accuracy and Precision

Mean for Grouped Data

Grouped Data is a set of data values that have been grouped into categories or intervals.

Example: 

  • Sales in Each Quarter
  • The number of hours workers work per week can be grouped into intervals of 0-8 hours, 8-12 hours, and 12-16 hours.

To calculate the mean of the grouped data, you have to follow these simple steps:

  • If the data values (xi) are given in intervals, find the midpoint ((upper-class limit + lower-class limit)/2).
  • Multiply data values (or mid-points) by their corresponding frequency (fi), i.e., xi*fi
  • Add up all products from step-2, i.e., x1*f1 + x2*f2 + … + xn*fn.
  • The mean is calculated by dividing the above result by the number of observations (f1 + f2 + f3 + … + fn). 

Now, let’s take an example to understand better how to calculate the mean of grouped data.

Example: Consider the below data on the number of hours that students study per week.
Class Interval 0 – 4 4 – 8 8 – 12 12 – 16
Frequency 10 20 15 5

Find the mean.

Solution:

Class Interval Hour (xi) Frequency (fi) xi * fi
0 – 4 (4 + 0) / 2 = 2 10 2 * 10 = 20
4 – 8 (8 + 4) / 2 = 6  20 6 * 20 = 120
8 – 12 (8 + 12) / 2 = 10 15 10 * 15 = 150
12 – 16 (12 + 16) / 2 = 14 5 14 * 5 = 70

=> ∑xi* fix1*f1 + x2*f2 + x3*f3+ x4*f4 = 20 + 120 + 150 + 70 = 360 

and, ∑fi =  f1 + f2 + f3 + f4 =  10 + 20 + 15 + 5 = 50

=>  Mean = ∑xi* fi / ∑fi = 360 / 50 = 7.2

Hence, the mean number of hours students’ study per week is 7.2.

Real-life Application of Mean

Mean is used in different domains such as Finance, Medicine, Quality Control, and Sports to calculate average return on investment, to describe the average value of a group of measurements, to determine the average performance of a manufacturing process, and to calculate the average performance of the team or an individual player respectively.

Limitations of Mean

  • Influenced by Sample Size: Mean may not be a good estimate of the population mean if the sample size is small.
  • Sensitive to Outliers: Mean is extremely sensitive to outliers. Even a single outlier can significantly affect the value of the mean.
  • Inappropriate by Ordinal Data: Mean is inappropriate for the data where the values have a natural order but not numerical values. 
  • Limited Usefulness for Skewed Data: The mean is not a robust measure of central tendency for skewed distributions. If the data is highly skewed, the mean may not accurately reflect the typical value of the data.

Conclusion

In conclusion, the mean (arithmetic mean) formula is one of the fundamental tools in statistics, which is used to calculate the average value of the data points. While the mean is a useful measure of central tendency, it has limitations too. As it is highly vulnerable to outliers and skewed distributions. Therefore, it is essential to understand the importance of the mean formula.
Hope you will like the article.
Happy Learning!!

 

Understand the Basics of P-value
z-test : Definition and Example
t-test : Definition and Example
Chi-Square Test: Definition and Example
Central Limit Theorem
Probability Density Function: Definition, Properties, and Application
Top 10 Probability Questions Asked in Interviews
PDF vs. CDF: Difference Between PDF and CDF
Decoding Probability Formulas: Understand Chance and Uncertainty
All About Probability Mass Function
Total Probability Theorem: Definition, Example, and Applications
Probability Distributions used in Data Science
Normal Distribution: Definition and Examples
Poisson Distribution: Definition and Examples
Binomial Distribution: Definition and Examples
Difference between Permutation and Combination
Difference between Eigenvalue and Eigenvector
All about Symmetric Matrix
About the Author
author-image
Vikram Singh
Assistant Manager - Content

Vikram has a Postgraduate degree in Applied Mathematics, with a keen interest in Data Science and Machine Learning. He has experience of 2+ years in content creation in Mathematics, Statistics, Data Science, and Mac... Read Full Bio