All about Manhattan Distance

3 mins read3.1K Views Comment

Updated on Dec 5, 2022 11:36 IST

In Machine Learning Algorithms, we use distance metrics such as Euclidean, Manhattan, Minkowski, and Hamming.
In this article, we will briefly discuss one such metric, i.e., Manhattan Distance.

Different distance metrics are used in the machine-learning model; These metrics are the foundation of different machine-learning algorithms, whether it is a supervised (k-nearest neighbor) or unsupervised learning (k-mean clustering) algorithm. This article will discuss one such distance metric, i.e., Manhattan Distance Metric.

Must Check: Machine Learning Online Courses and Certification

Table of Content

Types of Distance Matrices in Machine Learning
Manhattan Distance
Example
Properties of Manhattan Distance
Conclusion

In k-mean clustering and k-Nearest Neighbor Algorithm, while creating clusters, you have to find the value of k and the data points that are closed enough to be considered as nearest neighbors; we use different distance metrics like Euclidean, Manhattan, Minkowski, or Hamming.

Recommended online courses

Best-suited Data Science courses for you

Learn Data Science with these high-rated online courses

Discontinued (July 2024)- Post Graduate Program in Business Analytics and Intelligence (PGP-BA&I)

Amity OnlineCertificate

Total Fees

– / –

Duration

12 months

Master of Computer Applications with specialization in Machine Learning and Artificial Intelligence (Online MCA)

Amity OnlineDegree

Total Fees

₹1.7 L

Duration

2 years

Certification in Data Science

MIT School of Distance EducationCertificate

Total Fees

₹80 K

Duration

4 months

Post Graduate Diploma in Big Data Science & Big Data Analysis

IIMT AhmedabadDiploma

Total Fees

₹1.18 L

Duration

12 months

Master of Science (Data Science)

Chandigarh University (CU)Degree

Total Fees

₹1.1 L

Duration

24 months

MCA with specialization in Machine Learning & Artificial Intelligence (ML & AI)

Amity OnlineDegree

Total Fees

₹2.5 L

Duration

2 years

MCA in Machine Learning

Amity University Online, NoidaDegree

Total Fees

₹2.5 L

Duration

2 years

Python Certificate

IIT MadrasCertificate

4.4

Total Fees

Free

Duration

4 weeks

PG Diploma in Artificial Intelligence (PG-DAI)

CDAC - Centre for Development of Advanced ComputingDiploma

4.0

Total Fees

₹1.27 L

Duration

6 weeks

Bachelor of Science in Programming and Data science

IIT MadrasDegree

3.5

Total Fees

₹1.24 L

Duration

48 months

Types of Distance Matrices in Machine Learning

Four distance metrics are mainly used in Machine Learning.

Euclidean

It is one of the most common distance metrics which is very often used in machine learning algorithms that calculates the distance between two real-valued vectors. It is the shortest distance between two points.

Mathematical Formula

The formula for Euclidean Distance (2-D):

d = [(x₁ – y₁ )²+ (x₂ – y₂)²]^1/2

Generalize Formula (n-D):

d = [(x₁ – y₁)² + (x₂ – y₂)² + (x₃ – y₃)² + …….. + (x_n – y_n)²]^½

Manhattan Distance

It is the sum of the absolute differences between points across all the dimensions.

It calculates the distance between real vectors.
It is also called Taxicab distance or City Block Distance.

Mathematical Formula

2-D

d = |x₁ – y₁| + |x₂ – y₂|

General Formula (n-D)

d = |x₁ – y₁| + |x₂ – y₂| + |x₃ – y₃| + |x₄ – y₄| + …… + |x_n – y_n|

Minkowski Distance

It is the generalization of Euclidean and Manhattan Distance.

Mathematical Formula

d = [|x₁ – y₁|^p + |x₂ – y₂|^p + |x₃ – y₃|^p + ….. + |x_n – y_n|^p]^1/p

Where p is the Order of Norm.

Hamming Distance

Hamming distance between two strings (of equal length) is the number of positions at which the corresponding alphabet or symbols differ.

In simple terms, the number of substitutes required to change one string to another.

Example:

Let there be two strings, “Naukri” and “Pujari”.

Since both the strings are of the same length, so we can calculate the Hamming Distance.

The first four places in both the strings differ, and the last two places have the same characters.

Naukri and Pujari

Hence, the hamming distance here will be 4.

Note: The larger Hamming distance value implies maximum dissimilarities between the two strings and vice versa.

Now, we will briefly discuss Manhattan Distance.

Manhattan Distance

Manhattan distance between two points X (x₁, x₂, x₃, ….., x_n) and Y (y₁, y₂, y₃, ….., y_n) in n-dimensional is the sum of the distance in each dimension.

It is called the Manhattan distance because it is the distance a car would drive in a city (e.g., Manhattan), where the buildings are laid out in square blocks, and the straight streets intersect at right angles.
Now, you also know why it is called a taxicab and city block distance.

Manhattan Distance using Python:

Calculating the Manhattan distance by defining a function

from math import sqrt

#define a manhattan function using sqrt function

def manhattan(a, b):
    return sum(abs(v1 - v2) for v1, v2 in zip (a, b))

#define the points
X = [1, 2, 3, 4, 5]
Y = [6, 7, 8, 9, 10]

#calculate the distance
manhattan (X, Y)
Copy code

Output

Note: You can also calculate the Manhattan distance using the scikit-learn library of Python.

from sklearn.metrics.pairwise import manhattan_distances
Copy code

Properties of Manhattan Distance

There are finite paths between two points whose length is equal to the Manhattan distance.
For a given point, the other point at a given Manhattan distance lies in the square.
A straight path with a length equal to Manhattan distance has only two permitted moves:
- Horizontal
- Vertical
Manhattan distance is a particular case of Minkowski Distance
- For p = 1, Manhattan Distance = Minkowski Distance
Manhattan Distance metric is preferred over Euclidean Distance when there is a high dimensionality in the data.

Conclusion

In this article, we have discussed the different types of distance metrics that are used in Machine Learning. We also covered Manhattan Distance in complete detail, with its properties and example in Python.
Hope you will like the article.
Keep Learning!!
Keep Sharing!!

About the Author

Vikram Singh

All about Manhattan Distance

Table of Content

Best-suited Data Science courses for you

Discontinued (July 2024)- Post Graduate Program in Business Analytics and Intelligence (PGP-BA&I)

Master of Computer Applications with specialization in Machine Learning and Artificial Intelligence (Online MCA)

Certification in Data Science

Post Graduate Diploma in Big Data Science & Big Data Analysis

Master of Science (Data Science)

MCA with specialization in Machine Learning & Artificial Intelligence (ML & AI)

MCA in Machine Learning

Python Certificate

PG Diploma in Artificial Intelligence (PG-DAI)

Bachelor of Science in Programming and Data science

Types of Distance Matrices in Machine Learning

Euclidean

Mathematical Formula

Manhattan Distance

Minkowski Distance

Hamming Distance

Manhattan Distance

Manhattan Distance using Python:

Properties of Manhattan Distance

Conclusion

Top Picks & New Arrivals