Kursinhalt
Clusteranalyse
Clusteranalyse
Distance Measures
Clustering groups similar data points. To do this, you need to measure the "distance" between points. Distance measures tell you how alike or different data points are. Choosing the right distance measure is important.
We'll look at two common distance measures: Euclidean distance and Manhattan distance.
Euclidean Distance
Euclidean distance is like measuring the straight-line distance between two points. Imagine you are looking at a map and measuring the distance between two cities as the crow flies. That's Euclidean distance. It's the most common way to measure distance.
Think of it simply as "as the crow flies" distance. It works well when you want to know the direct distance and all directions are equally important.
For example, if you have two points, think of using a ruler to measure straight between them.
Manhattan Distance
Manhattan distance is like measuring distance in a city where you have to walk along city blocks. You can't go diagonally through buildings; you have to walk along streets. It's also called city block distance. That's exactly Manhattan distance.
Think of it as walking city blocks. It's useful when movement is restricted to horizontal and vertical directions, or when you want to be less sensitive to big differences in just one direction.
Danke für Ihr Feedback!