Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Вивчайте How Hierarchical Clustering Works? | Hierarchical Clustering
Cluster Analysis
course content

Зміст курсу

Cluster Analysis

Cluster Analysis

1. Clustering Fundamentals
2. Core Concepts
3. K-Means
4. Hierarchical Clustering
5. DBSCAN
6. GMMs

book
How Hierarchical Clustering Works?

The algorithm can either start with each point in its own cluster and successively merge them (agglomerative clustering), or start with all points in one cluster and recursively split them into smaller clusters (divisive clustering).

Since agglomerative clustering is the more commonly used approach, we'll focus on it.

The most common type of hierarchical clustering is the bottom-up approach. The algorithm is as follows:

  1. Initialization: each data point is treated as a single cluster;

  2. Calculate proximity matrix: compute the distance between each pair of clusters;

  3. Merge clusters: the two closest clusters are merged into a single cluster;

  4. Update proximity matrix: recalculate the distances between the new cluster and all remaining clusters;

  5. Repeat: steps 3 and 4 are repeated until all data points are merged into a single cluster.

Linkage Types

The proximity between two clusters is defined by the linkage type. Common linkage methods used in hierarchical clustering are:

  • Single linkage: the distance between the closest two points in the two clusters;

  • Complete linkage: the distance between the farthest two points in the two clusters;

  • Average linkage: the average distance between all pairs of points in the two clusters;

  • Ward's method: minimizes the increase in the total within-cluster variance when merging two clusters.

The choice of linkage method can impact the shape and structure of the resulting clusters. Experimentation and domain knowledge are often helpful in selecting the best method for your data.

Dendrogram

The results of hierarchical clustering are often visualized using a dendrogram.

question mark

What is the primary characteristic of the bottom-up (agglomerative) hierarchical clustering approach?

Select the correct answer

Все було зрозуміло?

Як ми можемо покращити це?

Дякуємо за ваш відгук!

Секція 4. Розділ 1
Ми дуже хвилюємося, що щось пішло не так. Що трапилося?
some-alt