Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Aprenda Optimal Number of Clusters | Hierarchical Clustering
Cluster Analysis
course content

Conteúdo do Curso

Cluster Analysis

Cluster Analysis

1. Clustering Fundamentals
2. Core Concepts
3. K-Means
4. Hierarchical Clustering
5. DBSCAN
6. GMMs

book
Optimal Number of Clusters

Unlike K-means, hierarchical clustering doesn't produce a fixed number of clusters directly. Instead, it produces a hierarchy. You need a method to decide where to cut the dendrogram to obtain the desired number of clusters.

Methods for Determining the Number of Clusters

To determine the optimal number of clusters, several methods are commonly used, including dendrogram visualization, the elbow method, and the silhouette method.

Dendrogram Visualization

This method involves visually inspecting the dendrogram for the largest vertical gaps that are not intersected by horizontal lines. The number of clusters can be deduced from the number of vertical lines these gaps encompass. However, this method is subjective and relies heavily on visual interpretation.

Elbow Method (using within-cluster sum of squares - WCSS)

In this approach, you perform hierarchical clustering for a range of cluster numbers and calculate the WCSS for each. By plotting the WCSS values against the number of clusters, you can identify an "elbow" point in the plot. This point indicates a good balance between minimizing WCSS and avoiding an excessive number of clusters, similar to the elbow method in K-means.

Silhouette Method

This method involves calculating silhouette scores for different numbers of clusters by cutting the dendrogram at various heights. The optimal number of clusters is the one that corresponds to the highest average silhouette score.

question mark

Which of the following is a commonly used method to determine the number of clusters in hierarchical clustering?

Select the correct answer

Tudo estava claro?

Como podemos melhorá-lo?

Obrigado pelo seu feedback!

Seção 4. Capítulo 2
Sentimos muito que algo saiu errado. O que aconteceu?
some-alt