Kursinhalt
Clusteranalyse
Clusteranalyse
Optimal Number of Clusters
Unlike K-means, hierarchical clustering doesn't produce a fixed number of clusters directly. Instead, it produces a hierarchy. You need a method to decide where to cut the dendrogram to obtain the desired number of clusters.
Methods for Determining the Number of Clusters
To determine the optimal number of clusters, several methods are commonly used, including dendrogram visualization, the elbow method, and the silhouette method.
Dendrogram Visualization
This method involves visually inspecting the dendrogram for the largest vertical gaps that are not intersected by horizontal lines. The number of clusters can be deduced from the number of vertical lines these gaps encompass. However, this method is subjective and relies heavily on visual interpretation.
Elbow Method (using within-cluster sum of squares - WCSS)
In this approach, you perform hierarchical clustering for a range of cluster numbers and calculate the WCSS for each. By plotting the WCSS values against the number of clusters, you can identify an "elbow" point in the plot. This point indicates a good balance between minimizing WCSS and avoiding an excessive number of clusters, similar to the elbow method in K-means.
Silhouette Method
This method involves calculating silhouette scores for different numbers of clusters by cutting the dendrogram at various heights. The optimal number of clusters is the one that corresponds to the highest average silhouette score.
Danke für Ihr Feedback!