Contenido del Curso
Cluster Analysis
Cluster Analysis
Implementing on Dummy Dataset
You'll create two datasets to demonstrate DBSCAN's strengths:
-
Moons: two interleaving half circles;
-
Circles: a small circle inside a larger circle.
The algorithm is as follows:
-
You instantiate the
DBSCAN
object, settingeps
andmin_samples
; -
You fit the model to your data;
-
You visualize the results by plotting the data points and coloring them according to their assigned cluster labels.
Tuning Hyperparameters
The choice of eps
and min_samples
significantly impacts the clustering outcome. Experiment with different values to find what works best for your data. For instance, if eps
is too large, all points might end up in a single cluster. If eps
is too small, many points might be classified as noise. You can also scale the features.
¡Gracias por tus comentarios!