Learn Implementing on Dummy Dataset | DBSCAN
Cluster Analysis
Implementing on Dummy Dataset

You'll create two synthetic datasets that demonstrate DBSCAN's strength on non-convex cluster shapes:

  • Moons: two interleaving half circles;

  • Circles: a small circle inside a larger circle.
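One way to generate these two shapes is with scikit-learn's make_moons and make_circles helpers. This is a minimal sketch: the lesson does not say which generator or settings it uses, so the sample count, noise level, factor (ratio of inner to outer circle), and random_state below are assumptions chosen for illustration.

```python
from sklearn.datasets import make_circles, make_moons

# All parameter values here are illustrative assumptions, not taken from the lesson.
# Moons: two interleaving half circles.
X_moons, y_moons = make_moons(n_samples=300, noise=0.05, random_state=42)

# Circles: a small circle inside a larger one (factor = inner radius / outer radius).
X_circles, y_circles = make_circles(
    n_samples=300, noise=0.05, factor=0.5, random_state=42
)

# Each dataset is an array of 2D points; y_* holds the generating shape (0 or 1),
# which DBSCAN never sees because it is unsupervised.
print(X_moons.shape, X_circles.shape)
```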

The workflow is as follows:

  1. You instantiate the DBSCAN object, setting eps and min_samples;

  2. You fit the model to your data;

  3. You visualize the results by plotting the data points and coloring them according to their assigned cluster labels.
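The three steps above can be sketched with scikit-learn's DBSCAN class and matplotlib. The dataset settings and the eps=0.2, min_samples=5 values are assumptions picked for illustration, and plot_clusters is a hypothetical helper name, not something defined by the lesson.

```python
from sklearn.cluster import DBSCAN
from sklearn.datasets import make_circles, make_moons

# Dataset parameters and eps / min_samples are illustrative assumptions.
datasets = {
    "Moons": make_moons(n_samples=300, noise=0.05, random_state=42)[0],
    "Circles": make_circles(
        n_samples=300, noise=0.05, factor=0.5, random_state=42
    )[0],
}

labels_by_name = {}
for name, X in datasets.items():
    model = DBSCAN(eps=0.2, min_samples=5)  # step 1: instantiate
    model.fit(X)                            # step 2: fit
    labels_by_name[name] = model.labels_    # label -1 marks noise points


def plot_clusters(datasets, labels_by_name):
    """Step 3: scatter each dataset, coloring points by cluster label."""
    import matplotlib.pyplot as plt

    fig, axes = plt.subplots(1, len(datasets), figsize=(10, 4), squeeze=False)
    for ax, (name, X) in zip(axes[0], datasets.items()):
        ax.scatter(X[:, 0], X[:, 1], c=labels_by_name[name], cmap="viridis", s=15)
        ax.set_title(f"DBSCAN on {name}")
    return fig
```

To display the result, call plot_clusters(datasets, labels_by_name) and then matplotlib.pyplot.show(). Points labeled -1 get their own color, which makes noise easy to spot.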

Tuning Hyperparameters

The choice of eps and min_samples significantly impacts the clustering outcome, so experiment with different values to find what works best for your data. If eps is too large, all points might end up in a single cluster; if it is too small, many points might be classified as noise. Because eps is a distance threshold, features on very different scales can distort the neighborhoods, so scaling the features before clustering can also help.
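A small experiment makes the effect of eps visible. The sketch below standardizes the moons data with StandardScaler and then counts clusters and noise points for a few eps values. The specific eps values, min_samples=5, the dataset settings, and the summarize helper are assumptions for illustration only.

```python
import numpy as np
from sklearn.cluster import DBSCAN
from sklearn.datasets import make_moons
from sklearn.preprocessing import StandardScaler

# Illustrative data; parameters are assumptions, not taken from the lesson.
X, _ = make_moons(n_samples=300, noise=0.05, random_state=42)

# Standardize so that eps means the same distance along every feature.
X_scaled = StandardScaler().fit_transform(X)


def summarize(X, eps, min_samples=5):
    """Return (number of clusters, number of noise points) for one setting."""
    labels = DBSCAN(eps=eps, min_samples=min_samples).fit_predict(X)
    n_clusters = len(set(labels)) - (1 if -1 in labels else 0)
    n_noise = int(np.sum(labels == -1))
    return n_clusters, n_noise


# Compare a very small, a moderate, and a very large neighborhood radius.
for eps in (0.02, 0.2, 2.0):
    n_clusters, n_noise = summarize(X_scaled, eps)
    print(f"eps={eps}: {n_clusters} clusters, {n_noise} noise points")
```

With a very small eps, few points have enough neighbors to be core points, so most are marked as noise. With a very large eps, every neighborhood spans the whole dataset and the points merge into one cluster.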
