Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Learn Challenge: Preprocessing the Dataset | Core Concepts
Cluster Analysis
course content

Course Content

Cluster Analysis

Cluster Analysis

1. Clustering Fundamentals
2. Core Concepts
3. K-Means
4. Hierarchical Clustering
5. DBSCAN
6. GMMs

book
Challenge: Preprocessing the Dataset

Task

Swipe to start coding

You are given a synthetic dataset stored in the data variable.

  • Replace missing values in the 'Age' column with the mean value of this column and store the result in this column.
  • Create an instance of an appropriate encoder, which will be used for the 'City' column and store it in the city_encoder variable. Make sure to specify the removal of the first column.
  • Encode the values in the 'City' column using city_encoder and store the result in the city_encoded variable.
  • Create an instance of an appropriate encoder, which will be used for the 'Income' column and store it in the income_encoder variable. Note that 'High' > 'Middle' > 'Low'.
  • Encode the values in the 'Income' column using income_encoder and store the result in the 'Income' column.

Solution

Switch to desktopSwitch to desktop for real-world practiceContinue from where you are using one of the options below
Everything was clear?

How can we improve it?

Thanks for your feedback!

SectionΒ 2. ChapterΒ 6
toggle bottom row

book
Challenge: Preprocessing the Dataset

Task

Swipe to start coding

You are given a synthetic dataset stored in the data variable.

  • Replace missing values in the 'Age' column with the mean value of this column and store the result in this column.
  • Create an instance of an appropriate encoder, which will be used for the 'City' column and store it in the city_encoder variable. Make sure to specify the removal of the first column.
  • Encode the values in the 'City' column using city_encoder and store the result in the city_encoded variable.
  • Create an instance of an appropriate encoder, which will be used for the 'Income' column and store it in the income_encoder variable. Note that 'High' > 'Middle' > 'Low'.
  • Encode the values in the 'Income' column using income_encoder and store the result in the 'Income' column.

Solution

Switch to desktopSwitch to desktop for real-world practiceContinue from where you are using one of the options below
Everything was clear?

How can we improve it?

Thanks for your feedback!

SectionΒ 2. ChapterΒ 6
Switch to desktopSwitch to desktop for real-world practiceContinue from where you are using one of the options below
We're sorry to hear that something went wrong. What happened?
some-alt