Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Learn Challenge: Preprocessing the Dataset | Core Concepts
Cluster Analysis

Swipe to show menu

book
Challenge: Preprocessing the Dataset

Task

Swipe to start coding

You are given a synthetic dataset stored in the data variable.

  • Replace missing values in the 'Age' column with the mean value of this column and store the result in this column.
  • Create an instance of an appropriate encoder, which will be used for the 'City' column and store it in the city_encoder variable. Make sure to specify the removal of the first column.
  • Encode the values in the 'City' column using city_encoder and store the result in the city_encoded variable.
  • Create an instance of an appropriate encoder, which will be used for the 'Income' column and store it in the income_encoder variable. Note that 'High' > 'Middle' > 'Low'.
  • Encode the values in the 'Income' column using income_encoder and store the result in the 'Income' column.

Solution

Switch to desktopSwitch to desktop for real-world practiceContinue from where you are using one of the options below
Everything was clear?

How can we improve it?

Thanks for your feedback!

SectionΒ 2. ChapterΒ 6
single

single

Ask AI

expand

Ask AI

ChatGPT

Ask anything or try one of the suggested questions to begin our chat

close

Awesome!

Completion rate improved to 2.94

book
Challenge: Preprocessing the Dataset

Task

Swipe to start coding

You are given a synthetic dataset stored in the data variable.

  • Replace missing values in the 'Age' column with the mean value of this column and store the result in this column.
  • Create an instance of an appropriate encoder, which will be used for the 'City' column and store it in the city_encoder variable. Make sure to specify the removal of the first column.
  • Encode the values in the 'City' column using city_encoder and store the result in the city_encoded variable.
  • Create an instance of an appropriate encoder, which will be used for the 'Income' column and store it in the income_encoder variable. Note that 'High' > 'Middle' > 'Low'.
  • Encode the values in the 'Income' column using income_encoder and store the result in the 'Income' column.

Solution

Switch to desktopSwitch to desktop for real-world practiceContinue from where you are using one of the options below
Everything was clear?

How can we improve it?

Thanks for your feedback!

close

Awesome!

Completion rate improved to 2.94

Swipe to show menu

some-alt