Challenge: Preprocessing Pipeline
Task
Swipe to start coding
You are given the Titanic dataset from the seaborn library.
Your task is to build a complete preprocessing pipeline that performs all essential data transformations used before machine learning.
Follow these steps:
- Load the dataset using
sns.load_dataset("titanic"). - Handle missing values:
- Numeric columns β fill with mean.
- Categorical columns β fill with mode.
- Encode the categorical features
sexandembarkedusingpd.get_dummies(). - Scale numeric columns
ageandfareusingStandardScaler. - Create a new feature
family_size = sibsp + parch + 1. - Combine all transformations into a function called
preprocess_titanic(data)that returns the final processed DataFrame. - Assign the processed dataset to a variable called
processed_data.
Print the first 5 rows of the final DataFrame.
Solution
Everything was clear?
Thanks for your feedback!
SectionΒ 3. ChapterΒ 4
single
Ask AI
Ask AI
Ask anything or try one of the suggested questions to begin our chat
Awesome!
Completion rate improved to 8.33
Challenge: Preprocessing Pipeline
Swipe to show menu
Task
Swipe to start coding
You are given the Titanic dataset from the seaborn library.
Your task is to build a complete preprocessing pipeline that performs all essential data transformations used before machine learning.
Follow these steps:
- Load the dataset using
sns.load_dataset("titanic"). - Handle missing values:
- Numeric columns β fill with mean.
- Categorical columns β fill with mode.
- Encode the categorical features
sexandembarkedusingpd.get_dummies(). - Scale numeric columns
ageandfareusingStandardScaler. - Create a new feature
family_size = sibsp + parch + 1. - Combine all transformations into a function called
preprocess_titanic(data)that returns the final processed DataFrame. - Assign the processed dataset to a variable called
processed_data.
Print the first 5 rows of the final DataFrame.
Solution
Everything was clear?
Thanks for your feedback!
SectionΒ 3. ChapterΒ 4
single