Conteúdo do Curso
Classification with Python
Classification with Python
Challenge: Implementing a Decision Tree
In this challenge, you will use the Titanic dataset, which contains information about passengers on the Titanic, including their age, sex, family size, and more. The goal is to predict whether a passenger survived or not.
import pandas as pd df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/b71ff7ac-3932-41d2-a4d8-060e24b00129/titanic.csv') print(df.head())
To implement the Decision Tree, you can use the DecisionTreeClassifier
from sklearn
:
Your task is to build a Decision Tree and find the best max_depth
and min_samples_leaf
using grid search.
Swipe to start coding
You are given a Titanic dataset stored as a DataFrame
in the df
variable.
- Initialize a Decision Tree model and store it in the
decision_tree
variable. - Create a dictionary for
GridSearchCV
to iterate through[1, 2, 3, 4, 5, 6, 7]
values formax_depth
and[1, 2, 4, 6]
values formin_samples_leaf
, and store it in theparam_grid
variable. - Initialize and train a
GridSearchCV
object, set the number of folds to10
, and store the trained model in thegrid_cv
variable.
Solução
Obrigado pelo seu feedback!
Challenge: Implementing a Decision Tree
In this challenge, you will use the Titanic dataset, which contains information about passengers on the Titanic, including their age, sex, family size, and more. The goal is to predict whether a passenger survived or not.
import pandas as pd df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/b71ff7ac-3932-41d2-a4d8-060e24b00129/titanic.csv') print(df.head())
To implement the Decision Tree, you can use the DecisionTreeClassifier
from sklearn
:
Your task is to build a Decision Tree and find the best max_depth
and min_samples_leaf
using grid search.
Swipe to start coding
You are given a Titanic dataset stored as a DataFrame
in the df
variable.
- Initialize a Decision Tree model and store it in the
decision_tree
variable. - Create a dictionary for
GridSearchCV
to iterate through[1, 2, 3, 4, 5, 6, 7]
values formax_depth
and[1, 2, 4, 6]
values formin_samples_leaf
, and store it in theparam_grid
variable. - Initialize and train a
GridSearchCV
object, set the number of folds to10
, and store the trained model in thegrid_cv
variable.
Solução
Obrigado pelo seu feedback!