Scorri per mostrare il menu

Challenge: Implementing a Random Forest

In sklearn, the classification version of Random Forest is implemented using the RandomForestClassifier:

You will also calculate the cross-validation accuracy using the cross_val_score() function:

In the end, you'll print the importance of each feature. The feature_importances_ attribute returns an array of importance scores — these scores represent how much each feature contributed to reducing Gini impurity across all the decision nodes where that feature was used. In other words, the more a feature helps split the data in a useful way, the higher its importance.

However, the attribute only gives the scores without feature names. To display both, you can pair them using Python’s zip() function:

for feature, importance in zip(X.columns, model.feature_importances_):
    print(feature, importance)

This prints each feature name along with its importance score, making it easier to understand which features the model relied on most.

Compito

Swipe to start coding

You are given a Titanic dataset stored as a DataFrame in the df variable.

Initialize the Random Forest model, set random_state=42, train it, and store the fitted model in the random_forest variable.
Calculate the cross-validation scores for the trained model using 10 folds, and store the resulting scores in the cv_scores variable.

Soluzione

Cambia al desktop per esercitarti nel mondo realeContinua da dove ti trovi utilizzando una delle opzioni seguenti

Tutto è chiaro?

Grazie per i tuoi commenti!

Sezione 4. Capitolo 3

single

Chieda ad AI

Chieda pure quello che desidera o provi una delle domande suggerite per iniziare la nostra conversazione

Challenge: Implementing a Random Forest

In sklearn, the classification version of Random Forest is implemented using the RandomForestClassifier:

You will also calculate the cross-validation accuracy using the cross_val_score() function:

However, the attribute only gives the scores without feature names. To display both, you can pair them using Python’s zip() function:

for feature, importance in zip(X.columns, model.feature_importances_):
    print(feature, importance)

This prints each feature name along with its importance score, making it easier to understand which features the model relied on most.

Compito

Swipe to start coding

You are given a Titanic dataset stored as a DataFrame in the df variable.

Initialize the Random Forest model, set random_state=42, train it, and store the fitted model in the random_forest variable.
Calculate the cross-validation scores for the trained model using 10 folds, and store the resulting scores in the cv_scores variable.

Soluzione

Cambia al desktop per esercitarti nel mondo realeContinua da dove ti trovi utilizzando una delle opzioni seguenti

Tutto è chiaro?

Grazie per i tuoi commenti!

Scorri per mostrare il menu

Challenge: Implementing a Random Forest

Soluzione

Awesome!

Challenge: Implementing a Random Forest

Soluzione

Awesome!