Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Leer Challenge: Build a Preprocessing Pipeline | Choosing and Evaluating Techniques
Feature Scaling and Normalization in Python

bookChallenge: Build a Preprocessing Pipeline

Taak

Swipe to start coding

You're given a small mixed-type dataset. Build a leakage-safe preprocessing + model pipeline with scikit-learn:

  1. Split data into X (features) and y (target), then do a train/test split (test_size=0.3, random_state=42).
  2. Create a ColumnTransformer named preprocess:
    • numeric columns → StandardScaler()
    • categorical columns → OneHotEncoder(handle_unknown="ignore")
  3. Build a Pipeline named pipe with steps:
    • ("preprocess", preprocess)
    • ("clf", LogisticRegression(max_iter=1000, random_state=0))
  4. Fit on train only, then predict on test:
    • compute y_pred and test_accuracy = accuracy_score(y_test, y_pred)
  5. Add a few prints at the end to show shapes and the accuracy.

Oplossing

Was alles duidelijk?

Hoe kunnen we het verbeteren?

Bedankt voor je feedback!

Sectie 5. Hoofdstuk 3
single

single

Vraag AI

expand

Vraag AI

ChatGPT

Vraag wat u wilt of probeer een van de voorgestelde vragen om onze chat te starten.

close

Awesome!

Completion rate improved to 5.26

bookChallenge: Build a Preprocessing Pipeline

Veeg om het menu te tonen

Taak

Swipe to start coding

You're given a small mixed-type dataset. Build a leakage-safe preprocessing + model pipeline with scikit-learn:

  1. Split data into X (features) and y (target), then do a train/test split (test_size=0.3, random_state=42).
  2. Create a ColumnTransformer named preprocess:
    • numeric columns → StandardScaler()
    • categorical columns → OneHotEncoder(handle_unknown="ignore")
  3. Build a Pipeline named pipe with steps:
    • ("preprocess", preprocess)
    • ("clf", LogisticRegression(max_iter=1000, random_state=0))
  4. Fit on train only, then predict on test:
    • compute y_pred and test_accuracy = accuracy_score(y_test, y_pred)
  5. Add a few prints at the end to show shapes and the accuracy.

Oplossing

Switch to desktopSchakel over naar desktop voor praktijkervaringGa verder vanaf waar je bent met een van de onderstaande opties
Was alles duidelijk?

Hoe kunnen we het verbeteren?

Bedankt voor je feedback!

Sectie 5. Hoofdstuk 3
single

single

some-alt