Sektion 1. Kapitel 19
single
Challenge: Build a Preprocessing Pipeline
Stryg for at vise menuen
Opgave
Swipe to start coding
You're given a small mixed-type dataset. Build a leakage-safe preprocessing + model pipeline with scikit-learn:
- Split data into X (features) and y (target), then do a train/test split (
test_size=0.3,random_state=42). - Create a ColumnTransformer named
preprocess:- numeric columns →
StandardScaler() - categorical columns →
OneHotEncoder(handle_unknown="ignore")
- numeric columns →
- Build a Pipeline named
pipewith steps:("preprocess", preprocess)("clf", LogisticRegression(max_iter=1000, random_state=0))
- Fit on train only, then predict on test:
- compute
y_predandtest_accuracy = accuracy_score(y_test, y_pred)
- compute
- Add a few prints at the end to show shapes and the accuracy.
Løsning
Var alt klart?
Tak for dine kommentarer!
Sektion 1. Kapitel 19
single
Spørg AI
Spørg AI
Spørg om hvad som helst eller prøv et af de foreslåede spørgsmål for at starte vores chat