Seksjon 1. Kapittel 19
single
Challenge: Build a Preprocessing Pipeline
Sveip for å vise menyen
Oppgave
Sveip for å begynne å kode
You're given a small mixed-type dataset. Build a leakage-safe preprocessing + model pipeline with scikit-learn:
- Split data into X (features) and y (target), then do a train/test split (
test_size=0.3,random_state=42). - Create a ColumnTransformer named
preprocess:- numeric columns →
StandardScaler() - categorical columns →
OneHotEncoder(handle_unknown="ignore")
- numeric columns →
- Build a Pipeline named
pipewith steps:("preprocess", preprocess)("clf", LogisticRegression(max_iter=1000, random_state=0))
- Fit on train only, then predict on test:
- compute
y_predandtest_accuracy = accuracy_score(y_test, y_pred)
- compute
- Add a few prints at the end to show shapes and the accuracy.
Løsning
Alt var klart?
Takk for tilbakemeldingene dine!
Seksjon 1. Kapittel 19
single
Spør AI
Spør AI
Spør om hva du vil, eller prøv ett av de foreslåtte spørsmålene for å starte chatten vår