Scikit-learn Concepts
The scikit-learn (sklearn) library provides tools for preprocessing and modeling. Its main object types are estimator, transformer, predictor, and model.
Estimator
Any class with .fit() is an estimator β it learns from data.
estimator.fit(X, y) # supervised
estimator.fit(X) # unsupervised
Transformer
A transformer has .fit() and .transform(), plus .fit_transform() to do both at once.
Transformers are usually used to transform the X array. However, as we will see in the example of LabelEncoder, some transformers are made for the y array.
nan values shown in the training set in the picture indicate missing data in Python.
Predictor
A predictor is an estimator with .predict() for generating outputs.
predictor.fit(X, y)
predictor.predict(X_new)
Model
A model is a predictor with .score(), which evaluates performance.
model.fit(X, y)
model.score(X, y)
As mentioned in the previous chapter, accuracy is a metric representing the percentage of correct predictions.
The preprocessing stage involves working with transformers, and we work with predictors (more specifically with models) at the modeling stage.
Thanks for your feedback!
Ask AI
Ask AI
Ask anything or try one of the suggested questions to begin our chat
Can you explain the difference between a transformer and a predictor?
What are some examples of estimators in scikit-learn?
How is the .score() method used to evaluate a model?
Awesome!
Completion rate improved to 3.13
Scikit-learn Concepts
Swipe to show menu
The scikit-learn (sklearn) library provides tools for preprocessing and modeling. Its main object types are estimator, transformer, predictor, and model.
Estimator
Any class with .fit() is an estimator β it learns from data.
estimator.fit(X, y) # supervised
estimator.fit(X) # unsupervised
Transformer
A transformer has .fit() and .transform(), plus .fit_transform() to do both at once.
Transformers are usually used to transform the X array. However, as we will see in the example of LabelEncoder, some transformers are made for the y array.
nan values shown in the training set in the picture indicate missing data in Python.
Predictor
A predictor is an estimator with .predict() for generating outputs.
predictor.fit(X, y)
predictor.predict(X_new)
Model
A model is a predictor with .score(), which evaluates performance.
model.fit(X, y)
model.score(X, y)
As mentioned in the previous chapter, accuracy is a metric representing the percentage of correct predictions.
The preprocessing stage involves working with transformers, and we work with predictors (more specifically with models) at the modeling stage.
Thanks for your feedback!