Course Content
Linear Regression for ML
Linear Regression for ML
Challenge
In this challenge, you are given the good old housing dataset, but this time only with the 'age'
feature.
import pandas as pd df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/b22d1166-efda-45e8-979e-6c3ecfc566fc/houses_poly.csv') print(df.head())
Let's build a scatterplot of this data.
import pandas as pd import matplotlib.pyplot as plt df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/b22d1166-efda-45e8-979e-6c3ecfc566fc/houses_poly.csv') X = df['age'] y = df['price'] plt.scatter(X, y, alpha=0.4)
Fitting a straight line to this data may not be a great choice.
The price gets higher for either brand-new or really old houses.
Fitting a parabola looks like a better choice. And that's what you will do in this challenge.
The task is to build a Polynomial Regression of degree 2 using a pipeline, as was shown in a previous chapter.
Here is a list of the classes and functions from sklearn
that you will need.
Task
- Create a model using the
make_pipeline
function.
As function arguments, pass the instances of classes that:- adds polynomial features of a degree
n
(don't forget to set theinclude_bias
toFalse
). - performs Linear Regression.
- adds polynomial features of a degree
- Train the model.
- Predict the target for
X_new
.
Thanks for your feedback!
Challenge
In this challenge, you are given the good old housing dataset, but this time only with the 'age'
feature.
import pandas as pd df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/b22d1166-efda-45e8-979e-6c3ecfc566fc/houses_poly.csv') print(df.head())
Let's build a scatterplot of this data.
import pandas as pd import matplotlib.pyplot as plt df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/b22d1166-efda-45e8-979e-6c3ecfc566fc/houses_poly.csv') X = df['age'] y = df['price'] plt.scatter(X, y, alpha=0.4)
Fitting a straight line to this data may not be a great choice.
The price gets higher for either brand-new or really old houses.
Fitting a parabola looks like a better choice. And that's what you will do in this challenge.
The task is to build a Polynomial Regression of degree 2 using a pipeline, as was shown in a previous chapter.
Here is a list of the classes and functions from sklearn
that you will need.
Task
- Create a model using the
make_pipeline
function.
As function arguments, pass the instances of classes that:- adds polynomial features of a degree
n
(don't forget to set theinclude_bias
toFalse
). - performs Linear Regression.
- adds polynomial features of a degree
- Train the model.
- Predict the target for
X_new
.
Thanks for your feedback!
Challenge
In this challenge, you are given the good old housing dataset, but this time only with the 'age'
feature.
import pandas as pd df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/b22d1166-efda-45e8-979e-6c3ecfc566fc/houses_poly.csv') print(df.head())
Let's build a scatterplot of this data.
import pandas as pd import matplotlib.pyplot as plt df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/b22d1166-efda-45e8-979e-6c3ecfc566fc/houses_poly.csv') X = df['age'] y = df['price'] plt.scatter(X, y, alpha=0.4)
Fitting a straight line to this data may not be a great choice.
The price gets higher for either brand-new or really old houses.
Fitting a parabola looks like a better choice. And that's what you will do in this challenge.
The task is to build a Polynomial Regression of degree 2 using a pipeline, as was shown in a previous chapter.
Here is a list of the classes and functions from sklearn
that you will need.
Task
- Create a model using the
make_pipeline
function.
As function arguments, pass the instances of classes that:- adds polynomial features of a degree
n
(don't forget to set theinclude_bias
toFalse
). - performs Linear Regression.
- adds polynomial features of a degree
- Train the model.
- Predict the target for
X_new
.
Thanks for your feedback!
In this challenge, you are given the good old housing dataset, but this time only with the 'age'
feature.
import pandas as pd df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/b22d1166-efda-45e8-979e-6c3ecfc566fc/houses_poly.csv') print(df.head())
Let's build a scatterplot of this data.
import pandas as pd import matplotlib.pyplot as plt df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/b22d1166-efda-45e8-979e-6c3ecfc566fc/houses_poly.csv') X = df['age'] y = df['price'] plt.scatter(X, y, alpha=0.4)
Fitting a straight line to this data may not be a great choice.
The price gets higher for either brand-new or really old houses.
Fitting a parabola looks like a better choice. And that's what you will do in this challenge.
The task is to build a Polynomial Regression of degree 2 using a pipeline, as was shown in a previous chapter.
Here is a list of the classes and functions from sklearn
that you will need.
Task
- Create a model using the
make_pipeline
function.
As function arguments, pass the instances of classes that:- adds polynomial features of a degree
n
(don't forget to set theinclude_bias
toFalse
). - performs Linear Regression.
- adds polynomial features of a degree
- Train the model.
- Predict the target for
X_new
.