Course Content
Introduction to Scikit Learn
Introduction to Scikit Learn
Loading Data Sets
Scikit-learn is one of the most widely used Python packages for Data Science and Machine Learning. Scikit-learn is a library implementing the algorithms for Supervised Learning and Unsupervised Learning. First of all, to work in Data Science you have to work with data. So, you have to get the data from somewhere.
One of the advantages of this library is that it already contains some basic datasets that can be used immediately. For this there are special functions in Scikit-learn. Let's look at them.
- Let's get data about people with diabets. For this dataset there is
load_diabetes()
function in the Scikit-learn library.
# Importing needed function from library from sklearn.datasets import load_boston # Using function fo get data about diabets from library X, y = load_diabetes(return_X_y = True, as_frame = True) # Displaying independent variables print(X) # Dislpaying dependent variable print(y)
Here, we have very important bool parameters:
return_X_y
andas_frame
. Exaplain.
- Let's take a look at the another popular dataset about iris flowers and their features. See the example below.
# Importing needed function from library from sklearn.datasets import load_iris # Using function fo get data about flowers from library X, y = load_iris(return_X_y = True, as_frame = True) # Displaying independent variables print(X) # Dislpaying dependent variable print(y)
Task
- You have to
load_wine
function from sklearn library. - You have to use imported function to get the data about properties and qualities of wines.
- Please, use very important parameters in the right way:
- you have to get the data, whica are divided by dependent and independetn variables;
- you have to get this data in the data frame format.
- Display independent variables(X) and dependent variable(y) on the screen.
Thanks for your feedback!
Loading Data Sets
Scikit-learn is one of the most widely used Python packages for Data Science and Machine Learning. Scikit-learn is a library implementing the algorithms for Supervised Learning and Unsupervised Learning. First of all, to work in Data Science you have to work with data. So, you have to get the data from somewhere.
One of the advantages of this library is that it already contains some basic datasets that can be used immediately. For this there are special functions in Scikit-learn. Let's look at them.
- Let's get data about people with diabets. For this dataset there is
load_diabetes()
function in the Scikit-learn library.
# Importing needed function from library from sklearn.datasets import load_boston # Using function fo get data about diabets from library X, y = load_diabetes(return_X_y = True, as_frame = True) # Displaying independent variables print(X) # Dislpaying dependent variable print(y)
Here, we have very important bool parameters:
return_X_y
andas_frame
. Exaplain.
- Let's take a look at the another popular dataset about iris flowers and their features. See the example below.
# Importing needed function from library from sklearn.datasets import load_iris # Using function fo get data about flowers from library X, y = load_iris(return_X_y = True, as_frame = True) # Displaying independent variables print(X) # Dislpaying dependent variable print(y)
Task
- You have to
load_wine
function from sklearn library. - You have to use imported function to get the data about properties and qualities of wines.
- Please, use very important parameters in the right way:
- you have to get the data, whica are divided by dependent and independetn variables;
- you have to get this data in the data frame format.
- Display independent variables(X) and dependent variable(y) on the screen.
Thanks for your feedback!
Loading Data Sets
Scikit-learn is one of the most widely used Python packages for Data Science and Machine Learning. Scikit-learn is a library implementing the algorithms for Supervised Learning and Unsupervised Learning. First of all, to work in Data Science you have to work with data. So, you have to get the data from somewhere.
One of the advantages of this library is that it already contains some basic datasets that can be used immediately. For this there are special functions in Scikit-learn. Let's look at them.
- Let's get data about people with diabets. For this dataset there is
load_diabetes()
function in the Scikit-learn library.
# Importing needed function from library from sklearn.datasets import load_boston # Using function fo get data about diabets from library X, y = load_diabetes(return_X_y = True, as_frame = True) # Displaying independent variables print(X) # Dislpaying dependent variable print(y)
Here, we have very important bool parameters:
return_X_y
andas_frame
. Exaplain.
- Let's take a look at the another popular dataset about iris flowers and their features. See the example below.
# Importing needed function from library from sklearn.datasets import load_iris # Using function fo get data about flowers from library X, y = load_iris(return_X_y = True, as_frame = True) # Displaying independent variables print(X) # Dislpaying dependent variable print(y)
Task
- You have to
load_wine
function from sklearn library. - You have to use imported function to get the data about properties and qualities of wines.
- Please, use very important parameters in the right way:
- you have to get the data, whica are divided by dependent and independetn variables;
- you have to get this data in the data frame format.
- Display independent variables(X) and dependent variable(y) on the screen.
Thanks for your feedback!
Scikit-learn is one of the most widely used Python packages for Data Science and Machine Learning. Scikit-learn is a library implementing the algorithms for Supervised Learning and Unsupervised Learning. First of all, to work in Data Science you have to work with data. So, you have to get the data from somewhere.
One of the advantages of this library is that it already contains some basic datasets that can be used immediately. For this there are special functions in Scikit-learn. Let's look at them.
- Let's get data about people with diabets. For this dataset there is
load_diabetes()
function in the Scikit-learn library.
# Importing needed function from library from sklearn.datasets import load_boston # Using function fo get data about diabets from library X, y = load_diabetes(return_X_y = True, as_frame = True) # Displaying independent variables print(X) # Dislpaying dependent variable print(y)
Here, we have very important bool parameters:
return_X_y
andas_frame
. Exaplain.
- Let's take a look at the another popular dataset about iris flowers and their features. See the example below.
# Importing needed function from library from sklearn.datasets import load_iris # Using function fo get data about flowers from library X, y = load_iris(return_X_y = True, as_frame = True) # Displaying independent variables print(X) # Dislpaying dependent variable print(y)
Task
- You have to
load_wine
function from sklearn library. - You have to use imported function to get the data about properties and qualities of wines.
- Please, use very important parameters in the right way:
- you have to get the data, whica are divided by dependent and independetn variables;
- you have to get this data in the data frame format.
- Display independent variables(X) and dependent variable(y) on the screen.