Loading Data Sets | The Very First Steps
Introduction to Scikit Learn

Loading Data Sets

Scikit-learn is one of the most widely used Python packages for data science and machine learning. It implements algorithms for both supervised and unsupervised learning. Any data science work starts with data, so the first step is to get some.

One advantage of the library is that it ships with several small datasets that can be used immediately. Each one is loaded by a dedicated function in the sklearn.datasets module. Let's look at a few of them.
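As a rough way to see which loader functions are available, the sketch below lists the public names in sklearn.datasets that start with load_. The variable name loader_names is only illustrative, and the exact list depends on the installed Scikit-learn version. Note that a few of these functions read files from disk rather than returning a bundled dataset.

```python
import sklearn.datasets

# Collect the names in sklearn.datasets that start with "load_".
# The result depends on the installed scikit-learn version.
loader_names = sorted(
    name for name in dir(sklearn.datasets) if name.startswith("load_")
)

print(loader_names)
```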

  1. Let's get data about patients with diabetes. For this dataset, Scikit-learn provides the load_diabetes() function.

    # Import the loader function from the library
    from sklearn.datasets import load_diabetes
    # Load the diabetes data as features (X) and target (y)
    X, y = load_diabetes(return_X_y=True, as_frame=True)
    # Display the independent variables
    print(X)
    # Display the dependent variable
    print(y)

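Two boolean parameters matter here: return_X_y and as_frame. By default, the loader returns a single dictionary-like Bunch object. With return_X_y=True it returns a (data, target) tuple instead, which is why the result can be unpacked into X and y. With as_frame=True, X comes back as a pandas DataFrame and y as a pandas Series rather than as NumPy arrays.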
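To make the effect of these two parameters concrete, here is a minimal sketch that calls load_diabetes() three ways and prints the type of what comes back. The variable names (bunch, X_arr, X_df and so on) are only illustrative, and the as_frame=True call assumes pandas is installed.

```python
from sklearn.datasets import load_diabetes

# Default call: one Bunch object holding the data, the target and metadata
bunch = load_diabetes()
print(type(bunch).__name__)
print(bunch.data.shape, bunch.target.shape)

# return_X_y=True: a (data, target) tuple of NumPy arrays
X_arr, y_arr = load_diabetes(return_X_y=True)
print(type(X_arr).__name__, type(y_arr).__name__)

# return_X_y=True and as_frame=True: a pandas DataFrame and a pandas Series
X_df, y_s = load_diabetes(return_X_y=True, as_frame=True)
print(type(X_df).__name__, type(y_s).__name__)
```

The values are the same in all three cases; only the container changes.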

  2. Let's take a look at another popular dataset, which describes iris flowers and their features. See the example below.

    # Import the loader function from the library
    from sklearn.datasets import load_iris
    # Load the iris data as features (X) and target (y)
    X, y = load_iris(return_X_y=True, as_frame=True)
    # Display the independent variables
    print(X)
    # Display the dependent variable
    print(y)
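Printing a whole dataset is not always the easiest way to get a feel for it. As a small sketch, assuming pandas is installed, the loaded iris data can also be summarized with standard pandas calls:

```python
from sklearn.datasets import load_iris

X, y = load_iris(return_X_y=True, as_frame=True)

# Number of rows (flowers) and columns (features)
print(X.shape)

# Names of the feature columns
print(list(X.columns))

# First few rows of the features
print(X.head())

# How many samples belong to each class in the target
print(y.value_counts())
```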

Task

  1. Import the load_wine function from sklearn.datasets.
  2. Use the imported function to get the data about the properties and qualities of wines.
  3. Set the two key parameters correctly:
  • the data must be split into independent and dependent variables;
  • the data must be returned in data frame format.
  4. Display the independent variables (X) and the dependent variable (y) on the screen.
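After completing the task, a quick self-check might look like the sketch below. The helper check_split is hypothetical (it is not part of Scikit-learn), and it is demonstrated on the iris data so that the task itself stays unsolved. It assumes pandas is installed.

```python
import pandas as pd
from sklearn.datasets import load_iris


def check_split(X, y):
    """Hypothetical helper: True if X is a DataFrame, y is a Series,
    and both have the same number of rows."""
    return (
        isinstance(X, pd.DataFrame)
        and isinstance(y, pd.Series)
        and len(X) == len(y)
    )


# Loaded with both parameters set: passes the check
X_demo, y_demo = load_iris(return_X_y=True, as_frame=True)
result_frames = check_split(X_demo, y_demo)
print(result_frames)

# Loaded without as_frame: NumPy arrays, so the check fails
X_arr, y_arr = load_iris(return_X_y=True)
result_arrays = check_split(X_arr, y_arr)
print(result_arrays)
```

Passing your own X and y from the wine data to the same helper should give True if both parameters were set as the task asks.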
