Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Lära Loading Data Sets | The Very First Steps
Introduction to Scikit Learn

Svep för att visa menyn

book
Loading Data Sets

Scikit-learn is one of the most widely used Python packages for Data Science and Machine Learning. Scikit-learn is a library implementing the algorithms for Supervised Learning and Unsupervised Learning. First of all, to work in Data Science you have to work with data. So, you have to get the data from somewhere.

One of the advantages of this library is that it already contains some basic datasets that can be used immediately. For this there are special functions in Scikit-learn. Let's look at them.

  1. Let's get data about people with diabets. For this dataset there is load_diabetes() function in the Scikit-learn library.

12345678
# Importing needed function from library from sklearn.datasets import load_boston # Using function fo get data about diabets from library X, y = load_diabetes(return_X_y = True, as_frame = True) # Displaying independent variables print(X) # Dislpaying dependent variable print(y)
copy

Here, we have very important bool parameters: return_X_y and as_frame. Exaplain.

  1. Let's take a look at the another popular dataset about iris flowers and their features. See the example below.

12345678
# Importing needed function from library from sklearn.datasets import load_iris # Using function fo get data about flowers from library X, y = load_iris(return_X_y = True, as_frame = True) # Displaying independent variables print(X) # Dislpaying dependent variable print(y)
copy
Uppgift

Swipe to start coding

  1. You have to load_wine function from sklearn library.
  2. You have to use imported function to get the data about properties and qualities of wines.
  3. Please, use very important parameters in the right way:
  • you have to get the data, whica are divided by dependent and independetn variables;
  • you have to get this data in the data frame format.
  1. Display independent variables(X) and dependent variable(y) on the screen.

Lösning

Switch to desktopByt till skrivbordet för praktisk övningFortsätt där du är med ett av alternativen nedan
Var allt tydligt?

Hur kan vi förbättra det?

Tack för dina kommentarer!

Avsnitt 1. Kapitel 1
Vi beklagar att något gick fel. Vad hände?

Fråga AI

expand
ChatGPT

Fråga vad du vill eller prova någon av de föreslagna frågorna för att starta vårt samtal

book
Loading Data Sets

Scikit-learn is one of the most widely used Python packages for Data Science and Machine Learning. Scikit-learn is a library implementing the algorithms for Supervised Learning and Unsupervised Learning. First of all, to work in Data Science you have to work with data. So, you have to get the data from somewhere.

One of the advantages of this library is that it already contains some basic datasets that can be used immediately. For this there are special functions in Scikit-learn. Let's look at them.

  1. Let's get data about people with diabets. For this dataset there is load_diabetes() function in the Scikit-learn library.

12345678
# Importing needed function from library from sklearn.datasets import load_boston # Using function fo get data about diabets from library X, y = load_diabetes(return_X_y = True, as_frame = True) # Displaying independent variables print(X) # Dislpaying dependent variable print(y)
copy

Here, we have very important bool parameters: return_X_y and as_frame. Exaplain.

  1. Let's take a look at the another popular dataset about iris flowers and their features. See the example below.

12345678
# Importing needed function from library from sklearn.datasets import load_iris # Using function fo get data about flowers from library X, y = load_iris(return_X_y = True, as_frame = True) # Displaying independent variables print(X) # Dislpaying dependent variable print(y)
copy
Uppgift

Swipe to start coding

  1. You have to load_wine function from sklearn library.
  2. You have to use imported function to get the data about properties and qualities of wines.
  3. Please, use very important parameters in the right way:
  • you have to get the data, whica are divided by dependent and independetn variables;
  • you have to get this data in the data frame format.
  1. Display independent variables(X) and dependent variable(y) on the screen.

Lösning

Switch to desktopByt till skrivbordet för praktisk övningFortsätt där du är med ett av alternativen nedan
Var allt tydligt?

Hur kan vi förbättra det?

Tack för dina kommentarer!

Avsnitt 1. Kapitel 1
Switch to desktopByt till skrivbordet för praktisk övningFortsätt där du är med ett av alternativen nedan
Vi beklagar att något gick fel. Vad hände?
some-alt