Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Correlation Matrix | Correlation
Explore the Linear Regression Using Python
course content

Conteúdo do Curso

Explore the Linear Regression Using Python

Explore the Linear Regression Using Python

1. What is the Linear Regression?
2. Correlation
3. Building and Training Model
4. Metrics to Evaluate the Model
5. Multivariate Linear Regression

bookCorrelation Matrix

Let’s go back to our dataset. To explore the relationships between all the columns, we can use a correlation matrix. It finds pairwise correlation coefficients of all columns(that's why the matrix is symmetric). Use the following method to build it and show correlation coefficients between all variables: dataframe.corr().

Use this code to see the matrix for our wine dataset:

12
matrix= data.corr().round(2) print(matrix)
copy

If we want to visualize this matrix just call function sns.heatmap and import library:

12
import seaborn as sns sns.heatmap(matrix, annot=True)
copy

If you want to improve your knowledge in Seaborn Visualization, click here!

We can see that alcohol is most positively correlated with the proline (0.64), which means that the amount of alcohol increases as the proline. The hue is most negatively correlated with the color intensity (-0.52), which means that the greater the color intensity of the wine, the lower the hue.

Tarefa

In the future, we will try to predict the characteristics of wine by the number of flavanoids in it. Flavanoids are plant pigments, and their most prominent role is to color our crops brightly.

  1. [Lines #3-4] Import the pandas, seaborn libraries.
  2. [Line #17] Write the code to define the correlation matrix rounding it to the second digit.
  3. [Lines #20-24] Find with which column flavanoids have the highest positive correlation and the negative correlation. Using the previous diagram we can obviously find that that's total_phenols (0.86) and nonflavanoid_phenols(-0.54) respectively. Assign numbers above to the variables positive_cor_value and negative_cor_value respectively (positive_cor_value = 0.86 and negative_cor_value = -0.54). Assign names and numbers to the corresponding variables.

Switch to desktopMude para o desktop para praticar no mundo realContinue de onde você está usando uma das opções abaixo
Tudo estava claro?

Como podemos melhorá-lo?

Obrigado pelo seu feedback!

Seção 2. Capítulo 2
toggle bottom row

bookCorrelation Matrix

Let’s go back to our dataset. To explore the relationships between all the columns, we can use a correlation matrix. It finds pairwise correlation coefficients of all columns(that's why the matrix is symmetric). Use the following method to build it and show correlation coefficients between all variables: dataframe.corr().

Use this code to see the matrix for our wine dataset:

12
matrix= data.corr().round(2) print(matrix)
copy

If we want to visualize this matrix just call function sns.heatmap and import library:

12
import seaborn as sns sns.heatmap(matrix, annot=True)
copy

If you want to improve your knowledge in Seaborn Visualization, click here!

We can see that alcohol is most positively correlated with the proline (0.64), which means that the amount of alcohol increases as the proline. The hue is most negatively correlated with the color intensity (-0.52), which means that the greater the color intensity of the wine, the lower the hue.

Tarefa

In the future, we will try to predict the characteristics of wine by the number of flavanoids in it. Flavanoids are plant pigments, and their most prominent role is to color our crops brightly.

  1. [Lines #3-4] Import the pandas, seaborn libraries.
  2. [Line #17] Write the code to define the correlation matrix rounding it to the second digit.
  3. [Lines #20-24] Find with which column flavanoids have the highest positive correlation and the negative correlation. Using the previous diagram we can obviously find that that's total_phenols (0.86) and nonflavanoid_phenols(-0.54) respectively. Assign numbers above to the variables positive_cor_value and negative_cor_value respectively (positive_cor_value = 0.86 and negative_cor_value = -0.54). Assign names and numbers to the corresponding variables.

Switch to desktopMude para o desktop para praticar no mundo realContinue de onde você está usando uma das opções abaixo
Tudo estava claro?

Como podemos melhorá-lo?

Obrigado pelo seu feedback!

Seção 2. Capítulo 2
toggle bottom row

bookCorrelation Matrix

Let’s go back to our dataset. To explore the relationships between all the columns, we can use a correlation matrix. It finds pairwise correlation coefficients of all columns(that's why the matrix is symmetric). Use the following method to build it and show correlation coefficients between all variables: dataframe.corr().

Use this code to see the matrix for our wine dataset:

12
matrix= data.corr().round(2) print(matrix)
copy

If we want to visualize this matrix just call function sns.heatmap and import library:

12
import seaborn as sns sns.heatmap(matrix, annot=True)
copy

If you want to improve your knowledge in Seaborn Visualization, click here!

We can see that alcohol is most positively correlated with the proline (0.64), which means that the amount of alcohol increases as the proline. The hue is most negatively correlated with the color intensity (-0.52), which means that the greater the color intensity of the wine, the lower the hue.

Tarefa

In the future, we will try to predict the characteristics of wine by the number of flavanoids in it. Flavanoids are plant pigments, and their most prominent role is to color our crops brightly.

  1. [Lines #3-4] Import the pandas, seaborn libraries.
  2. [Line #17] Write the code to define the correlation matrix rounding it to the second digit.
  3. [Lines #20-24] Find with which column flavanoids have the highest positive correlation and the negative correlation. Using the previous diagram we can obviously find that that's total_phenols (0.86) and nonflavanoid_phenols(-0.54) respectively. Assign numbers above to the variables positive_cor_value and negative_cor_value respectively (positive_cor_value = 0.86 and negative_cor_value = -0.54). Assign names and numbers to the corresponding variables.

Switch to desktopMude para o desktop para praticar no mundo realContinue de onde você está usando uma das opções abaixo
Tudo estava claro?

Como podemos melhorá-lo?

Obrigado pelo seu feedback!

Let’s go back to our dataset. To explore the relationships between all the columns, we can use a correlation matrix. It finds pairwise correlation coefficients of all columns(that's why the matrix is symmetric). Use the following method to build it and show correlation coefficients between all variables: dataframe.corr().

Use this code to see the matrix for our wine dataset:

12
matrix= data.corr().round(2) print(matrix)
copy

If we want to visualize this matrix just call function sns.heatmap and import library:

12
import seaborn as sns sns.heatmap(matrix, annot=True)
copy

If you want to improve your knowledge in Seaborn Visualization, click here!

We can see that alcohol is most positively correlated with the proline (0.64), which means that the amount of alcohol increases as the proline. The hue is most negatively correlated with the color intensity (-0.52), which means that the greater the color intensity of the wine, the lower the hue.

Tarefa

In the future, we will try to predict the characteristics of wine by the number of flavanoids in it. Flavanoids are plant pigments, and their most prominent role is to color our crops brightly.

  1. [Lines #3-4] Import the pandas, seaborn libraries.
  2. [Line #17] Write the code to define the correlation matrix rounding it to the second digit.
  3. [Lines #20-24] Find with which column flavanoids have the highest positive correlation and the negative correlation. Using the previous diagram we can obviously find that that's total_phenols (0.86) and nonflavanoid_phenols(-0.54) respectively. Assign numbers above to the variables positive_cor_value and negative_cor_value respectively (positive_cor_value = 0.86 and negative_cor_value = -0.54). Assign names and numbers to the corresponding variables.

Switch to desktopMude para o desktop para praticar no mundo realContinue de onde você está usando uma das opções abaixo
Seção 2. Capítulo 2
Switch to desktopMude para o desktop para praticar no mundo realContinue de onde você está usando uma das opções abaixo
some-alt