Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Finding the Correlation | Extracting Data
Advanced Techniques in pandas
course content

Contenido del Curso

Advanced Techniques in pandas

Advanced Techniques in pandas

1. Getting Familiar With Indexing and Selecting Data
2. Dealing With Conditions
3. Extracting Data
4. Aggregating Data
5. Preprocessing Data

Finding the Correlation

Finally, let's move to the last method of this section called .corr(). It helps out a lot to find the relationship between numerical data. Imagine that you have a dataset on houses:

Let's examine the output of the data.corr() in our case:

So, let's do it step by step: You have vertical and horizontal values; each pair overlaps. In each overlap, we can receive a value from -1 to 1.

  • 1 means that two values depend on each other in a directly proportional way (if one value increases, the other increases too);
  • -1 means that two values depend on each other in an inversely proportional way (if one value increases, the other decreases);
  • 0 means that the two dependent values aren't proportional.

Note

If the dataset contains non-numeric columns, such as in the cars.csv dataset used in the task, you should set the argument numeric_only=True to compute the correlation using only the numeric columns.

Tarea

You'll end this section with an effortless task: apply the .corr() function to the dataset. Then, try to analyze the numbers you get.

Tarea

You'll end this section with an effortless task: apply the .corr() function to the dataset. Then, try to analyze the numbers you get.

Cambia al escritorio para practicar en el mundo realContinúe desde donde se encuentra utilizando una de las siguientes opciones

¿Todo estuvo claro?

Sección 3. Capítulo 7
toggle bottom row

Finding the Correlation

Finally, let's move to the last method of this section called .corr(). It helps out a lot to find the relationship between numerical data. Imagine that you have a dataset on houses:

Let's examine the output of the data.corr() in our case:

So, let's do it step by step: You have vertical and horizontal values; each pair overlaps. In each overlap, we can receive a value from -1 to 1.

  • 1 means that two values depend on each other in a directly proportional way (if one value increases, the other increases too);
  • -1 means that two values depend on each other in an inversely proportional way (if one value increases, the other decreases);
  • 0 means that the two dependent values aren't proportional.

Note

If the dataset contains non-numeric columns, such as in the cars.csv dataset used in the task, you should set the argument numeric_only=True to compute the correlation using only the numeric columns.

Tarea

You'll end this section with an effortless task: apply the .corr() function to the dataset. Then, try to analyze the numbers you get.

Tarea

You'll end this section with an effortless task: apply the .corr() function to the dataset. Then, try to analyze the numbers you get.

Cambia al escritorio para practicar en el mundo realContinúe desde donde se encuentra utilizando una de las siguientes opciones

¿Todo estuvo claro?

Sección 3. Capítulo 7
toggle bottom row

Finding the Correlation

Finally, let's move to the last method of this section called .corr(). It helps out a lot to find the relationship between numerical data. Imagine that you have a dataset on houses:

Let's examine the output of the data.corr() in our case:

So, let's do it step by step: You have vertical and horizontal values; each pair overlaps. In each overlap, we can receive a value from -1 to 1.

  • 1 means that two values depend on each other in a directly proportional way (if one value increases, the other increases too);
  • -1 means that two values depend on each other in an inversely proportional way (if one value increases, the other decreases);
  • 0 means that the two dependent values aren't proportional.

Note

If the dataset contains non-numeric columns, such as in the cars.csv dataset used in the task, you should set the argument numeric_only=True to compute the correlation using only the numeric columns.

Tarea

You'll end this section with an effortless task: apply the .corr() function to the dataset. Then, try to analyze the numbers you get.

Tarea

You'll end this section with an effortless task: apply the .corr() function to the dataset. Then, try to analyze the numbers you get.

Cambia al escritorio para practicar en el mundo realContinúe desde donde se encuentra utilizando una de las siguientes opciones

¿Todo estuvo claro?

Finally, let's move to the last method of this section called .corr(). It helps out a lot to find the relationship between numerical data. Imagine that you have a dataset on houses:

Let's examine the output of the data.corr() in our case:

So, let's do it step by step: You have vertical and horizontal values; each pair overlaps. In each overlap, we can receive a value from -1 to 1.

  • 1 means that two values depend on each other in a directly proportional way (if one value increases, the other increases too);
  • -1 means that two values depend on each other in an inversely proportional way (if one value increases, the other decreases);
  • 0 means that the two dependent values aren't proportional.

Note

If the dataset contains non-numeric columns, such as in the cars.csv dataset used in the task, you should set the argument numeric_only=True to compute the correlation using only the numeric columns.

Tarea

You'll end this section with an effortless task: apply the .corr() function to the dataset. Then, try to analyze the numbers you get.

Cambia al escritorio para practicar en el mundo realContinúe desde donde se encuentra utilizando una de las siguientes opciones
Sección 3. Capítulo 7
Cambia al escritorio para practicar en el mundo realContinúe desde donde se encuentra utilizando una de las siguientes opciones
We're sorry to hear that something went wrong. What happened?
some-alt