Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Histograms | Aggregating and Visualizing Data
Data Manipulation using pandas
course content

Contenido del Curso

Data Manipulation using pandas

Data Manipulation using pandas

1. Preprocessing Data: Part I
2. Preprocessing Data: Part II
3. Grouping Data
4. Aggregating and Visualizing Data
5. Joining Data

bookHistograms

Let's move on to the first visualization steps. By now you already know how to clean, prepare, and aggregate data for further analysis. We'll start with histograms.

What is a histogram? Histogram is a graph that represents frequencies of numerical data (usually numerical intervals). To build histogram in pandas, apply the .hist() method to selected data. For instance, let's build a histogram for the 'totinch' column.

Note that you don't need to use the print() function to output the plot.

12345678
# Importing the library import pandas as pd # Reading the file df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/f2947b09-5f0d-4ad9-992f-ec0b87cd4b3f/data4.csv') # Histogram for the totinch column values df.totinch.hist()
copy

As parameters, you can set color (color for rectangles, like 'r', 'g', 'b', etc.) or bins (number of intervals to divide data). Let's make rectangles red and set the number of intervals to 50.

12345678
# Importing the library import pandas as pd # Reading the file df = pd.read_csv('https://codefinity-content-media.s3.eu-west-1.amazonaws.com/f2947b09-5f0d-4ad9-992f-ec0b87cd4b3f/data4.csv') # Histogram for the totinch column values df.totinch.hist(color = 'r', bins = 50)
copy

¿Todo estuvo claro?

¿Cómo podemos mejorarlo?

¡Gracias por tus comentarios!

Sección 4. Capítulo 5
some-alt