Creating Histograms | Data Visualization
Data Analysis with R

Creating Histograms

Why Use Histograms?

Histograms are used to visualize the distribution of continuous (numerical) data. They show how data is spread across ranges (bins) and help us:

  • Detect skewness, outliers, or gaps;

  • Understand frequency distribution;

  • Quickly assess whether the data look approximately normally distributed.

They are best used for variables like price, mileage, or age.
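As a rough illustration of the skewness point above, the sketch below simulates right-skewed prices and plots them. The data frame `sim`, its `price` column, and the log-normal parameters are made up for this example and are not part of the course dataset.

```r
library(ggplot2)

# Hypothetical data: 1000 simulated prices drawn from a log-normal
# distribution, which has a long right tail
set.seed(42)
sim <- data.frame(price = rlnorm(1000, meanlog = 13, sdlog = 0.8))

# In right-skewed data the mean is pulled above the median
mean_price   <- mean(sim$price)
median_price <- median(sim$price)

# Histogram with a dashed line at the mean; most bars sit to its left
p_skew <- ggplot(data = sim, aes(x = price)) +
  geom_histogram(bins = 30, fill = "steelblue", color = "black") +
  geom_vline(xintercept = mean_price, linetype = "dashed") +
  theme_minimal()
```

Printing `p_skew` draws the plot. A long tail on the right, with the mean sitting above the median, is the visual signature of right skew.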

Basic Histogram Syntax in ggplot2

ggplot(data = df, aes(x = variable)) +
  geom_histogram()

The x variable must be numeric.
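A common way this requirement is violated is a numeric column that was read in as text. The sketch below checks the column type and converts it before plotting. The data frame `cars` and its values are hypothetical.

```r
library(ggplot2)

# Hypothetical data: prices stored as character strings
cars <- data.frame(
  selling_price = c("450000", "1200000", "780000", "650000", "2300000"),
  stringsAsFactors = FALSE
)

# Check the column type before plotting
was_numeric <- is.numeric(cars$selling_price)

# Convert to numeric so geom_histogram() can bin the values
cars$selling_price <- as.numeric(cars$selling_price)

p_numeric <- ggplot(data = cars, aes(x = selling_price)) +
  geom_histogram(bins = 5)
```

If the column were left as character, ggplot2 would treat it as discrete and the histogram would fail when drawn. For genuinely categorical variables, geom_bar() is the usual alternative.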

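Customize the plot with geom_histogram() arguments such as bins, fill, and color, and with theme functions such as theme_minimal(). If neither bins nor binwidth is set, ggplot2 falls back to 30 bins and prints a message suggesting you choose a value yourself.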
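To make the effect of these arguments concrete, here is a minimal sketch on simulated data. The data frame `demo`, the `mileage` column, and the specific bin counts are assumptions chosen for illustration.

```r
library(ggplot2)

# Hypothetical data: 500 simulated mileage values between 0 and 200,000
set.seed(1)
demo <- data.frame(mileage = runif(500, min = 0, max = 200000))

# bins sets the NUMBER of bins: fewer bins give wider, coarser bars
p_coarse <- ggplot(data = demo, aes(x = mileage)) +
  geom_histogram(bins = 10, fill = "steelblue", color = "black")

# More bins give narrower bars and a more detailed (but noisier) picture
p_fine <- ggplot(data = demo, aes(x = mileage)) +
  geom_histogram(bins = 40, fill = "steelblue", color = "black")

# binwidth sets the WIDTH of each bin in data units instead of the count
p_width <- ggplot(data = demo, aes(x = mileage)) +
  geom_histogram(binwidth = 10000, fill = "steelblue", color = "black") +
  theme_minimal()
```

Set either bins or binwidth, not both. Which value works best depends on the data, so it is worth trying a few and comparing the resulting shapes.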

Example: Distribution of Selling Prices

ggplot(data = df, aes(x = selling_price)) +  
  geom_histogram(fill = "steelblue", color = "black") +  
  labs(title = "Distribution of Selling Prices", 
       x = "Selling Price (in PKR)", 
       y = "Count") +
  theme_minimal()

This plot shows how car prices are distributed. It can reveal whether most cars fall within a particular price range.
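To back up what the plot suggests with numbers, the counts behind each bar can be pulled out with ggplot2's layer_data() helper. This is a sketch on simulated data: `df_demo` and its log-normal prices are stand-ins for the course's `df`, and 20 bins is an arbitrary choice.

```r
library(ggplot2)

# Hypothetical stand-in for the course data: 300 simulated selling prices
set.seed(123)
df_demo <- data.frame(selling_price = rlnorm(300, meanlog = 13, sdlog = 0.6))

p_prices <- ggplot(data = df_demo, aes(x = selling_price)) +
  geom_histogram(bins = 20, fill = "steelblue", color = "black")

# layer_data() returns the computed bins: one row per bar,
# with its lower edge (xmin), upper edge (xmax), and count
bin_table <- layer_data(p_prices)[, c("xmin", "xmax", "count")]

# The row with the highest count is the price range holding the most cars
busiest_bin <- bin_table[which.max(bin_table$count), ]
```

Inspecting `busiest_bin` gives the lower and upper edges of the tallest bar, which is the price range the histogram shows as most common.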

What does the bins argument in geom_histogram() control?
