Summary  
This chapter covers how to generate pair plots from a DataFrame to visualize relationships and distributions among variables, with options to color by a categorical column (hue) and customize off-diagonal and diagonal plot types via `kind` and `diag_kind`.

General domain of usage  
Exploratory data analysis and visualization

A **pair plot** visualizes pairwise relationships between all numeric variables in a dataset. Unlike a joint plot, it is **not** limited to two variables. It creates an `N×N` grid of subplots, where `N` is the number of numeric columns in the `DataFrame`.

Definition

## Pair Plot Description

Each column in the grid shares the same **x-axis** variable, and each row shares the same **y-axis**. The diagonal displays **histograms** of individual variables, while off-diagonal cells show scatter plots.

## Creating a Pair Plot

You can create one using `seaborn.pairplot()`. Its only required argument is `data`, which must be a `DataFrame`. Parameters like `height` and `aspect` set the size (in inches) of each subplot.

import seaborn as sns
import matplotlib.pyplot as plt

# Loading the dataset with data about three different iris species
iris_df = sns.load_dataset('iris')

# Creating a pair plot
sns.pairplot(iris_df, height=2, aspect=0.8)

plt.show()

## Hue

The `hue` parameter assigns colors based on a specified categorical column. This highlights group differences and, when used in classification datasets, shows how classes separate across variable pairs.

With `hue` set (e.g., to `species`), the scatter plots color points by class, and diagonal plots switch from histograms to **KDE plots**, making class distributions clearer.

import seaborn as sns
import matplotlib.pyplot as plt

# Ignoring warnings
import warnings
warnings.filterwarnings('ignore')

# Loading the dataset with data about three different iris species
iris_df = sns.load_dataset('iris')

# Setting the hue parameter to 'species'
sns.pairplot(iris_df, hue='species', height=2, aspect=0.8)

plt.show()

## Changing Plot Kinds

You can customize both the main and diagonal plots.

* `kind` controls the off-diagonal plots (default: `'scatter'`);
* `diag_kind` controls the diagonal (histogram or KDE, often chosen automatically when `hue` is used).

import seaborn as sns
import matplotlib.pyplot as plt

# Loading the dataset with data about three different iris species
iris_df = sns.load_dataset('iris')

# Setting the kind parameter and diag_kind parameters
sns.pairplot(iris_df, hue='species', kind='reg', diag_kind=None, height=2, aspect=0.8)

plt.show()

`'scatter'`, `'kde'`, `'hist'`, `'reg'` are possible values for the `kind` parameter.

`diag_kind` can be set to one of the following values:
* `'auto'`;
* `'hist'`;
* `'kde'`;
* `None`. 

Everything is similar to the `jointplot()` function in this regard. 



Explore more in the <a href="https://seaborn.pydata.org/generated/seaborn.pairplot.html" target="_blank"><svg width="1em" height="1em" viewBox="0 0 30 32" fill="none" xmlns="http://www.w3.org/2000/svg"><path fill="none" stroke="#098f67" style="stroke: var(--color1, #098f67)" stroke-linejoin="miter" stroke-linecap="round" stroke-miterlimit="4" stroke-width="2.4156" d="M17.289 17.305v0c-1.754 1.754-4.597 1.754-6.351 0l-7.76-7.759c-1.755-1.755-1.755-4.601 0-6.356v0c1.753-1.753 4.595-1.756 6.351-0.005l6.208 6.187"></path><path fill="none" stroke="#098f67" style="stroke: var(--color1, #098f67)" stroke-linejoin="miter" stroke-linecap="round" stroke-miterlimit="4" stroke-width="2.4156" d="M12.504 13.97v0c1.754-1.754 4.597-1.754 6.351 0l7.762 7.762c1.754 1.754 1.754 4.597 0 6.351v0c-1.754 1.754-4.597 1.754-6.351 0l-5.953-5.953"></path></svg> <code>pairplot()</code> documentation</a>.


Study More

import unittest
import ast
import inspect
import user_code  # Student's solution file

def _dynamic_test(test_case, condition, success_message, failure_message):
    if condition:
        test_case._testMethodName = success_message
        test_case.assertTrue(True, success_message)
    else:
        test_case._testMethodName = failure_message
        test_case.fail(failure_message)

class TestPairPlot(unittest.TestCase):

    @classmethod
    def setUpClass(cls):
        source = inspect.getsource(user_code)
        cls.tree = ast.parse(source)

    def test_pairplot_called(self):
        pairplot_calls = [
            node for node in ast.walk(self.tree)
            if isinstance(node, ast.Call)
            and (
                (isinstance(node.func, ast.Attribute) and node.func.attr == 'pairplot') or
                (isinstance(node.func, ast.Name) and node.func.id == 'pairplot')
            )
        ]
        _dynamic_test(
            self,
            len(pairplot_calls) > 0,
            "sns.pairplot() is called",
            "sns.pairplot() is not called"
        )
    
    def test_data_argument(self):
        correct = False
        for call in ast.walk(self.tree):
            if isinstance(call, ast.Call):
                if (isinstance(call.func, ast.Attribute) and call.func.attr == 'pairplot') or \
                   (isinstance(call.func, ast.Name) and call.func.id == 'pairplot'):
                    # check first argument or data keyword
                    if call.args:
                        arg0 = call.args[0]
                        if isinstance(arg0, ast.Name) and arg0.id == 'penguins_df':
                            correct = True
                    for kw in call.keywords:
                        if kw.arg == 'data':
                            if isinstance(kw.value, ast.Name) and kw.value.id == 'penguins_df':
                                correct = True
        _dynamic_test(
            self,
            correct,
            "Data argument set to penguins_df",
            "Data argument not set to penguins_df"
        )

    def test_hue_argument(self):
        correct = False
        for call in ast.walk(self.tree):
            if isinstance(call, ast.Call):
                if (isinstance(call.func, ast.Attribute) and call.func.attr == 'pairplot') or \
                   (isinstance(call.func, ast.Name) and call.func.id == 'pairplot'):
                    for kw in call.keywords:
                        if kw.arg == 'hue':
                            if isinstance(kw.value, ast.Constant) and kw.value.value == 'sex':
                                correct = True
        _dynamic_test(
            self,
            correct,
            "hue argument set to 'sex'",
            "hue argument not set to 'sex'"
        )

    def test_kind_argument(self):
        correct = False
        for call in ast.walk(self.tree):
            if isinstance(call, ast.Call):
                if (isinstance(call.func, ast.Attribute) and call.func.attr == 'pairplot') or \
                   (isinstance(call.func, ast.Name) and call.func.id == 'pairplot'):
                    for kw in call.keywords:
                        if kw.arg == 'kind':
                            if isinstance(kw.value, ast.Constant) and kw.value.value == 'reg':
                                correct = True
        _dynamic_test(
            self,
            correct,
            "kind argument set to 'reg'",
            "kind argument not set to 'reg'"
        )

    def test_height_argument(self):
        correct = False
        for call in ast.walk(self.tree):
            if isinstance(call, ast.Call):
                if (isinstance(call.func, ast.Attribute) and call.func.attr == 'pairplot') or \
                   (isinstance(call.func, ast.Name) and call.func.id == 'pairplot'):
                    for kw in call.keywords:
                        if kw.arg == 'height':
                            if (isinstance(kw.value, ast.Constant) and kw.value.value == 2) or \
                               (isinstance(kw.value, ast.Num) and kw.value.n == 2):
                                correct = True
        _dynamic_test(
            self,
            correct,
            "height argument set to 2",
            "height argument not set to 2"
        )

    def test_aspect_argument(self):
        correct = False
        for call in ast.walk(self.tree):
            if isinstance(call, ast.Call):
                if (isinstance(call.func, ast.Attribute) and call.func.attr == 'pairplot') or \
                   (isinstance(call.func, ast.Name) and call.func.id == 'pairplot'):
                    for kw in call.keywords:
                        if kw.arg == 'aspect':
                            # Accept float or constant with float value 0.8
                            if isinstance(kw.value, ast.Constant) and kw.value.value == 0.8:
                                correct = True
                            elif isinstance(kw.value, ast.Num) and abs(kw.value.n - 0.8) < 1e-6:
                                correct = True
        _dynamic_test(
            self,
            correct,
            "aspect argument set to 0.8",
            "aspect argument not set to 0.8"
        )

if __name__ == '__main__':
    unittest.main()


test_pairplot.py

Data is everywhere around us, and making sense of it is extremely important. Visualization helps you deal with data by finding certain patterns and insights in it. You will develop a solid foundation of data visualization using Python and its libraries, such as matplotlib and seaborn, to get as much information from data as possible in a neat and concise way.

Discover the essentials of data visualization with Matplotlib. Learn its core concepts, explore its advantages, and create your first simple plot using this fundamental plotting library.

Master how to visualize data through the most popular plot types. Learn to build line, scatter, and bar charts to clearly communicate insights from your data.

Learn to make your plots more informative and visually appealing. Add titles, legends, colors, and grids, and discover how to arrange multiple subplots effectively.

Explore statistical visualizations that help analyze data distributions and patterns. Create histograms, box plots, and pie charts to uncover deeper statistical insights.

Take your visualization skills to the next level with Seaborn. Create advanced plots like countplots, KDEs, pair plots, and heatmaps while mastering Seaborn’s elegant style and customization options.

Pair Plot

Pair Plot Description

Creating a Pair Plot

Hue

Changing Plot Kinds

Solution