Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Data Compression | Results Analysis
Principal Component Analysis
course content

Course Content

Principal Component Analysis

Principal Component Analysis

1. What is Principal Component Analysis
2. Basic Concepts of PCA
3. Model Building
4. Results Analysis

bookData Compression

Before dealing with the task of compressing data with PCA, it is important to understand the difference between data compression and dimensionality reduction. Dimensional data reduction is one type of data compression. Data compression methods are divided into 2 main classes: those in which the processed data can then be restored and those in which it is impossible. Data dimensionality reduction is class 2, i.e. after processing the dataset, we will not be able to restore it back to its original form. More precisely, it can be done, but the data will not be the same, it will be an approximation to the original dataset. It is generally accepted that PCA is not a method for saving storage space, but for performing expensive operations to achieve a similar result. Let's get back to the code. We have the option to choose the amount of data variance we want to keep from the initial dataset. The value of the n_components argument must have been between 0 and 1. In this case, if we specify 0.85, that would be 85% of the stored variance.

Task

Create a PCA model with 90% variance preserved for the iris dataset:

Switch to desktopSwitch to desktop for real-world practiceContinue from where you are using one of the options below
Everything was clear?

How can we improve it?

Thanks for your feedback!

Section 4. Chapter 3
toggle bottom row

bookData Compression

Before dealing with the task of compressing data with PCA, it is important to understand the difference between data compression and dimensionality reduction. Dimensional data reduction is one type of data compression. Data compression methods are divided into 2 main classes: those in which the processed data can then be restored and those in which it is impossible. Data dimensionality reduction is class 2, i.e. after processing the dataset, we will not be able to restore it back to its original form. More precisely, it can be done, but the data will not be the same, it will be an approximation to the original dataset. It is generally accepted that PCA is not a method for saving storage space, but for performing expensive operations to achieve a similar result. Let's get back to the code. We have the option to choose the amount of data variance we want to keep from the initial dataset. The value of the n_components argument must have been between 0 and 1. In this case, if we specify 0.85, that would be 85% of the stored variance.

Task

Create a PCA model with 90% variance preserved for the iris dataset:

Switch to desktopSwitch to desktop for real-world practiceContinue from where you are using one of the options below
Everything was clear?

How can we improve it?

Thanks for your feedback!

Section 4. Chapter 3
toggle bottom row

bookData Compression

Before dealing with the task of compressing data with PCA, it is important to understand the difference between data compression and dimensionality reduction. Dimensional data reduction is one type of data compression. Data compression methods are divided into 2 main classes: those in which the processed data can then be restored and those in which it is impossible. Data dimensionality reduction is class 2, i.e. after processing the dataset, we will not be able to restore it back to its original form. More precisely, it can be done, but the data will not be the same, it will be an approximation to the original dataset. It is generally accepted that PCA is not a method for saving storage space, but for performing expensive operations to achieve a similar result. Let's get back to the code. We have the option to choose the amount of data variance we want to keep from the initial dataset. The value of the n_components argument must have been between 0 and 1. In this case, if we specify 0.85, that would be 85% of the stored variance.

Task

Create a PCA model with 90% variance preserved for the iris dataset:

Switch to desktopSwitch to desktop for real-world practiceContinue from where you are using one of the options below
Everything was clear?

How can we improve it?

Thanks for your feedback!

Before dealing with the task of compressing data with PCA, it is important to understand the difference between data compression and dimensionality reduction. Dimensional data reduction is one type of data compression. Data compression methods are divided into 2 main classes: those in which the processed data can then be restored and those in which it is impossible. Data dimensionality reduction is class 2, i.e. after processing the dataset, we will not be able to restore it back to its original form. More precisely, it can be done, but the data will not be the same, it will be an approximation to the original dataset. It is generally accepted that PCA is not a method for saving storage space, but for performing expensive operations to achieve a similar result. Let's get back to the code. We have the option to choose the amount of data variance we want to keep from the initial dataset. The value of the n_components argument must have been between 0 and 1. In this case, if we specify 0.85, that would be 85% of the stored variance.

Task

Create a PCA model with 90% variance preserved for the iris dataset:

Switch to desktopSwitch to desktop for real-world practiceContinue from where you are using one of the options below
Section 4. Chapter 3
Switch to desktopSwitch to desktop for real-world practiceContinue from where you are using one of the options below
some-alt