Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Removing Duplicates | Data Cleaning
Preprocessing Data
course content

Course Content

Preprocessing Data

Preprocessing Data

1. Data Exploration
2. Data Cleaning
3. Data Validation
4. Normalization & Standardization
5. Data Encoding

Removing Duplicates

To remove the duplicate rows, simply use function drop_duplicates(). To change the current dataframe, add inplace=True.

123
new_data = data.drop_duplicates() # data is not modified # or data.drop_duplicates(inplace=True) # data is modified
copy

Task

The planets dataset is given to you. Remove the duplicates and then check the new shape of dataframe. Compare it with the original shape.

Note that dataframe may have only distinct records, in this case, the shape will remain the same.

Task

The planets dataset is given to you. Remove the duplicates and then check the new shape of dataframe. Compare it with the original shape.

Note that dataframe may have only distinct records, in this case, the shape will remain the same.

Switch to desktop for real-world practiceContinue from where you are using one of the options below

Everything was clear?

Section 2. Chapter 7
toggle bottom row

Removing Duplicates

To remove the duplicate rows, simply use function drop_duplicates(). To change the current dataframe, add inplace=True.

123
new_data = data.drop_duplicates() # data is not modified # or data.drop_duplicates(inplace=True) # data is modified
copy

Task

The planets dataset is given to you. Remove the duplicates and then check the new shape of dataframe. Compare it with the original shape.

Note that dataframe may have only distinct records, in this case, the shape will remain the same.

Task

The planets dataset is given to you. Remove the duplicates and then check the new shape of dataframe. Compare it with the original shape.

Note that dataframe may have only distinct records, in this case, the shape will remain the same.

Switch to desktop for real-world practiceContinue from where you are using one of the options below

Everything was clear?

Section 2. Chapter 7
toggle bottom row

Removing Duplicates

To remove the duplicate rows, simply use function drop_duplicates(). To change the current dataframe, add inplace=True.

123
new_data = data.drop_duplicates() # data is not modified # or data.drop_duplicates(inplace=True) # data is modified
copy

Task

The planets dataset is given to you. Remove the duplicates and then check the new shape of dataframe. Compare it with the original shape.

Note that dataframe may have only distinct records, in this case, the shape will remain the same.

Task

The planets dataset is given to you. Remove the duplicates and then check the new shape of dataframe. Compare it with the original shape.

Note that dataframe may have only distinct records, in this case, the shape will remain the same.

Switch to desktop for real-world practiceContinue from where you are using one of the options below

Everything was clear?

To remove the duplicate rows, simply use function drop_duplicates(). To change the current dataframe, add inplace=True.

123
new_data = data.drop_duplicates() # data is not modified # or data.drop_duplicates(inplace=True) # data is modified
copy

Task

The planets dataset is given to you. Remove the duplicates and then check the new shape of dataframe. Compare it with the original shape.

Note that dataframe may have only distinct records, in this case, the shape will remain the same.

Switch to desktop for real-world practiceContinue from where you are using one of the options below
Section 2. Chapter 7
Switch to desktop for real-world practiceContinue from where you are using one of the options below
We're sorry to hear that something went wrong. What happened?
some-alt