Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Removing Duplicates | Data Cleaning
Preprocessing Data
course content

Contenido del Curso

Preprocessing Data

Preprocessing Data

1. Data Exploration
2. Data Cleaning
3. Data Validation
4. Normalization & Standardization
5. Data Encoding

Removing Duplicates

To remove the duplicate rows, simply use function drop_duplicates(). To change the current dataframe, add inplace=True.

123
new_data = data.drop_duplicates() # data is not modified # or data.drop_duplicates(inplace=True) # data is modified
copy

Tarea

The planets dataset is given to you. Remove the duplicates and then check the new shape of dataframe. Compare it with the original shape.

Note that dataframe may have only distinct records, in this case, the shape will remain the same.

Tarea

The planets dataset is given to you. Remove the duplicates and then check the new shape of dataframe. Compare it with the original shape.

Note that dataframe may have only distinct records, in this case, the shape will remain the same.

Cambia al escritorio para practicar en el mundo realContinúe desde donde se encuentra utilizando una de las siguientes opciones

¿Todo estuvo claro?

Sección 2. Capítulo 7
toggle bottom row

Removing Duplicates

To remove the duplicate rows, simply use function drop_duplicates(). To change the current dataframe, add inplace=True.

123
new_data = data.drop_duplicates() # data is not modified # or data.drop_duplicates(inplace=True) # data is modified
copy

Tarea

The planets dataset is given to you. Remove the duplicates and then check the new shape of dataframe. Compare it with the original shape.

Note that dataframe may have only distinct records, in this case, the shape will remain the same.

Tarea

The planets dataset is given to you. Remove the duplicates and then check the new shape of dataframe. Compare it with the original shape.

Note that dataframe may have only distinct records, in this case, the shape will remain the same.

Cambia al escritorio para practicar en el mundo realContinúe desde donde se encuentra utilizando una de las siguientes opciones

¿Todo estuvo claro?

Sección 2. Capítulo 7
toggle bottom row

Removing Duplicates

To remove the duplicate rows, simply use function drop_duplicates(). To change the current dataframe, add inplace=True.

123
new_data = data.drop_duplicates() # data is not modified # or data.drop_duplicates(inplace=True) # data is modified
copy

Tarea

The planets dataset is given to you. Remove the duplicates and then check the new shape of dataframe. Compare it with the original shape.

Note that dataframe may have only distinct records, in this case, the shape will remain the same.

Tarea

The planets dataset is given to you. Remove the duplicates and then check the new shape of dataframe. Compare it with the original shape.

Note that dataframe may have only distinct records, in this case, the shape will remain the same.

Cambia al escritorio para practicar en el mundo realContinúe desde donde se encuentra utilizando una de las siguientes opciones

¿Todo estuvo claro?

To remove the duplicate rows, simply use function drop_duplicates(). To change the current dataframe, add inplace=True.

123
new_data = data.drop_duplicates() # data is not modified # or data.drop_duplicates(inplace=True) # data is modified
copy

Tarea

The planets dataset is given to you. Remove the duplicates and then check the new shape of dataframe. Compare it with the original shape.

Note that dataframe may have only distinct records, in this case, the shape will remain the same.

Cambia al escritorio para practicar en el mundo realContinúe desde donde se encuentra utilizando una de las siguientes opciones
Sección 2. Capítulo 7
Cambia al escritorio para practicar en el mundo realContinúe desde donde se encuentra utilizando una de las siguientes opciones
We're sorry to hear that something went wrong. What happened?
some-alt