Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Preliminary Analysis | Clustering Demystified
Clustering Demystified
course content

Зміст курсу

Clustering Demystified

bookPreliminary Analysis

Preliminary analysis involves initial exploration and understanding of data to identify patterns, trends, or anomalies. It serves as a foundation for more in-depth analysis and decision-making in various domains such as business, research, and data science.

Methods description

  • print: This is a built-in Python function used to display the value of an expression. It prints the specified message or variable to the standard output (usually the console);
  • shape: This is a method available in data structures like Pandas DataFrame or NumPy array. It returns a tuple representing the dimensions of the data structure, often in the format (rows, columns). In this context, it prints the shape of the data, i.e., the number of rows and columns;
  • isnull(): This is a method available in Pandas DataFrame which returns a boolean DataFrame indicating whether each element in the DataFrame is NaN (missing) or not;
  • sum(): This is a method available in Pandas DataFrame which returns the sum of values for the requested axis. When used after isnull(), it computes the sum of missing values along the specified axis (usually axis=0 for columns). In this context, it prints the total number of missing values in each column.

Завдання

  1. Print the shape of your data.
  2. Check for any NaN value.

Mark tasks as Completed
Switch to desktopПерейдіть на комп'ютер для реальної практикиПродовжуйте з того місця, де ви зупинились, використовуючи один з наведених нижче варіантів
Все було зрозуміло?

Як ми можемо покращити це?

Дякуємо за ваш відгук!

Preliminary analysis involves initial exploration and understanding of data to identify patterns, trends, or anomalies. It serves as a foundation for more in-depth analysis and decision-making in various domains such as business, research, and data science.

Methods description

  • print: This is a built-in Python function used to display the value of an expression. It prints the specified message or variable to the standard output (usually the console);
  • shape: This is a method available in data structures like Pandas DataFrame or NumPy array. It returns a tuple representing the dimensions of the data structure, often in the format (rows, columns). In this context, it prints the shape of the data, i.e., the number of rows and columns;
  • isnull(): This is a method available in Pandas DataFrame which returns a boolean DataFrame indicating whether each element in the DataFrame is NaN (missing) or not;
  • sum(): This is a method available in Pandas DataFrame which returns the sum of values for the requested axis. When used after isnull(), it computes the sum of missing values along the specified axis (usually axis=0 for columns). In this context, it prints the total number of missing values in each column.

Завдання

  1. Print the shape of your data.
  2. Check for any NaN value.

Mark tasks as Completed
Switch to desktopПерейдіть на комп'ютер для реальної практикиПродовжуйте з того місця, де ви зупинились, використовуючи один з наведених нижче варіантів
Секція 1. Розділ 3
AVAILABLE TO ULTIMATE ONLY
some-alt