Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Clean and Convert | Identifying Fake News
Identifying Fake News
course content

Зміст курсу

Identifying Fake News

bookClean and Convert

We have decided to create a complete chapter on the topic of text cleaning and preprocessing. As you may imagine, complete texts cannot be directly fed into an ML model. For this reason, we will apply specific preprocessing techniques.

The first step will be to remove punctuation from our column to reduce noise in our data. We will do this using regex (regular expression matching operations).

Then we will vectorize our text. Refer to the picture below for more information. Essentially, we will represent words, sentences, or even larger units of text as vectors.

Завдання

  1. Remove punctuaction with regex by using the appropriate method to replace the given pattern with an empty string.
  2. Vectorize the texts of the articles.

Mark tasks as Completed
Switch to desktopПерейдіть на комп'ютер для реальної практикиПродовжуйте з того місця, де ви зупинились, використовуючи один з наведених нижче варіантів
Все було зрозуміло?

Як ми можемо покращити це?

Дякуємо за ваш відгук!

We have decided to create a complete chapter on the topic of text cleaning and preprocessing. As you may imagine, complete texts cannot be directly fed into an ML model. For this reason, we will apply specific preprocessing techniques.

The first step will be to remove punctuation from our column to reduce noise in our data. We will do this using regex (regular expression matching operations).

Then we will vectorize our text. Refer to the picture below for more information. Essentially, we will represent words, sentences, or even larger units of text as vectors.

Завдання

  1. Remove punctuaction with regex by using the appropriate method to replace the given pattern with an empty string.
  2. Vectorize the texts of the articles.

Mark tasks as Completed
Switch to desktopПерейдіть на комп'ютер для реальної практикиПродовжуйте з того місця, де ви зупинились, використовуючи один з наведених нижче варіантів
Секція 1. Розділ 4
AVAILABLE TO ULTIMATE ONLY
some-alt