Data Preprocessing
Challenge 2
Task
Now let's revisit how to handle text data. You will be working with the 'reviews.csv'
dataset, and your task is to encode its text data using the bag-of-words method.
- Create a CountVectorizer object to tokenize the text
- Fit the tokenizer to the data
- Print the vocabulary of the tokenizer
- Encode the 'Text' column
- Print the results
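
The steps above can be sketched roughly as follows. This is a minimal illustration, not the reference solution: it assumes scikit-learn's CountVectorizer and pandas, and because the contents of 'reviews.csv' are not shown here, a tiny hand-made DataFrame with a 'Text' column stands in for it. The three sample reviews are hypothetical.

```python
import pandas as pd
from sklearn.feature_extraction.text import CountVectorizer

# Hypothetical stand-in for the real dataset. With the actual file you would
# presumably load it with pd.read_csv('reviews.csv') instead.
df = pd.DataFrame({
    "Text": [
        "good product",
        "bad product",
        "good good service",
    ]
})

# Create the vectorizer; by default it lowercases the text and keeps
# tokens of two or more alphanumeric characters.
vectorizer = CountVectorizer()

# Fit on the 'Text' column to learn the vocabulary.
vectorizer.fit(df["Text"])

# The vocabulary maps each token to its column index in the encoded matrix.
print(vectorizer.vocabulary_)

# Encode the column: one row per review, one column per vocabulary token,
# each cell holding how many times that token occurs in that review.
encoded = vectorizer.transform(df["Text"]).toarray()
print(encoded)
```

transform returns a sparse matrix; toarray() is called here only to make the small result easy to print, and for a large dataset it may be better to keep the sparse form.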