Web Scraping with Python (res)

Ways to Scrape Table

There are many ways to scrape tables. Which one to use depends on the structure of the table.

One option is to collect the rows yourself and then process their text with string methods. As you remember, the function find_all() looks through a tag’s descendants and returns all descendants that match the given filter. If we call it with 'tr', the result is a list of <tr> tags, one for each table row.

rows = html.find_all('tr')
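As a minimal sketch of the find_all() approach, assume html is a BeautifulSoup object. The sample markup and the variable names page and table are made up for illustration and do not come from the lesson.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup, used only for illustration.
page = """
<table>
  <tr><th>Name</th><th>Age</th></tr>
  <tr><td>Ann</td><td>31</td></tr>
  <tr><td>Bob</td><td>27</td></tr>
</table>
"""

html = BeautifulSoup(page, "html.parser")

# find_all() returns a list-like ResultSet of Tag objects, one per <tr>.
rows = html.find_all("tr")

# Pull the text out of every header (<th>) or data (<td>) cell in each row.
table = [
    [cell.get_text(strip=True) for cell in row.find_all(["th", "td"])]
    for row in rows
]
print(table)
```

Each element of rows is still a tag, so the cell text has to be extracted explicitly before any string methods can be applied to it.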

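You can also do it using XPath. Note that xpath() is a method of lxml elements, not of BeautifulSoup objects, so here html must be a tree parsed with lxml: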

rows = html.xpath('//tr')
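The XPath variant could look like the sketch below. It assumes the page is parsed with lxml, because BeautifulSoup objects have no xpath() method. The sample markup is hypothetical.

```python
from lxml import html as lxml_html

# Hypothetical sample markup, used only for illustration.
page = """
<html><body>
<table>
  <tr><th>Name</th><th>Age</th></tr>
  <tr><td>Ann</td><td>31</td></tr>
  <tr><td>Bob</td><td>27</td></tr>
</table>
</body></html>
"""

# Parse the document into an lxml element tree.
html = lxml_html.fromstring(page)

# '//tr' selects every <tr> element anywhere in the document.
rows = html.xpath("//tr")

# For each row, select its direct <th> and <td> children and read their text.
table = [
    [cell.text_content().strip() for cell in row.xpath("./th | ./td")]
    for row in rows
]
print(table)
```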

Then we can clean the data and convert it to a pandas DataFrame. This approach is useful when the table has a complex or irregular structure.
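The cleaning and conversion step might be sketched as follows. It assumes that the first row holds the column names and that pandas is installed. The sample markup, the stray whitespace in it, and the Age column are all invented for illustration.

```python
import pandas as pd
from bs4 import BeautifulSoup

# Hypothetical sample markup with untidy whitespace inside the cells.
page = """
<table>
  <tr><th> Name </th><th>Age</th></tr>
  <tr><td> Ann </td><td>31</td></tr>
  <tr><td>Bob</td><td> 27 </td></tr>
</table>
"""

html = BeautifulSoup(page, "html.parser")
rows = html.find_all("tr")

# Clean: strip surrounding whitespace from the text of every cell.
cells = [
    [cell.get_text(strip=True) for cell in row.find_all(["th", "td"])]
    for row in rows
]

# Assumption: the first row is the header, the remaining rows are data.
header, *body = cells
df = pd.DataFrame(body, columns=header)

# Scraped values arrive as strings, so numeric columns need converting.
df["Age"] = df["Age"].astype(int)
print(df)
```

Building the list of rows by hand gives full control over how merged, missing, or nested cells are handled before the DataFrame is created.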
