Contenido del Curso
Web Scraping with Python (res)
Web Scraping with Python (res)
Hierarchy in the HTML
As you probably know, HTML tags are nested within other tags. They form an inheritance tree, where nested tags are "children" of other tags:
Here the inheritance goes from top to bottom. For example, the two p
elements are children of the same parent div
, head
and body
elements are descendants (second generation) of html
.
In the future, it will be essential to understand the structure of the HTML file to write paths to the tags we want to extract; however, we will consider methods that can be used by searching tags or attributes.
¡Gracias por tus comentarios!