Skip to content
Prev 169222 / 398506 Next

How to import HTML and SQL files

Dieter Menne wrote:
The htmlParse() and htmlTreeParse() functions in the XML package
use the non-strict HTML parser in libxml2 and so the HTML document
can be malformed.  That parser tends to be quite tolerant so that
you get an HTML tree back, even if the ambiguities in the original
HTML document lead to a tree that one might not expect.

I've not had any troubles parsing HTML files with it.

D.