wok view perl-html-parser/description.txt @ rev 25503

f3: include extra programs
author Hans-G?nter Theisgen
date Fri Jan 27 10:37:43 2023 +0100 (15 months ago)
parents
children
line source
1 Objects of the HTML::Parser class will recognize markup and
2 separate it from plain text (alias data content) in HTML
3 documents.
4 As different kinds of markup and text are recognized, the
5 corresponding event handlers are invoked.
7 HTML::Parser is not a generic SGML parser. We have tried to
8 make it able to deal with the HTML that is actually "out there",
9 and it normally parses as closely as possible to the way the
10 popular web browsers do it instead of strictly following one
11 of the many HTML specifications from W3C.
12 Where there is disagreement, there is often an option that
13 you can enable to get the official behaviour.
15 The document to be parsed may be supplied in arbitrary chunks.
16 This makes on-the-fly parsing as documents are received from
17 the network possible.
19 If event driven parsing does not feel right for your application,
20 you might want to use HTML::PullParser.
21 This is an HTML::Parser subclass that allows a more conventional
22 program structure.