home:lafenghu / perl-HTML-SimpleParse

a bare-bones HTML parser

This is the HTML::SimpleParse module. It is a bare-bones HTML parser,
similar to HTML::Parser, but with a couple of important distinctions:

First, HTML::Parser knows which tags can contain other tags, which
start tags have corresponding end tags, which tags can exist only in
the <head> portion of the document, and so forth. HTML::SimpleParse
does not know any of these things. It just finds tags and text in the
HTML you give it; it does not care about the specific content of these
tags (though it does distinguish between different _types_ of tags, such
as comments, starting tags like <b>, ending tags like </b>, and so on).

Second, HTML::SimpleParse does not create a hierarchical tree of HTML
content, but rather a simple linear list. It does not pay any
attention to balancing start tags with corresponding end tags, or which
pairs of tags are inside other pairs of tags.
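
The flat-list idea described above can be illustrated with a minimal sketch.
This is not the module's implementation or API (the module itself is Perl);
it is a self-contained Python approximation, and the names `tokenize`,
`classify`, and the type labels are assumptions chosen for illustration.
Tokens are classified by their shape only, and an unbalanced end tag is
accepted without complaint.

```python
import re

# An HTML comment, or any other <...> construct. Everything between
# matches is treated as plain text.
TOKEN_RE = re.compile(r"<!--.*?-->|<[^>]*>", re.DOTALL)


def classify(markup):
    """Label markup by its shape only, with no knowledge of specific tags."""
    if markup.startswith("<!--"):
        return "comment"
    if markup.startswith("</"):
        return "endtag"
    if markup.startswith("<!"):
        return "markup"  # e.g. a doctype declaration
    return "starttag"


def tokenize(html):
    """Return a flat list of {'type', 'content'} dicts in document order.

    No tree is built and no start/end tag balancing is attempted.
    """
    tokens = []
    pos = 0
    for match in TOKEN_RE.finditer(html):
        if match.start() > pos:
            tokens.append({"type": "text", "content": html[pos:match.start()]})
        tokens.append({"type": classify(match.group()), "content": match.group()})
        pos = match.end()
    if pos < len(html):
        tokens.append({"type": "text", "content": html[pos:]})
    return tokens


if __name__ == "__main__":
    # The mismatched </i> is simply listed as an end tag.
    for token in tokenize("<b>hi</i><!-- x -->"):
        print(token["type"], repr(token["content"]))
```

Because the list preserves every character of the input in order, joining
the `content` fields reproduces the original document.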

Because of these characteristics, you can make a very effective HTML
filter by sub-classing HTML::SimpleParse.
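
A rough sketch of why a flat token stream lends itself to filtering by
sub-classing: a base class re-emits every token unchanged, and a subclass
overrides only the handler it cares about. The class and method names here
(`FlatHtmlFilter`, `handle_text`, `handle_tag`, `handle_comment`, `run`) are
hypothetical and written in Python for self-containment; they are not the
HTML::SimpleParse interface.

```python
import re

# An HTML comment, or any other <...> construct.
TOKEN_RE = re.compile(r"<!--.*?-->|<[^>]*>", re.DOTALL)


class FlatHtmlFilter:
    """Hypothetical base class: splits HTML into flat tokens and re-emits them."""

    def tokens(self, html):
        """Yield (kind, content) pairs in document order."""
        pos = 0
        for match in TOKEN_RE.finditer(html):
            if match.start() > pos:
                yield "text", html[pos:match.start()]
            kind = "comment" if match.group().startswith("<!--") else "tag"
            yield kind, match.group()
            pos = match.end()
        if pos < len(html):
            yield "text", html[pos:]

    # Default handlers pass every token through unchanged.
    def handle_text(self, content):
        return content

    def handle_tag(self, content):
        return content

    def handle_comment(self, content):
        return content

    def run(self, html):
        """Dispatch each token to its handler and join the results."""
        return "".join(
            getattr(self, "handle_" + kind)(content)
            for kind, content in self.tokens(html)
        )


class StripComments(FlatHtmlFilter):
    """Example filter: drop comments, keep everything else."""

    def handle_comment(self, content):
        return ""


if __name__ == "__main__":
    print(StripComments().run("<p>a<!-- secret -->b</p>"))
```

The base class acts as an identity filter, so a subclass only has to state
what it changes.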

Sources inherited from project openSUSE:12.2
Checkout Package
osc -A https://api.opensuse.org checkout home:lafenghu/perl-HTML-SimpleParse && cd $_

Source Files

Filename	Size	Changed
HTML-SimpleParse-0.12.tar.gz	8.29 KB (8486 bytes)	over 17 years ago
perl-HTML-SimpleParse.changes	2.35 KB (2408 bytes)	almost 13 years ago
perl-HTML-SimpleParse.spec	2.65 KB (2715 bytes)	almost 13 years ago

Latest Revision

Adrian Schröter (adrianSuSE) committed over 12 years ago (revision 1)

branched from openSUSE:Factory
