Parsing and extracting information from (possibly malformed) HTML/XML documents
TagSoup is a library for parsing HTML/XML. It supports the HTML 5
specification, and can be used to parse either well-formed XML, or
unstructured and malformed HTML from the web. The library also provides
useful functions to extract information from an HTML document, making
it ideal for screen-scraping.
Users should start from the Text.HTML.TagSoup module.
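As a minimal sketch of the screen-scraping use described above, the following collects link targets from a malformed fragment. It assumes the `tagsoup` package is installed and uses `parseTags`, `isTagOpenName`, and `fromAttrib` from `Text.HTML.TagSoup`; the helper name `extractLinks` and the sample markup are hypothetical and chosen only for illustration.

```haskell
import Text.HTML.TagSoup (parseTags, isTagOpenName, fromAttrib)

-- Hypothetical helper: collect the href value of every <a> open tag.
-- parseTags does not require well-formed input, so unclosed tags are fine.
extractLinks :: String -> [String]
extractLinks html =
  [ href
  | tag <- parseTags html
  , isTagOpenName "a" tag              -- keep only <a ...> open tags
  , let href = fromAttrib "href" tag   -- empty string when the attribute is absent
  , not (null href)
  ]

main :: IO ()
main = mapM_ putStrLn (extractLinks sample)
  where
    -- Deliberately malformed: the <a> elements are never closed.
    sample =
      "<p>Unclosed <a href='https://example.org/one'>one"
        ++ "<a href=\"https://example.org/two\">two</p>"
```

The open-tag check comes before `fromAttrib` because `fromAttrib` is only defined for open tags.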
1 derived package
Checkout Package
osc -A https://api.opensuse.org checkout openSUSE:Leap:42.2:Update/ghc-tagsoup.6476 && cd $_
Source Files
Filename | Size | Changed |
---|---|---|
ghc-tagsoup.changes | 2.64 KB (2708 bytes) | |
ghc-tagsoup.spec | 2.42 KB (2479 bytes) | |
tagsoup-0.13.10.tar.gz | 43.5 KB (44587 bytes) | |
Latest Revision
Jürgen Löhel (jloehel) accepted request 483151 from Benjamin Brunner (BenniBrunner) (revision 1)
Release from openSUSE:Maintenance:6476 / ghc-tagsoup.openSUSE_Leap_42.2_Update