Parsing and extracting information from (possibly malformed) HTML/XML documents
http://hackage.haskell.org/package/tagsoup
TagSoup is a library for parsing HTML/XML. It supports the HTML 5
specification, and can be used to parse either well-formed XML, or
unstructured and malformed HTML from the web. The library also provides
useful functions to extract information from an HTML document, making
it ideal for screen-scraping.
Users should start from the Text.HTML.TagSoup module.
- Links to openSUSE:Factory / ghc-tagsoup
- Download package
-
Checkout Package
osc -A https://api.opensuse.org checkout home:dirkmueller:AL:TW/ghc-tagsoup && cd $_
- Create Badge
Refresh
Refresh
Source Files (show merged sources derived from linked package)
Filename | Size | Changed |
---|---|---|
_link | 0000000702 702 Bytes |
Latest Revision
Dirk Mueller (dirkmueller)
committed
(revision 1)
Comments 0