Character encoding aliases for legacy web content

Edit Package python-webencodings

This is a Python implementation of the WHATWG Encoding standard.

* Latest documentation: http://packages.python.org/webencodings/
* Source code and issue tracker: https://github.com/gsnedders/python-webencodings
* PyPI releases: http://pypi.python.org/pypi/webencodings
* License: BSD
* Python 2.6+ and 3.3+

In order to be compatible with legacy web content when interpreting something like Content-Type: text/html; charset=latin1, tools need to use a particular set of aliases for encoding labels as well as some overriding rules. For example, US-ASCII and iso-8859-1 on the web are actually aliases for windows-1252, and an UTF-8 or UTF-16 BOM takes precedence over any other encoding declaration. The Encoding standard defines all such details so that implementations do not have to reverse-engineer each other.

This module has encoding labels and BOM detection, but the actual implementation for encoders and decoders is Python’s.

Refresh
Refresh
Source Files
Filename Size Changed
LICENSE 0000001490 1.46 KB
python-webencodings.changes 0000000996 996 Bytes
python-webencodings.spec 0000002563 2.5 KB
webencodings-0.5.1.tar.gz 0000009721 9.49 KB
Latest Revision
Marcus Rueckert's avatar Marcus Rueckert (darix) committed (revision 1)
Set link to python-webencodings.26469 via maintenance_release request
Comments 0
openSUSE Build Service is sponsored by