utf8proc
utf8proc is a library for processing UTF-8 encoded Unicode strings.
Some features are Unicode normalization, stripping of default ignorable characters, case folding and detection of grapheme cluster boundaries. A special character mapping is available, which converts for example the characters “Hyphen” (U+2010), “Minus” (U+2212) and “Hyphen-Minus (U+002D, ASCII Minus) all into the ASCII minus sign, to make them equal for comparisons.
- Sources inherited from project SUSE:SLE-15-SP4:GA
- Download package
-
Checkout Package
osc -A https://api.opensuse.org checkout SUSE:SLE-15-SP5:GA/utf8proc && cd $_
- Create Badge
Refresh
Refresh
Source Files
Filename | Size | Changed |
---|---|---|
utf8proc-2.6.1.tar.gz | 0000182903 179 KB | |
utf8proc.changes | 0000004013 3.92 KB | |
utf8proc.spec | 0000003492 3.41 KB |
Latest Revision
Gustavo Yokoyama Ribeiro (gyribeiro)
committed
(revision 2)
- update to 2.6.1: - Bugfix in `utf8proc_grapheme_break_stateful` for `NULL` state argument, which also broke `utf8proc_grapheme_break`. - New `utf8proc_islower` and `utf8proc_isupper` functions ([#196]). - Bugfix for manual calls to `grapheme_break_extended` for initial characters ([#205]). - Various build and portability improvements. - utf8proc 2.5.0: * Unicode 13 support * No longer report zero width for category Sk - utf8proc 2.4.0: * Unicode 12.1 support * Fixed incorrect UTF8PROC_VERSION_MINOR version number in header and bumped shared-library version - includes changes from 2.3.0: * Unicode 12 support * New function utf8proc_unicode_version to return the supported Unicode version * Simpler character-width computation that no longer uses GNU Unifont metrics: East-Asian wide characters have width 2, and all other printable characters have width 1 * Fix CHARBOUND option for utf8proc_map to preserve U+FFFE and U+FFFF non-characters - utf8proc 2.2.0: * Unicode 11 support * utf8proc_NFKC_Casefold convenience function for NFKC_Casefold normalization * UTF8PROC_STRIPNA option to strip unassigned codepoints
Comments 0