stringprep_utf8_nfkc_normalize
Section: libidn (3)
Updated: 1.9
Index
Return to Main Contents
NAME
stringprep_utf8_nfkc_normalize - normalize Unicode string
SYNOPSIS
#include <stringprep.h>
char * stringprep_utf8_nfkc_normalize(const char * str, ssize_t len);
ARGUMENTS
- const char * str
-
a UTF-8 encoded string.
- ssize_t len
-
length of str, in bytes, or -1 if str is nul-terminated.
DESCRIPTION
Converts a string into canonical form, standardizing
such issues as whether a character with an accent
is represented as a base character and combining
accent or as a single precomposed character.
The normalization mode is NFKC (ALL COMPOSE). It standardizes
differences that do not affect the text content, such as the
above-mentioned accent representation. It standardizes the
"compatibility" characters in Unicode, such as SUPERSCRIPT THREE to
the standard forms (in this case DIGIT THREE). Formatting
information may be lost but for most text operations such
characters should be considered the same. It returns a result with
composed forms rather than a maximally decomposed form.
RETURN VALUE
a newly allocated string, that is the
NFKC normalized form of str.
REPORTING BUGS
Report bugs to <bug-libidn@gnu.org>.
COPYRIGHT
Copyright © 2002, 2003, 2004, 2005, 2006, 2007, 2008 Simon Josefsson.
Permission is granted to make and distribute verbatim copies of this
manual provided the copyright notice and this permission notice are
preserved on all copies.
SEE ALSO
The full documentation for
libidn
is maintained as a Texinfo manual. If the
info
and
libidn
programs are properly installed at your site, the command
-
info libidn
should give you access to the complete manual.