Unicode (ISO-UCS) characters.
This module implements Unicode (actually ISO-UCS) characters. All 31-bit code points are allowed.
Unicode characters. All 31bit code points are allowed.
char_of u
returns the Latin-1 representation of u
.
If u
can not be represented by Latin-1, raises Out_of_range
code u
returns the Unicode code number of u
.
If the value can not be represented by a positive integer,
raise Out_of_range
code n
returns the Unicode character with the code number n
.
If n >= 2^32 or n < 0, raises invalid_arg
uint_code u
returns the Unicode code number of u
.
The returned int is unsigned, that is, on 32-bits platforms,
the sign bit is used for storing the 31-th bit of the code number.
chr_of_uint n
returns the Unicode character of the code number n
.
n
is interpreted as unsigned, that is, on 32-bits platforms,
the sign bit is treated as the 31-th bit of the code number.
If n exceed 31-bits values, then raise invalid_arg
.