NAME

Unicode::EastAsianWidth - East Asian Width properties

SYNOPSIS

use Unicode::EastAsianWidth;

$_ = chr(0x2010); # HYPHEN, an ambiguous-width character

/\p{InEastAsianAmbiguous}/; # True
/\p{InFullwidth}/;          # False

To mark ambiguous-width characters as InFullwidth:

BEGIN { $Unicode::EastAsianWidth::EastAsian = 1; }
use Unicode::EastAsianWidth;

$_ = chr(0x2010); # HYPHEN, an ambiguous-width character
/\p{InFullwidth}/;          # True

DESCRIPTION

This module provide user-defined Unicode properties that deal with width status of East Asian characters, as specified in http://www.unicode.org/unicode/reports/tr11/.

It exports the following functions to the caller's scope, to be used by Perl's Unicode matching system: InEastAsianFullwidth, InEastAsianHalfwidth, InEastAsianAmbiguous, InEastAsianNarrow InEastAsianWide, InEastAsianNeutral.

In accord to TR11 cited above, two additional context-sensitive properties are exported: InFullwidth (union of Fullwidth and Wide) and InHalfwidth (union of Halfwidth, Narrow and Neutral).

Ambiguous characters are treated by default as part of InHalfwidth, but you can modify this behaviour by assigning a true value to $Unicode::EastAsianWidth::EastAsian at compile time within a BEGIN block before loading this module:

BEGIN { $Unicode::EastAsianWidth::EastAsian = 1; }
use Unicode::EastAsianWidth;

Setting $Unicode::EastAsianWidth::EastAsian at run-time used to work on Perl versions between 5.8 and 5.14 due to an implementation detail, but it will no longer work on Perl 5.16 and later versions, and hence is not recommended.

SEE ALSO

perlunicode, http://www.unicode.org/unicode/reports/tr11/

AUTHORS

唐鳳 <cpan@audreyt.org>

CC0 1.0 Universal

To the extent possible under law, 唐鳳 has waived all copyright and related or neighboring rights to Unicode-EastAsianWidth.

This work is published from Taiwan.

http://creativecommons.org/publicdomain/zero/1.0

1 POD Error

The following errors were encountered while parsing the POD:

Around line 486:

Non-ASCII character seen before =encoding in '唐鳳'. Assuming UTF-8