NAME

String::Multibyte::EUC_JP - internally used by String::Multibyte for EUC-JP

SYNOPSIS

use String::Multibyte;

$eucjp = String::Multibyte->new('EUC_JP');
$eucjp_length = $eucjp->length($eucjp_string);

DESCRIPTION

String::Multibyte::EUC_JP is used for manipulation of strings in EUC-JP family encodings.

Byte range of single-byte characters: 0x00..0x7F.

First and second byte range of double-byte characters: 0xA1..0xFE.

Legal byte range for two bytes preceded by SS3: 0xA1..0xFE.

Character order (invalid code points are excluded): 0x00..0x7F, 0x8EA1..0x8EFE, 0xA1A1..0xFEFE, 0x8FA1A1..0x8FFEFE.

                                     EUC-JP         EUC-JISX0213
1.  0x00..0x7F         C0/G0      ASCII          ASCII
2.  0x8EA1..0x8EFE       G2       JIS kana       JIS kana
3.  0xA1A1..0xFEFE       G1       JIS X 0208     JIS X 0213 plane-1
4.  0x8FA1A1..0x8FFEFE   G3       JIS X 0212     JIS X 0213 plane-2

CAVEAT

C1 controls other than SS2/SS3 (0x80..0x8D and 0x90..0x9E) are not supported.

Unassigned G2 (0x8EE0..0x8EFE) are now included.

SEE ALSO

String::Multibyte