NAME
String::Multibyte::EUC_JP - internally used by String::Multibyte for EUC-JP
SYNOPSIS
use String::Multibyte;
$eucjp = String::Multibyte->new('EUC_JP');
$eucjp_length = $eucjp->length($eucjp_string);
DESCRIPTION
String::Multibyte::EUC_JP
is used for manipulation of strings in EUC-JP family encodings.
Byte range of single-byte characters: 0x00..0x7F
.
First and second byte range of double-byte characters: 0xA1..0xFE
.
Legal byte range for two bytes preceded by SS3: 0xA1..0xFE
.
Character order (invalid code points are excluded): 0x00..0x7F
, 0x8EA1..0x8EFE
, 0xA1A1..0xFEFE
, 0x8FA1A1..0x8FFEFE
.
EUC-JP EUC-JISX0213
1. 0x00..0x7F C0/G0 ASCII ASCII
2. 0x8EA1..0x8EFE G2 JIS kana JIS kana
3. 0xA1A1..0xFEFE G1 JIS X 0208 JIS X 0213 plane-1
4. 0x8FA1A1..0x8FFEFE G3 JIS X 0212 JIS X 0213 plane-2
CAVEAT
C1 controls other than SS2/SS3 (0x80..0x8D
and 0x90..0x9E
) are not supported.
Unassigned G2 (0x8EE0..0x8EFE
) are now included.