NAME
Net::IDN::Encode - Internationalizing Domain Names in Applications (RFC 3490)
SYNOPSIS
use Net::IDN::Encode ':all';
my $a = domain_to_ascii("müller.example.org");
my $e = email_to_ascii("POSTMASTER@例。テスト");
my $u = domain_to_unicode('EXAMPLE.XN--11B5BS3A9AJ6G');
DESCRIPTION
This module provides an easy-to-use interface for encoding and decoding Internationalized Domain Names (IDNs).
IDNs use characters drawn from a large repertoire (Unicode), but IDNA allows the non-ASCII characters to be represented using only the ASCII characters already allowed in so-called host names today (letter-digit-hypen, /[A-Z0-9-]/i
).
FUNCTIONS
By default, this module does not export any subroutines. You may use the :all
tag to import everything. You can also use regular expressions such as /^to_/
or /^email_/
to select some of the functions, see Exporter for details.
The following functions are available:
- to_ascii( $label [, 'UseSTD3ASCIIRules' => 1 ] )
-
Converts a single label
$label
to ASCII. Will throw an exception on invalid input.This function takes the following parameter:
- UseSTD3ASCIIRules
-
(boolean) If set to a true value, checks the label for compliance with STD 3 (RFC 1123) syntax for host name parts.
This function does not try to handle strings that consist of multiple lables (such as domain names).
- to_unicode( $label )
-
Converts a single label
$label
to Unicode. Will throw an exception on invalid input.This function does not try to handle strings that consist of multiple lables (such as domain names).
- domain_to_unicode( $domain )
-
Converts all labels of the hostname
$domain
(with labels seperated by dots) to Unicode. Will throw an exception on invalid input. - domain_to_ascii( $domain )
-
Converts all labels of the hostname
$domain
(with labels seperated by dots) to ASCII. Will throw an exception on invalid input.The following characters are recognized as dots: U+002E (full stop), U+3002 (ideographic full stop), U+FF0E (fullwidth full stop), U+FF61 (halfwidth ideographic full stop).
- domain_to_unicode( $domain )
-
Converts all labels of the hostname
$domain
(with labels seperated by dots) to Unicode. Will throw an exception on invalid input.The following characters are recognized as dots: U+002E (full stop), U+3002 (ideographic full stop), U+FF0E (fullwidth full stop), U+FF61 (halfwidth ideographic full stop).
- email_to_ascii( $email )
-
Converts the domain part (right hand side, separated by an at sign) of the RFC 2821/2822 email address to ASCII. May throw an exception on invalid input.
This function currently does not handle internationalization of the local-part (left hand side). This may change in future versions.
The follwing characters are recognized as at signs: U+0040 (commercial at), U+FF20 (fullwidth commercial at).
- email_to_unicode( $email )
-
Converts the domain part (right hand side, separated by an at sign) of the RFC 2821/2822 email address to Unicode. May throw an exception on invalid input.
This function currently does not handle internationalization of the local-part (left hand side). This may change in future versions.
The follwing characters are recognized as at signs: U+0040 (commercial at), U+FF20 (fullwidth commercial at).
AUTHOR
Claus Färber <CFAERBER@cpan.org>
LICENSE
Copyright 2007-2010 Claus Färber.
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
SEE ALSO
Net::IDN::Nameprep, Net::IDN::Punycode, RFC 3490 (http://www.ietf.org/rfc/rfc3490.txt)