NAME
Badger::Codec::HTML - encode and decode reserved characters in HTML
SYNOPSIS
use Badger::Codec::HTML;
# class methods
my $enc = Badger::Codec::HTML->encode("http://foo.com/bar.html");
my $dec = $codec->decode($enc);
# object methods
my $codec = Badger::Codec::HTML->new();
my $enc = $codec->encode("http://foo.com/bar.html");
my $dec = $codec->decode($enc);
DESCRIPTION
This module implements a subclass of Badger::Codec for encoding and decoding HTML. It is based on code extracted from Lincoln Stein's CGI.pm module.
The encode() method encodes HTML by converting any reserved characters to the correct HTML entities.
The decode() method reverses this process.
METHODS
encode($html, $charset)
Encodes the HTML text passed as the first argument.
$encoded = Badger::Codec::HTML->encode($html);
The optional second argument can be used to indicate the character set in use. If this is set to ISO-8859-1
WINDOWS-1252
then the encoded data will undergo some additional processing in order to work around some known bugs in Microsoft's web browsers. See fix_windows().
decode($html)
Decodes the encoded HTML text passed as the first argument.
$html = Badger::Codec::HTML->decode($encoded);
fix_windows($text)
This method is used internally to repair the damage caused by bugs in certain inferior browsers.
AUTHOR
Andy Wardley http://wardley.org/
COPYRIGHT
Copyright (C) 2005-2009 Andy Wardley. All rights reserved.
ACKNOWLEDGEMENTS
This code is derived from Lincoln D. Stein's CGI module.