NAME
Catmandu::Importer::HTML - An HTML importer
SYNOPSIS
# From the command line
$ catmandu convert HTML to YAML < ex/test.html
# From Perl
use Catmandu;
my $importer = Catmandu->importer('HTML', file => 'ex/test.html');
my $n = $importer->each(sub {
    my $hashref = $_[0];
    # ...
});
DESCRIPTION
This is a Catmandu::Importer for converting HTML data using the HTML::TokeParser parser.
CONFIGURATION
- file
Read input from a local file given by its path. Alternatively, a scalar reference can be passed to read from a string.
- fh
Read input from an IO::Handle. If not specified, Catmandu::Util::io is used to create the input stream from the file argument or by using STDIN.
- encoding
Binmode of the input stream fh. Set to :utf8 by default.
- fix
An ARRAY of one or more fixes or file scripts to be applied to imported items.
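The options above can be combined in a single call. The following is a minimal sketch, not a reference implementation: the HTML string and the field name "source" are made up for illustration, and the layout of the imported records is deliberately not inspected here.

```perl
use strict;
use warnings;
use Catmandu;

# Hypothetical HTML snippet, used only for illustration
my $html = '<html><body><p>Hello</p></body></html>';

my $importer = Catmandu->importer(
    'HTML',
    file     => \$html,     # scalar reference: read from a string, not a path
    encoding => ':utf8',    # the documented default, spelled out
    # one inline fix; "source" is an illustrative field name
    fix      => ['add_field(source, inline)'],
);

# Collect all imported items; each one is a hash reference
my $records = $importer->to_array;
```

Passing the fix at construction time means every imported item has already been through it by the time the calling code sees the item.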
METHODS
Every Catmandu::Importer is a Catmandu::Iterable, so all of its methods are inherited.
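As a rough illustration of the inherited interface, the sketch below chains a few Catmandu::Iterable methods (map, take, each) in a single pass over the input. The HTML string and the "seen" field are assumptions made for the example, not part of this module.

```perl
use strict;
use warnings;
use Catmandu;

# Hypothetical input, used only for illustration
my $html = '<html><body><p>Hello</p></body></html>';

my $importer = Catmandu->importer('HTML', file => \$html);

my $n = $importer
    ->map(sub {
        my $rec = $_[0];
        return { %$rec, seen => 1 };    # copy the item and add a marker field
    })
    ->take(5)                           # stop after at most five items
    ->each(sub {
        my $rec = $_[0];
        # ... process $rec here
    });

# $n holds the number of items that were processed
```

The chain is written as one pass on purpose: an importer reads from an input stream, so it is safest not to assume that the same importer object can be iterated a second time.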