NAME

SWISH::Filters::Pdf2HTML - Perl extension for filtering PDF documents with Swish-e

DESCRIPTION

This is a plug-in module that uses the xpdf package to convert PDF documents to html for indexing by Swish-e. Any info tags found in the PDF document are created as meta tags.

This filter plug-in requires the xpdf package available at:

http://www.foolabs.com/xpdf/

You may pass into SWISH::Filter's new method a tag to use as the html <title> if found in the PDF info tags:

my %user_data;
$user_data{pdf}{title_tag} = 'title';

$was_filtered = $filter->filter(
    document  => $filename,
    user_data => \%user_data,
);

Then if a PDF info tag of "title" is found that will be used as the HTML <title>. If no tag is passed, title will be used as the default tag.

AUTHOR

Bill Moseley

SEE ALSO

SWISH::Filter