NAME

Catmandu::Importer::PDFPages - Catmandu importer to extract text data per page from one pdf

SYNOPSIS

# From the command line

# Export pdf pages with their text and coördinates

$ catmandu convert PDFPages --file input.pdf to YAML

#In a script

use Catmandu::Sane;

use Catmandu::Importer::PDFPages;

my $importer = Catmandu::Importer::PDFPages->new( file => "/tmp/input.pdf" );

$importer->each(sub{

    my $page = $_[0];
    #..

});

EXAMPLE OUTPUT IN YAML

- label: Cover Page
  height: 878
  width: 595
  text: "Hello world"

AUTHORS

Nicolas Franck <nicolas.franck at ugent.be>

SEE ALSO

Catmandu, Catmandu::Importer , Poppler