The London Perl and Raku Workshop takes place on 26th Oct 2024. If your company depends on Perl, please consider sponsoring and/or attending.

NAME

getpdftext.pl - Extracts and print the text from one or more PDF pages

SYNOPSIS

getpdftext.pl [options] infile.pdf [<pagenums>]

Options:
  -c --check          just validates the page instead of printing it
  -g --geometry       just computes geometry, prints nothing
  -v --verbose        print diagnostic messages
  -h --help           verbose help message
  -V --version        print CAM::PDF version

<pagenums> is a comma-separated list of page numbers.
     Ranges like '2-6' allowed in the list
     Example: 4-6,2,12,8-9

DESCRIPTION

Extracts all of the text from the specified PDF page(s) and prints them to STDOUT. If no pages are specified, all pages are processed.

The --check and --geometry modes are distinctly different. They are used primarily for debugging.

SEE ALSO

CAM::PDF

renderpdf.pl

AUTHOR

Clotho Advanced Media Inc., cpan@clotho.com