Man page - getpdftext(1)
Packages contains this manual
- deillustrate(1)
- getpdfpageobject(1)
- listfonts(1)
- extractjpgs(1)
- getpdftext(1)
- fillpdffields(1)
- pdfinfo.cam-pdf(1)
- renderpdf(1)
- readpdf(1)
- setpdfbackground(1)
- crunchjpgs(1)
- listpdffields(1)
- changepagestring(1)
- extractallimages(1)
- appendpdf(1)
- stamppdf(1)
- deletepdfpage(1)
- rewritepdf(1)
- getpdfpage(1)
- revertpdf(1)
- changerefkeys(1)
- listimages(1)
- uninlinepdfimages(1)
- getpdffontobject(1)
- changepdfstring(1)
- setpdfpage(1)
- replacepdfobj(1)
apt-get install libcam-pdf-perl
Manual
GETPDFTEXT
NAMESYNOPSIS
DESCRIPTION
SEE ALSO
AUTHOR
NAME
getpdftext - Extracts and print the text from one or more PDF pages
SYNOPSIS
getpdftext
[options] infile.pdf [<pagenums>]
Options:
-c --check just validates the page instead of printing it
-g --geometry just computes geometry, prints nothing
-v --verbose print diagnostic messages
-h --help verbose help message
-V --version print CAM::PDF version
<pagenums> is a comma-separated list of page numbers.
Ranges like '2-6' allowed in the list
Example: 4-6,2,12,8-9
DESCRIPTION
Extracts all of the text from the specified PDF page(s) and prints them to STDOUT. If no pages are specified, all pages are processed.
The "--check" and "--geometry" modes are distinctly different. They are used primarily for debugging.
SEE ALSO
CAM::PDF
renderpdf
AUTHOR
See CAM::PDF