Man page - dumppdf(1)
Packages contas this manual
apt-get install python3-pdfminer
Manual
| DUMPPDF(1) | PDFMiner Manual | DUMPPDF(1) |
NAME
dumppdf - dumps internal contents of a PDF files
SYNOPSIS
dumppdf [option...] file...
DESCRIPTION
dumppdf dumps the internal contents of a PDF file in pseudo-XML format. This program is primarily for debugging purposes, but it's also possible to extract some meaningful contents
OPTIONS
-a
-i objno[,objno,...]
-p pageno[,pageno,...]
-r, -b, -t
With -r option, the “raw” stream contents are dumped without decompression. With -b option, the decompressed contents are dumped as a binary blob. With -t option, the decompressed contents are dumped in a text format, similar to repr() manner. When -r or -b option is given, no stream header is displayed for the ease of saving it to a file.
-T
-P password
-d
EXAMPLES
Dump all the headers and contents, except stream objects:
$ dumppdf -a test.pdf
Dump the table of contents:
$ dumppdf -T test.pdf
Extract a JPEG image:
$ dumppdf -r -i6 test.pdf > image.jpeg
SEE ALSO
pdf2txt(1)
AUTHORS
Jakub Wilk <jwilk@debian.org>
Yusuke Shinyama <yusuke@cs.nyu.edu>
| 11/22/2025 | dumppdf |