<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<HTML><HEAD><TITLE>Man page of PDFTOHTML</TITLE>
</HEAD><BODY>
<H1>PDFTOHTML</H1>
Section: User Commands (1)<BR><A HREF="#index">Index</A>
<A HREF="/cgi-bin/man/man2html">Return to Main Contents</A><HR>
<A NAME="lbAB"> </A>
<H2>NAME</H2>
pdftohtml - program to convert PDF files into HTML, XML and PNG images
<A NAME="lbAC"> </A>
<H2>SYNOPSIS</H2>
<B>pdftohtml</B>
<I>[options] &lt;PDF-file&gt; [&lt;HTML-file&gt; &lt;XML-file&gt;]</I>
<A NAME="lbAD"> </A>
<H2>DESCRIPTION</H2>
This manual page briefly documents the
<B>pdftohtml</B>
command.
This manual page was written for the Debian GNU/Linux distribution
because the original program does not have a manual page.
<P>
<B>pdftohtml</B>
is a program that converts PDF documents into HTML. It generates its output in
the current working directory.
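<P>
As an illustration only, the following Python sketch assembles a command line in the order given in the SYNOPSIS (options first, then the PDF file, then an optional output name). It uses only flags listed under OPTIONS. The helper names <B>build_pdftohtml_args</B> and <B>convert</B>, and the file names, are hypothetical and not part of pdftohtml itself.

```python
import subprocess
from typing import List, Optional


def build_pdftohtml_args(pdf_file: str,
                         output_base: Optional[str] = None,
                         first_page: Optional[int] = None,
                         last_page: Optional[int] = None,
                         xml: bool = False,
                         single_file: bool = False) -> List[str]:
    """Assemble an argument list: options, then the PDF file, then the output name.

    Hypothetical helper; only flags documented in this manual page are used.
    """
    args = ["pdftohtml"]
    if first_page is not None:
        args += ["-f", str(first_page)]   # first page
    if last_page is not None:
        args += ["-l", str(last_page)]    # last page
    if xml:
        args.append("-xml")               # output for XML post-processing
    if single_file:
        args.append("-s")                 # single HTML that includes all pages
    args.append(pdf_file)
    if output_base is not None:
        args.append(output_base)
    return args


def convert(pdf_file: str, **options) -> None:
    """Run pdftohtml; assumes the program is installed and on PATH."""
    subprocess.run(build_pdftohtml_args(pdf_file, **options), check=True)


if __name__ == "__main__":
    # Only prints the command line; nothing is executed here.
    print(build_pdftohtml_args("report.pdf", "report",
                               first_page=2, last_page=5, xml=True))
```

Building the arguments as a list rather than a single string avoids shell quoting problems when file names contain spaces.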
<A NAME="lbAE"> </A>
<H2>OPTIONS</H2>
A summary of options is included below.
<DL COMPACT>
<DT id="1"><B>-h, -help</B>
<DD>
Show summary of options.
<DT id="2"><B>-f &lt;int&gt;</B>
<DD>
first page to convert
<DT id="3"><B>-l &lt;int&gt;</B>
<DD>
last page to convert
<DT id="4"><B>-q</B>
<DD>
do not print any messages or errors
<DT id="5"><B>-v</B>
<DD>
print copyright and version info
<DT id="6"><B>-p</B>
<DD>
exchange .pdf links with .html
<DT id="7"><B>-c</B>
<DD>
generate complex output
<DT id="8"><B>-s</B>
<DD>
generate single HTML that includes all pages
<DT id="9"><B>-dataurls</B>
<DD>
use data URLs instead of external images in HTML. Not available on all platforms.
<DT id="10"><B>-i</B>
<DD>
ignore images
<DT id="11"><B>-noframes</B>
<DD>
generate no frames. Not supported in complex output mode.
<DT id="12"><B>-stdout</B>
<DD>
use standard output
<DT id="13"><B>-zoom &lt;fp&gt;</B>
<DD>
zoom the PDF document (default 1.5)
<DT id="14"><B>-xml</B>
<DD>
output for XML post-processing
<DT id="15"><B>-noRoundedCoordinates</B>
<DD>
do not round coordinates (with XML output only)
<DT id="16"><B>-enc &lt;string&gt;</B>
<DD>
output text encoding name
<DT id="17"><B>-opw &lt;string&gt;</B>
<DD>
owner password (for encrypted files)
<DT id="18"><B>-upw &lt;string&gt;</B>
<DD>
user password (for encrypted files)
<DT id="19"><B>-hidden</B>
<DD>
force hidden text extraction
<DT id="20"><B>-fmt</B>
<DD>
image file format for Splash output (png or jpg).
If complex output is selected but -fmt is not specified,
-fmt png is assumed.
<DT id="21"><B>-nomerge</B>
<DD>
do not merge paragraphs
<DT id="22"><B>-nodrm</B>
<DD>
override document DRM settings
<DT id="23"><B>-wbt &lt;fp&gt;</B>
<DD>
adjust the word break threshold percentage. The default is 10.
A word break occurs when the distance between two adjacent characters is
greater than this percentage of the character height.
<DT id="24"><B>-fontfullname</B>
<DD>
output the font name without any substitutions.
<P>
</DL>
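<P>
The word break rule described for <B>-wbt</B> can be written as a one-line comparison. The Python sketch below only restates that rule as documented above; it is not taken from the pdftohtml source, and the function name <B>is_word_break</B> and the sample measurements are hypothetical.

```python
def is_word_break(gap: float, char_height: float, wbt_percent: float = 10.0) -> bool:
    """Return True when the gap between two adjacent characters is greater
    than wbt_percent percent of the character height.

    The default of 10.0 mirrors the documented -wbt default.
    """
    threshold = char_height * wbt_percent / 100.0
    return gap > threshold


if __name__ == "__main__":
    # With 12-unit-high characters and the default threshold of 10 percent,
    # the cut-off is 1.2 units.
    print(is_word_break(1.5, 12.0))        # gap above the cut-off
    print(is_word_break(1.0, 12.0))        # gap below the cut-off
    print(is_word_break(1.5, 12.0, 20.0))  # a higher -wbt value raises the cut-off to 2.4
```

Under this reading, raising the <B>-wbt</B> value makes word breaks less frequent, because a larger gap is needed before two characters are treated as belonging to different words.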
<A NAME="lbAF"> </A>
<H2>AUTHOR</H2>
<P>
Pdftohtml was developed by Gueorgui Ovtcharov and Rainer Dorsch. It is
based on, and benefits a lot from, Derek Noonburg's xpdf package.
<P>
This manual page was written by Søren Boll Overgaard &lt;<A HREF="mailto:boll@debian.org">boll@debian.org</A>&gt;,
for the Debian GNU/Linux system (but may be used by others).
<A NAME="lbAG"> </A>
<H2>SEE ALSO</H2>
<B><A HREF="/cgi-bin/man/man2html?1+pdfdetach">pdfdetach</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdffonts">pdffonts</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdfimages">pdfimages</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdfinfo">pdfinfo</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdftocairo">pdftocairo</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdftoppm">pdftoppm</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdftops">pdftops</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdftotext">pdftotext</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdfseparate">pdfseparate</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdfsig">pdfsig</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdfunite">pdfunite</A></B>(1)
<P>
<HR>
<A NAME="index"> </A><H2>Index</H2>
<DL>
<DT id="25"><A HREF="#lbAB">NAME</A><DD>
<DT id="26"><A HREF="#lbAC">SYNOPSIS</A><DD>
<DT id="27"><A HREF="#lbAD">DESCRIPTION</A><DD>
<DT id="28"><A HREF="#lbAE">OPTIONS</A><DD>
<DT id="29"><A HREF="#lbAF">AUTHOR</A><DD>
<DT id="30"><A HREF="#lbAG">SEE ALSO</A><DD>
</DL>
<HR>
This document was created by
<A HREF="/cgi-bin/man/man2html">man2html</A>,
using the manual pages.<BR>
Time: 00:05:22 GMT, March 31, 2021
</BODY>
</HTML>