man-pages/man1/pdftohtml.1.html
2021-03-31 01:06:50 +01:00

197 lines
4.4 KiB
HTML

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<HTML><HEAD><TITLE>Man page of PDFTOHTML</TITLE>
</HEAD><BODY>
<H1>PDFTOHTML</H1>
Section: User Commands (1)<BR><A HREF="#index">Index</A>
<A HREF="/cgi-bin/man/man2html">Return to Main Contents</A><HR>
<A NAME="lbAB">&nbsp;</A>
<H2>NAME</H2>
pdftohtml - program to convert PDF files into HTML, XML and PNG images
<A NAME="lbAC">&nbsp;</A>
<H2>SYNOPSIS</H2>
<B>pdftohtml</B>
<I>[options] &lt;PDF-file&gt; [&lt;HTML-file&gt; &lt;XML-file&gt;]</I>
<A NAME="lbAD">&nbsp;</A>
<H2>DESCRIPTION</H2>
This manual page documents briefly the
<B>pdftohtml</B>
command.
This manual page was written for the Debian GNU/Linux distribution
because the original program does not have a manual page.
<P>
<B>pdftohtml</B>
is a program that converts PDF documents into HTML. It generates its output in
the current working directory.
<A NAME="lbAE">&nbsp;</A>
<H2>OPTIONS</H2>
A summary of options are included below.
<DL COMPACT>
<DT id="1"><B>-h, -help</B>
<DD>
Show summary of options.
<DT id="2"><B>-f &lt;int&gt;</B>
<DD>
first page to print
<DT id="3"><B>-l &lt;int&gt;</B>
<DD>
last page to print
<DT id="4"><B>-q</B>
<DD>
do not print any messages or errors
<DT id="5"><B>-v</B>
<DD>
print copyright and version info
<DT id="6"><B>-p</B>
<DD>
exchange .pdf links with .html
<DT id="7"><B>-c</B>
<DD>
generate complex output
<DT id="8"><B>-s</B>
<DD>
generate single HTML that includes all pages
<DT id="9"><B>-dataurls</B>
<DD>
use data URLs instead of external images in HTML. No available in all platforms
<DT id="10"><B>-i</B>
<DD>
ignore images
<DT id="11"><B>-noframes</B>
<DD>
generate no frames. Not supported in complex output mode.
<DT id="12"><B>-stdout</B>
<DD>
use standard output
<DT id="13"><B>-zoom &lt;fp&gt;</B>
<DD>
zoom the PDF document (default 1.5)
<DT id="14"><B>-xml</B>
<DD>
output for XML post-processing
<DT id="15"><B>-noRoundedCoordinates</B>
<DD>
do not round coordinates (with XML output only)
<DT id="16"><B>-enc &lt;string&gt;</B>
<DD>
output text encoding name
<DT id="17"><B>-opw &lt;string&gt;</B>
<DD>
owner password (for encrypted files)
<DT id="18"><B>-upw &lt;string&gt;</B>
<DD>
user password (for encrypted files)
<DT id="19"><B>-hidden</B>
<DD>
force hidden text extraction
<DT id="20"><B>-fmt</B>
<DD>
image file format for Splash output (png or jpg).
If complex is selected, but -fmt is not specified,
-fmt png will be assumed
<DT id="21"><B>-nomerge</B>
<DD>
do not merge paragraphs
<DT id="22"><B>-nodrm</B>
<DD>
override document DRM settings
<DT id="23"><B>-wbt &lt;fp&gt;</B>
<DD>
adjust the word break threshold percent. Default is 10.
Word break occurs when distance between two adjacent characters is
greater than this percent of character height.
<DT id="24"><B>-fontfullname</B>
<DD>
outputs the font name without any substitutions.
<P>
</DL>
<A NAME="lbAF">&nbsp;</A>
<H2>AUTHOR</H2>
<P>
Pdftohtml was developed by Gueorgui Ovtcharov and Rainer Dorsch. It is
based and benefits a lot from Derek Noonburg's xpdf package.
<P>
This manual page was written by Søren Boll Overgaard &lt;<A HREF="mailto:boll@debian.org">boll@debian.org</A>&gt;,
for the Debian GNU/Linux system (but may be used by others).
<A NAME="lbAG">&nbsp;</A>
<H2>SEE ALSO</H2>
<B><A HREF="/cgi-bin/man/man2html?1+pdfdetach">pdfdetach</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdffonts">pdffonts</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdfimages">pdfimages</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdfinfo">pdfinfo</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdftocairo">pdftocairo</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdftoppm">pdftoppm</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdftops">pdftops</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdftotext">pdftotext</A></B>(1)
<B><A HREF="/cgi-bin/man/man2html?1+pdfseparate">pdfseparate</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdfsig">pdfsig</A></B>(1),
<B><A HREF="/cgi-bin/man/man2html?1+pdfunite">pdfunite</A></B>(1)
<P>
<HR>
<A NAME="index">&nbsp;</A><H2>Index</H2>
<DL>
<DT id="25"><A HREF="#lbAB">NAME</A><DD>
<DT id="26"><A HREF="#lbAC">SYNOPSIS</A><DD>
<DT id="27"><A HREF="#lbAD">DESCRIPTION</A><DD>
<DT id="28"><A HREF="#lbAE">OPTIONS</A><DD>
<DT id="29"><A HREF="#lbAF">AUTHOR</A><DD>
<DT id="30"><A HREF="#lbAG">SEE ALSO</A><DD>
</DL>
<HR>
This document was created by
<A HREF="/cgi-bin/man/man2html">man2html</A>,
using the manual pages.<BR>
Time: 00:05:22 GMT, March 31, 2021
</BODY>
</HTML>