<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<HTML><HEAD><TITLE>Man page of WWW::RobotRules::AnyDBM_File</TITLE>
</HEAD><BODY>
<H1>WWW::RobotRules::AnyDBM_File</H1>
Section: User Contributed Perl Documentation (3pm)<BR>Updated: 2018-04-14<BR><A HREF="#index">Index</A>
<A HREF="/cgi-bin/man/man2html">Return to Main Contents</A><HR>
<A NAME="lbAB">&nbsp;</A>
<H2>NAME</H2>
WWW::RobotRules::AnyDBM_File - Persistent RobotRules
<A NAME="lbAC">&nbsp;</A>
<H2>SYNOPSIS</H2>
<PRE>
require WWW::RobotRules::AnyDBM_File;
require LWP::RobotUA;
# Create a robot user agent that uses a disk-caching RobotRules
my $rules = WWW::RobotRules::AnyDBM_File-&gt;new( 'my-robot/1.0', 'cachefile' );
my $ua = LWP::RobotUA-&gt;new( 'my-robot/1.0', 'me@foo.com', $rules );
# Then just use $ua as usual ($req is an HTTP::Request built elsewhere)
my $res = $ua-&gt;request($req);
</PRE>
<A NAME="lbAD">&nbsp;</A>
<H2>DESCRIPTION</H2>
This is a subclass of <I>WWW::RobotRules</I> that uses the AnyDBM_File
package to implement persistent disk caching of <I>robots.txt</I> and host
visit information.
<P>
The constructor (the <I>new()</I> method) takes an extra argument specifying
the name of the <FONT SIZE="-1">DBM</FONT> file to use. If the <FONT SIZE="-1">DBM</FONT> file already exists, you
can pass undef as the agent name, because the name is then read from
the <FONT SIZE="-1">DBM</FONT> database.
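<P>
The reopening behaviour described above can be sketched as follows. This
is a minimal illustration, not part of the original manual page: the cache
path, the host name and the <I>robots.txt</I> text are made-up values, and
the <I>parse()</I>, <I>agent()</I> and <I>allowed()</I> calls are assumed
to behave as documented for <I>WWW::RobotRules</I>.
<PRE>
use strict;
use warnings;
use File::Temp qw(tempdir);
use WWW::RobotRules::AnyDBM_File;

# Hypothetical cache location inside a throwaway directory
my $cache = tempdir( CLEANUP =&gt; 1 ) . '/robot-cache';

{
    # First run: name the agent and record rules for one host
    my $first = WWW::RobotRules::AnyDBM_File-&gt;new( 'my-robot/1.0', $cache );
    $first-&gt;parse( 'http://www.example.com/robots.txt',
                   "User-agent: *\nDisallow: /private/\n" );
}   # $first goes out of scope here, which closes the DBM file

# Later run: undef as agent name, so the name comes from the DBM file
my $rules = WWW::RobotRules::AnyDBM_File-&gt;new( undef, $cache );
print "agent: ", $rules-&gt;agent, "\n";
for my $url ( 'http://www.example.com/index.html',
              'http://www.example.com/private/page' ) {
    print "$url: ", ( $rules-&gt;allowed($url) ? "allowed" : "disallowed" ), "\n";
}
</PRE>
Letting the first object go out of scope before reopening the file is a
deliberate choice in this sketch, so that the data is written to disk
before the second constructor call reads it.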
<A NAME="lbAE">&nbsp;</A>
<H2>SEE ALSO</H2>
WWW::RobotRules, LWP::RobotUA
<A NAME="lbAF">&nbsp;</A>
<H2>AUTHORS</H2>
Hakan Ardo &lt;<A HREF="mailto:hakan@munin.ub2.lu.se">hakan@munin.ub2.lu.se</A>&gt;, Gisle Aas &lt;<A HREF="mailto:aas@sn.no">aas@sn.no</A>&gt;
<P>
<HR>
<A NAME="index">&nbsp;</A><H2>Index</H2>
<DL>
<DT id="1"><A HREF="#lbAB">NAME</A><DD>
<DT id="2"><A HREF="#lbAC">SYNOPSIS</A><DD>
<DT id="3"><A HREF="#lbAD">DESCRIPTION</A><DD>
<DT id="4"><A HREF="#lbAE">SEE ALSO</A><DD>
<DT id="5"><A HREF="#lbAF">AUTHORS</A><DD>
</DL>
<HR>
This document was created by
<A HREF="/cgi-bin/man/man2html">man2html</A>,
using the manual pages.<BR>
Time: 00:06:00 GMT, March 31, 2021
</BODY>
</HTML>