home
***
CD-ROM
|
disk
|
FTP
|
other
***
search
/
Dream 52
/
Amiga_Dream_52.iso
/
Linux
/
Magazine
/
wwwoffle-2.1.tar.gz
/
wwwoffle-2.1
/
README.CONF
< prev
next >
Wrap
Text File
|
1998-02-19
|
15KB
|
356 lines
WWWOFFLE - World Wide Web Offline Explorer - Version 2.1
========================================================
If you are upgrading from version 1.x to version 2.x then you should read the
file CHANGES.CONF which explains how to convert the sections in your existing
wwwoffle.conf file to the new format.
If you are upgrading from version 2.0[abc] to version 2.1 then the file
CHANGES.CONF shows the new options.
The configuration file (wwwoffle.conf) specifies all of the parameters that
control the operation of the proxy server. The file is split into sections each
containing a series of parameters as described below.
The sections are delimited in the file by having the section name alone on a
line, a line containing a single '{', the parameters in the section and a line
containing a single '}'. Comments are marked by a '#' at the start of the line.
StartUp
-------
This contains the parameters that are used when the program starts, changes to
these are ignored if the configuration file is re-read while the program is
running.
http-port = <port> ; An integer specifying the port for the
HTTP proxy (default=8080).
wwwoffle-port = <port> ; An integer specifying the port for
wwwoffle control connections
(default=8081).
spool-dir = <dir> ; The name of the spool directory
(default=/var/spool/wwwoffle).
run-uid = <user> | <uid> ; The username or numeric uid to run the
wwwoffled server as (default=none).
run-gid = <group> | <gid> ; The groupname or numeric gid to run the
wwwoffled server as (default=none).
use-syslog = yes | no ; Whether to use the syslog facility for
messages (default=yes).
password = <word> ; The password used for authentication of
the control message (default=none).
max-servers = <integer> ; The maximum number of server processes
that are started (default=8).
max-fetch-servers = <integer> ; The maximum number of server processes
that are started to fetch pages that
were marked in offline mode (default=4).
Notes: For the password to work the configuration file must be set so that only
authorised users can read it.
: To use the run-uid/run-gid options, the server must be start as root.
: The max-fetch-servers value must be less than max-servers or you will
not be able to use wwwoffle interactively online while fetching.
Options
-------
Options that control how the program works.
log-level = debug | info | important | warning | fatal
; Log messages with this or higher priority
(default=important).
fetch-images = yes | no ; Whether to fetch the images that are contained in
pages that are requested while offline and
downloaded later (default=no).
fetch-frames = yes | no ; Whether to fetch the frames that are contained in
pages that are requested while offline and
downloaded later (default=no).
index-latest-days = <age> ; The number of days to display in the index of the
latest pages (default=7 days).
add-info-refresh = yes | no ; At the bottom of all of the spooled pages the
date that the page was cached and a refresh
button is to be added (default=no).
request-changed = <time> ; While online pages will only be fetched if the
cached version is older than this specified time
in seconds (default=600).
pragma-no-cache = yes | no ; Whether to request a new copy of a page if the
request has 'Pragma: no-cache' (default=yes).
offline-requests = yes | no ; Whether to record requests that are made while
offline or to return an error (default=yes).
monitor-interval = <age> ; The interval in days between monitoring of the
specified URLs (default=7 days).
Notes: The request-changed option can be set negative to indicate that cached
pages are always used while online.
: The pragma-no-cache option should be set to 'no' if when browsing
offline all pages are re-requested by a 'broken' browser.
: The monitor-interval option when set to '0' means to check each time
wwwoffle is online, '1' means once per day, etc.
LocalHost
---------
A list of hosts that the host running the wwwoffled server may be known by.
This is so that the proxy does not need to contact itself to get the server
local pages.
<host> ; A hostname or IP address that in connection with the port number (in
the StartUp section) specifies the wwwoffle proxy HTTP server.
Notes: All of these hosts are also used the same way as those in the
LocalNet and AllowedConnect sections.
: The first named host is used as the server name for several features so
should be a name that will work from any client host if on a network.
: None of the entries here or in LocalNet are fetched via a proxy.
LocalNet
--------
A list of hosts that are not to be cached by wwwoffled because they are on a
local network.
<host> ; A hostname or IP address that is not to be cached by the server.
Notes: The host name matches from the right so a domain name matches all hosts
in the domain, IP addresses match from the left.
: All entries here are assumed to be reachable even when offline.
: All of the hosts in LocalHost are also not cached.
: None of the entries here or in LocalHost are fetched via a proxy.
AllowedConnect
--------------
A list of client hosts that are allowed to connect to the server.
<host> ; A hostname or IP address that is allowed to connect to the server.
Notes: The host name matches from the right so a domain name matches all hosts
in the domain, IP addresses match from the left.
: All of the hosts in LocalHost are also allowed to connect.
DontCache
---------
A list of servers and files on them that are not to be cached by wwwoffled.
default = ... ; When getting any URLs ...
HOST-SPECIFICATION = ... ; When getting URLs that match this ...
... = FILE-SPECIFICATION ; ... don't cache any files that match this.
Notes: See the bottom of this file for the description of HOST-SPECIFICATION
: See the bottom of this file for the description of FILE-SPECIFICATION
: The files will still be cached if fetched non-interactively.
DontGet
-------
A list of servers and files on them that are not to be got by wwwoffled (because
they contain only junk adverts for example).
default = ... ; When getting any URLs ...
HOST-SPECIFICATION = ... ; When getting URLs that match this ...
... = FILE-SPECIFICATION ; ... don't get any files that match this.
Notes: See the bottom of this file for the description of HOST-SPECIFICATION
: See the bottom of this file for the description of FILE-SPECIFICATION
DontGetRecursive
----------------
A list of servers and files on them that are not to be got by wwwoffled when
fetching recursively.
default = ... ; When recursively getting any URLs ...
HOST-SPECIFICATION = ... ; When recursively getting URLs that match
this ...
... = FILE-SPECIFICATION ; ... don't get any files that match this.
Notes: See the bottom of this file for the description of HOST-SPECIFICATION
: See the bottom of this file for the description of FILE-SPECIFICATION
CensorHeader
------------
A list of HTTP header lines that are to be removed from the requests sent to web
servers.
<header> ; A header field name, e.g. From, Cookie, User-Agent.
Notes: The header is case sensitive, and does not have a ':' at the end.
FTPOptions
----------
Options to use when fetching files using ftp.
anon-username = <string> ; The username to use for anonymous ftp
(default=anonymous).
anon-password = <string> ; The password to use for anonymous ftp
(default=<user>@<host>, determined at run time).
auth-hostname = <host[:port]> ; A host to use a different username and password.
auth-username = <string> ; The username to use on the above host.
auth-password = <string> ; The password to use on the above host.
Notes: The anon-password should be set to a sensible value especially if you
are behind a firewall.
: The auth-hostname, auth-username and auth-password options must come
together as a triplet.
: The auth-hostname must be exact, it is not used as a wildcard match.
MIMETypes
---------
MIME Types to use when fetching files not using HTTP.
default = <mime-type>/<subtype> ; The default MIME type
(default=text/plain).
.<file-ext> = <mime-type>/<subtype> ; The MIME type to associate with a file
extension.
Notes: You must include the '.' in the file extension.
: If more than one of the extensions match then the longest is used.
Proxy
-----
This contains the names of the HTTP (or other) proxies to use external to the
local machine.
default = <host[:port]> ; The hostname and port on it to use as the
default proxy.
HOST-SPECIFICATION = <host[:port]> ; The hostname and port on it to use as the
proxy when getting URLs that match the
HOST-SPECIFICATION.
auth-hostname = <host[:port]> ; A proxy server that uses proxy authentication.
auth-username = <string> ; The username to use on the above host.
auth-password = <string> ; The password to use on the above host.
Notes: See the bottom of this file for the description of HOST-SPECIFICATION
: A hostname that matches more than one entry here uses the proxy of the
longest matching one (protocol is included in assessing length).
: You can use none or no hostname to indicate that a default or particular
protocol or host is not to use a proxy.
: None of the hosts in LocalNet/LocalHost will be fetched via a proxy.
: The auth-hostname, auth-username and auth-password options must come
together as a triplet.
: The auth-hostname must be exact, it is not used as a wildcard match.
Mirror
------
A list of servers that have mirrors to be used in preference or to resolve
conflicts over multiple names for the same server.
<proto>/<host> = <proto>/<host> ; The first named protocol and host is to be
replaced by the second named protocol and
host.
<host> = <host> ; The first named host is to be replaced by the
second named host for all protocols.
Notes: Symbolic links in the spool directory also work, but they are only
checked when wwwoffled is started or 'wwwoffle -config' is run.
: The host names must match exactly, no wildcards.
Purge
-----
The method to determine which pages to purge, the default age the host specific
maximum age of the pages in days, and the maximum cache size.
use-mtime = yes | no ; The method to use to decide which files to
purge, last access time (atime) or last
modification time (mtime) (default=no).
max-size = <size> ; The maximum size for the cache in MB
(default=0).
default = <age> ; The default maximum age of pages in days
(default=28).
HOST-SPECIFICATION = <age> ; The maximum age of pages that match the
HOST-SPECIFICATION
Notes: See the bottom of this file for the description of HOST-SPECIFICATION
: A hostname that matches more than one entry here uses the age of the
longest matching one (protocol is included in assessing length).
: An age of zero means not to keep, negative not to delete.
: A maximum cache size of 0 means there is no limit to the size.
: When there is a non-zero maximum cache size it is measured excluding all
hosts with a negative maximum age (never purged hosts).
--------------------------------------------------------------------------------
HOST-SPECIFICATION
------------------
When specifying a host and protocol in many of the sections a HOST-SPECIFICATION
can be used, this is a way of recognising a URL.
For the purposes of this explanation a URL is considered to be made up of three
parts.
proto The protocol that is used (e.g. http, ftp)
host The server hostname (e.g. www.gedanken.demon.co.uk) or a domain
name (e.g. demon.co.uk) or an IP address (e.g. 1.2.3.4) or a
subnet (e.g. 1.2.3).
port The port number on the host (e.g. default of 80 for HTTP).
For example the wwwoffle homepage: http://www.gedanken.demon.co.uk/wwwoffle/
The protocol is 'http', the host is 'www.gedanken.demon.co.uk', the port is the
default (in this case 80).
A HOST-SPECIFICATION can be any one of the following:
/ Any protocol, Any host, Any port
[/]<host> Any protocol, Named host, Default port
[/]<host>:<port> Any protocol, Named host, Specified port
[/]<host>: Any protocol, Named host, Any port
<proto>/ Named protocol, Any host, Any port
<proto>/<host> Named protocol, Named host, Default port
<proto>/<host>:<port> Named protocol, Named host, Specified port
<proto>/<host>: Named protocol, Named host, Any port
Where [] indicates an optional feature, and <> indicate a user supplied name or
number.
FILE-SPECIFICATION
------------------
Several of the sections allow a FILE-SPECIFICATION to be entered, this is a way
of recognising the path part of a URL.
A FILE-SPECIFICATION can be either of the following:
/<path> Any file with a path that start with the specified one.
.<file-ext> Any file that ends with the specified file extension.
Where <> indicate a user supplied name or number.
Note: The '/' or '.' at the start is not optional.