The SRE-Filter For GoServe Web Server |
|
Users guide |
SRE-Filter is a HTTP (Web) Server for OS/2. Written in REXX, SRE-FILTER is designed to run under the IBM EWS GoServe program.Narrowing our focus just a bit ...
SRE-Filter is designed to be a relatively full-featured package for non-cutting edge, small-to-medium load sites being maintained by non-professionalsIn fact, SRE-Filter has many obscure, and useful, features. These include:
Bored already? You may find the FAQ more informative, the manual more immediately useful, and the list of documentation files a useful index.
Not bored? You can proceed linearly through this documentation, or you can use the table of contents to jump to the desired section.Please be aware that this guide strikes a balance between brevity and thoroughness. There will be features not mentioned. So repeat after me: the manual, the discussion of initialization paramters, and the list of documentation files are the places to go for more detailed discussion.
Let me add one proviso: if you want to use server side includes, you should carefully read the relevant section of the SRE-Filter manual. They can do a lot for you...
Introduction | you just read it. |
Table of contents | you're reading it now. |
Installation | some installation instructions, and some basic terminology. |
Multiple Hosts | using SRE-Filter to support multiple hosts and IP aliases |
Controlling Access | controlling access to your server's resources |
Resolving requests | redirection and interpretation of the client's request |
Server side includes | creating dynamic HTML documents |
CGI-BIN, Imagemaps, SRE-Filter addons | the wonderful world of server side processing (FORMS and beyond) |
Uploads, HEAD method requests, and other methods | some powerful, but more obscure, WWW tricks |
Auditing, Pre-filter processing, and Post-filter processing | anticipating, and recording, requests. |
Optimization hints | a few suggestions on how to improve SRE-Filter's performance. |
The synopsis of features | an outline of what SRE-Filter can do |
The disclaimer and acknowlegments | the documentation wouldn't be complete without one |
First, have you installed
GoServe? I'll assume that you have.
Second do you have the most
recent copy
of SRE-Filter
(if you have a slow connection, you might need to download SRE-Filter in pieces)?
Assuming that you have, you should create (or clear out) a temporary directory,
and UNZIP
the (possibly several) SRE-Filter distribution files.
In many cases, the connection between the selector and a server resource it refers to is simple (such as when the selector is a filename relative to the GoServe data directory). In other cases, it's completely virtual (as when the resource consists of the output of a program that's run using information provided by the client).
In much the same way that OS/2 extended attributes contain additional information about a file, SRE-Filter's SEL-specific information contain additional information about a selector ; or, to be bit more precise, about the server resource identified by the selector (needless to say, this information is only used by SRE-Filter).
The easiest way to add (and remove) hosts is through the Multiple Hosts section of the simple-mode configurator. You may also want to examine the description of the HOSTS parameter in INITFILT.DOC, and the discussion of hosts in the SRE-Filter manual.
Type of control |
Description | Configuration Information? |
---|---|---|
|
||
LOGON controls |
At the coarsest level, you can require all clients to LOGON. In most
cases, logging on requires that the client provide a username and password, which
are then compared against the SRE-Filter user database. If there is no match,
SRE-Filter will return a simple you are not authorized response.
Alternatively, you can construct a customized response.
Since this logon requirement is a bit of a hassle, you can specify OWNERS and In-house clients; for whom logon requirements are waived. This specification is by IP address, with the possibility of using wildcards to grant access rights access to entire domains. |
The Logon Controls section of the simple-mode configurator
can be used to enable this Logon requirement, to add and remove
users, and to add and remove In-house clients.
Advanced User Notes: The following SRE-Filter parameters may be of interest (see INITFILT.DOC for details). |
|
||
Public areas |
The basic idea is that before logon or access_controls
are checked, SRE-Filter checks if the request selector
points to a PUBLIC area.
If so, no logon or access controls
are attempted.
In a sense, the PUBLIC areas are purposely placed outside of the protection of SRE-Filter's various access controls. |
You can use the Public Areas section of the simple mode configurator
to add and remove public areas. For a more nuanced control,
you probably will want to use the Modify Initializaton Parameters
section of the the intermediate configurator -- look for the sub-section on
the PUBLIC_URLS parameters.
Advanced users note: Significant performance advantages can be realized by using the SREFQUIK variant of SRE-Filter. This requires that you establish as many literal PUBLIC_URLS as possible. The intermediate mode configurator can be used to accomplish this. |
Since many sites will contain a continuum of private and public resources, this coarse level of control will often be inappropriate. SRE-Filter offers two finer methods of access controls: SEL-specific access controls, and the HTACCESS method.
SEL-specific access controls |
Selector-specific access (aka SEL-specific access controls )
are based on a comparison between
the request selector (the request selector is
part of the http request string that the client sends to the server...
... it's the URL minus http://x.y.z/) and a
a list of SEL-specific access controls. This list (which may contain
wildcarded
entries) dictates who can access the various resources (files, etc.)
on your site.
Given a match, SRE-Filter will then compare the list of resource privileges assigned to the selector to the client privileges granted to the client. In general, the client must have at least one client privilege that's also in the resource privilege list (though you can specify more complicated conditions). If no such matching privilege exists, SRE-Filter returns an unauthorized access response. This response can be a generic response, a customized-for-your-site response, or a customized-for-this- selector response. For a more detailed description of SRE-Filter's various access control mechanisms, see the the SRE-Filter FAQ |
The Access Controls section of the simple-mode
configurator lets you enable
access controls, as well as choose between the generic and customized
unauthorized access response.
You can also use the simple-mode configurator to add and remove entries in the list of
SEL-specific access controls.
Note that when you use the simple-mode configurator to create a selector specific access control , you will be asked to provide the selector (with optional wildcards) a list of resource privileges, and an optional permission list. You should use the Modify SEL-specific Access Controls section of the intermediate mode configurator to: Client privileges are granted when you set up user entries. You may also want to use the intermediate mode configurator to modify the INHOUSEIPS_PRIVS and PUBLIC_PRIVS parameters. If you want to get fancy, see ADDPRIVS.DOC for details on how you can specify dynamic client privileges. |
|
||
The HTACCESS method | The HTACCESS method (which is something of a standard) uses special HTACCESS files located in the directory (and in the parent directories) of the requested file. This (or these) HTACCESS files contain information on who has access rights to files in the directory (and in child directories). | HTACCESS files can be created using your favorite text-editor. You might want to read some documentation on the HTACCESS method of access control. Alternatively, you can use the intermediate configurator's Modify HTACCESS Contols options to change HTACCESS files. |
|
The choice between the SEL-specific and HTACCESS methods is a matter of taste and convenience -- controlling directories is a sure way to prevent access to files, but controlling selectors is more flexible (if your physical directory structure changes more frequently then your pages). In fact, these two methods (SEL-specific and directory-specific) can be used jointly, which implies multiple checks. But be careful when using both methods, it is possible to get stuck in authorization loops!
Since SEL-specific controls are native to SRE-Filter, they will tend to be faster (and better tested) then the HTACCESS method. Therefore, our recommendation is to use the SEL-specific method.Return to table of contents.
The problem | Solutions |
---|---|
| |
A document was not specified | You can use the Default Documents section of the simple-mode configurator to:
|
| |
Document could not be found |
By default, SRE-Filter will:
|
| |
A document has been moved to another location | SRE-Filter uses aliases to redirect requests to another URL. This URL need not be on the same server, and it need not have the same name. You can use the Home Directory, virtual directories, and redirection section of the simple-mode configurator to create these redirection aliases. |
| |
You want to catch common mistaakes | SRE-Filter uses aliases to specify local redirections: which involve textual substitutions in the recieved request selector. Among other advantages, this gives you quite a bit of control over how "SRE-Filter facilities, and external procedures" percieve The Request. The intermediate mode's Modify Redirection Aliases section can be used to create these local redirection aliases. |
| |
You want to specify a ~ directory | The home directory is used as a text replacement whenever a ~
is encountered in a request selector.
A typical value of this would be "Users/".
Further refining the use of home directory, you may wish clients to have access to particular subdirectories of your "Users" directories. For example, all "students" may have space on the server machine, some of which is used for web, and some for "personal" purposes. The goal is to give clients direct access to the "web" related sub-directories but not to the "personal" sub-directories. To modify these home directory options, see the Home Directory, virtual directories, and redirection section of the simple-mode configurator |
| |
You want to transfer files from outside of GoServe's data directory | By default, SRE-Filter will match a request selector to a file in the GoServe
data directory. While a good security feature (files not in or under
the GoServe data directory are inaccessible), this can be an inconvenience.
To remedy this inconvenience, one can define virtual directories Basically, SRE-FILTER will compare the starting portion of the request selector to see if it matches one of your virtual directory entries. If it does, the target directory listed in the matching entry is used (instead of the GoServe data directory). Thus, you can make available a wide, but controllable, set of directories (on or LAN accessible from) your server. To create virtual directories, see the Home directory, virtual directories, and redirection section of the simple-mode configurator. Advanced users notes:
|
The feature | Discussion | More details |
---|---|---|
| ||
Caching of documents that contain SSIs. | To improve performance, SRE-Filter will "cache" HTML documents that have server side includes (SSI). This SSI-caching is of the document after the SSIs have been performed. In simple cases SRE-Filter can reuse this cached file, without having to repeat the actual process of SSI lookups, etc. Needless to say, this can greatly improve server performance. |
You can use the Server Side Includes section of the simple-mode configurator to
enable SSI-caching. For a detailed discussion of SSI-Caching, see SSICACHE.HTM. |
| ||
Suppressing Server Side Includes | You can suppress server side includes for everyone, or for selected HTML
documents.
The NO_INCLUDE parameter can be used to suppress all SSI's. A less drastic approach is to use a NO_SSI access-control permission. | See INITFILT.DOC for a discussion of NO_INCLUDE. To set access control permissions, see the Access Controls section of the intermediate configurator. |
| ||
Limiting SSIs to a subset of your HTML documents | SRE-Filter can be instructed to attempt server side includes (SSIs) only on a subset of HTML documents; such as those with .SHT or a .SHTML extension. In other words, HTML documents with .HTM or .HTML extension will not be checked for server side includes. This can speed up file transfer (especially when used in conjunction with SSI-caching), but does require more care when naming html files. | See the Server Side Includes section of the simple-mode configurator to
enable this SSI on SHTML files only feature.
Advanced users notes: |
| ||
Headers and Footers | You can include headers and footers in all your HTML documents. They can contain Server Side Include keyphrases (see below). | You can set both headers and footers in the Server Side Include section of the simple-mode configurator (or see the SRE-Filter manual for further details). |
|
||
The NSCA HTTPD-style server side include syntax | The
NSCA HTTPD-style server side include syntax is fully supported by SRE-Filter.
These include:
|
The SRE-Filter manual also describes how to use NCSA HTTPD-style server side. You may also want to see TIMEFMT.DOC for a description of time and date display formats. |
| ||
The SRE-Filter syntax | The SRE-Filter syntax overlaps the NCSA HTTPD-style syntax, but
there are some
important additions (note: you can use both
types of SSIs in your documents). Additional features include:
|
The SRE-Filter manual contains a detailed discussion of
the SRE-Filter SSI syntax, including a discussion of the suppression of hits feature.
The SRE-Filter
FAQ contains a discussion of the various "hit" counting mechanisms available to
SRE-Filter, as does COUNTER.DOC.
You can use the Server Side Include section of the simple-mode configurator to modify most of the static variables. |
The feature | Discussion |
---|---|
| |
ImageMaps | SRE-Filter supports NSCA-style server-side imagemaps.
The SRE-Filter manual contains a detailed discussion of how to specify and configure imagemaps. You can also play with SAMPMAP.HTM (a sample image map document, it uses the SAMPMAP.MAP imagemap file). |
| |
Searchable Indices | SRE-Filter is shipped with DOSEARCH, a moderately powerful text-file search engine.
When used in conjunction with aliases, it is easy to implement searchable indices.
The SRE-Filter manual contains a short discussion of how to setup searchable indices, as does the ALIASES.IN file. |
| |
Special Requests | SRE-Filter supports several special requests, all of which start with an exclamation point (!) The current set of special requests are: See the SRE-Filter manual for details. |
| |
CGI-Bin support | SRE-Filter offers full support for the
Common Gateway Interface.
SRE-Filter offers a few enhancments, including: |
| |
SRE-Filter add-ons | As an alternative to the somewhat clunky CGI-Bin interface, SRE-Filter add-ons can
be used. The STATUS program is a sample of one such add-on.
On a more useful scale, the following SRE-Filter add-ons can be obtained from the SRE-Filter home page.
|
The protocol | Discussion |
---|---|
HEAD requests | Although not frequently used, HEAD method requests offer some throughput advantages when
searching over a broad range of sites. To fully support such requests, SRE-Filter can
parse the <HEAD> section of HTML documents, and return (in the response header)
the contents of LINK, NAME and META-EQUIV
elements. In fact, SRE-Filter can parse the <HEAD>
for GET requests too (though this may not be a very useful activity).
For further discussion, see the description of the AUTO_HEADER parameter in INITFILT.DOC, or see the SRE-Filter manual. |
| |
File Uploads using HTML FORMS |
SRE-Filter's PUT_FILE facility
supports the multipart/form-data method of file upload
(as supported by NetScape 2.01).
This requires the use of an HTML document that contains a special type
of <FORM>.
Notes: |
| |
Uploads from other servers | Though somewhat obsolete with the advent of
multipart/form-data method of file upload, SRE-Filter's GET_URL
facility permits
clients to transfer files from different http servers to your server.
Notes: |
| |
The PUT and DELETE methods | The PUT and DELETE HTTP 1.1 methods allow clients to upload, and delete, files
from your server. Most browsers do not support these methods.
Notes:
|
| |
Byte-range retrieval |
SRE-Filter supports byte-range retrieval. Byte-range retrieval, which
consists of sending selected portions of a document, is used
by Adobe Acrobat to retrieve selected pages from large .PDF files.
For further discussion, see the description of the ACCEPT_RANGE parameter in INITFILT.DOC. |
| |
Setting the expiration date of documents | To work around NetScape's tendency to over-refresh documents that have an
immediate expiration date, you can instruct SRE-Filter to modify
GoServe's default temporary file response header. Specifically, you can change the immediate expiration date
to an expire in 2 hours expiration date.
The simple mode configurator's Miscellaneous section can be used to select this option. For further discussion, see the description of FIX_EXPIRE in INITFILT.DOC. |
Action | Description |
---|---|
|
|
SRE-Filter Audit files |
SRE-Filter allows you to
use several different audit mechanisms.
|
| |
Load Balancing | SRE-Filter supports a very simple form of load balancing. Specifically, when
the load on your server (measured in number of current transactions) gets too
high, SRE-Filter can redirect the request to (one of several) backup servers.
For details, see the description of the BACKUPSERVERLIST and LOADTHRESHOLD parameters (in INITFILT.DOC), or see the SRE-Filter manual. |
| |
Special Directives | SRE-Filter understands several special directives. These are prefixes added
to request selector that can modify SRE-Filter's logic (they all start with
a ! )
|
| |
Pre-filter processing | There may be occasions when you want some other filter to process
a request before SRE-Filter get's a shot at it. For example, ambitous programmers
may wish to implement a more sophisticated load balancing algorithim; or you might
wish to implement an alternative access control mechanism. As a more practical example,
SRE-Filter comes with PREFILTR.80; a pre-filter that passes
a request to the GOREMOTE.80 server remote control package.
The SRE-Filter manual describes how to enable pre-filters; or you can read the description of the PRE_FILTER and PREFILTER_NAME parameters in INITFILT.DOC. |
| |
Post-filter processing | After SRE-Filter has responsed to a request (and GoServe has closed the connection),
you may wish to perform further actions. For example, you may wish to notify
the webmaster of special events (see
POSTMAIL.80 for an example), or you may
wish to record select requests (see POSTRCRD.80 for an example).
The SRE-Filter manual describes how to enable post-filters; or you can read the description of the POST_FILTER and POSTFILTER_NAME parameters in INITFILT.DOC. |
| |
SRE-Filter is not a super-fleet web server.
The design goal has always
been comprehensiveness (without an enormous amount of complication). Within
the context of an interpreted language (REXX), that has a price.
That said, a lot has been done to improve throughput. In particular; the extensive use of macrospace, the use of independent threads to handle much of the mundane processing, and the incorporation of an SSI-cache help keep SRE-Filter's performance respectable. Still, it would be nice if it were faster.
Well, if you are willing to work on it, that's actually quite a bit you can do to improve it's performance. The following outlines some of the things you can do. If you want more specific information, the best place to go is the first section of INITFILT.DOC -- it contains a number of optimzation hints.
Action | Description |
---|---|
The GoServe Cache | By far the best optimization strategy is to take advantage of GoServe's
cache (look at the Options-General tab under GoServe). Unfortunately, there
is a potential drawback -- anything that's cached will be free of access controls
(since GoServe sends back cached items without fussin' around).
Therefore, if you have any security on your site, SRE-Filter will disable
the cache.
To work around this problem, you can use the CACHE access permission, or the PUBLIC_URLS public area identifiers. |
The SREFQUIK variant of SRE-Filter | For many requests you won't need all the bells and whistles built into SRE-Filter.
Recognizing this, SRE-Filter allows you to call a small/simple filter. This filter
(SREFQUIK) will process a select set of requests, and pass the one's it can't handle
to the standard version of SRE-Filter.
In particular, SREFQUIK can handle
"cached" requests, and it can handle literal PUBLIC_URLS.
The SREFQUIK documentation thoroughly describes the use SREFQUIK. You should also read the description of PUBLIC_URLS in INITFILT.DOC. |
Suppressing unneeded features | If you know you aren't going to use one of SRE-Filter's features; you can and should
instruct SRE-Filter not to use it. In general, there are two ways of doing this: across
the board, or on a selector specific basis.
For example, you can
turn off Virtual Directory lookup for everyone (by setting VIRTUAL_FILE=' '), you
can use the NO_VIRTUAL permission to suppress virtual directory lookup on a case by case (
selector by selector ) basis, or you can set up a literal PUBLIC_URL.
|
Seperate data directories, aliases, etc. can be specified for each supported host
SRE-Filter, and related products, are to be used at your own risk; the authors assume no liability for any damage that may be caused by the use or misuse of these software products.The nice news is:
SRE-Filter is copyright by Daniel Hellerstein, and hereby placed in the the public domain. You may use is it any manner you deem fit, up to and including use of the code in your own software.
Copyright 1996,1997 by Daniel Hellerstein. Permission to use this program for any purpose is hereby granted without fee, provided that the author's name not be used in advertising or publicity pertaining to distribution of the software without specific written prior permision. With some exception, this includes the right to subset and reuse the code, with proper attribution. The exceptions are two fold: 1) Portions of the code adapted from other author's work (these are noted where appropriate); you'll need to contact these other authors for appropriate permissions. 2) SRE-Filter uses Quercus System's REXXLIB procedure library. The license for REXXLIB gives the author the right to distribute REXXLIB without charge. This right may NOT extend to redistributors. Please contact Quercus Systems for details. SRE-Filter also uses Info-Zip's UNZIPAPI. Although this is freely available software, you may wish to contact Info-Zip for details. Furthermore you may also charge a reasonable re-distribution fee for SRE-Filter; with the understanding that this does not remove the work from the public domain. THIS SOFTWARE PACKAGE IS PROVIDED "AS IS" WITHOUT EXPRESS OR IMPLIED WARRANTY. THE AUTHOR DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE PACKAGE, INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS. IN NO EVENT SHALL THE AUTHOR (Daniel Hellerstein) OR ANY PERSON OR INSTITUTION ASSOCIATED WITH THIS PRODUCT BE LIABLE FOR ANY SPECIAL,INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT,NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE PACKAGE. SRE-FILTER was developed on the personal time of Daniel Hellerstein, and is not supported, approved, or in any way an official product of my employer (USDA/ERS).