home *** CD-ROM | disk | FTP | other *** search
- SWISH README
- ~~~~~~~~~~~~
-
- What should be in this directory:
-
- swish.11.tar.Z SWISH 1.1 distribution
-
- The documentation's at
-
- http://www.eit.com/software/swish/
-
- ...and you can email kevinh@eit.com for suggestions, contributions, etc.
-
- Enjoy!
-
- -- Kevin Hughes
- kevinh@eit.com
- 3/13/95
-
- --
-
- Announcing SWISH 1.1
- ~~~~~~~~~~~~~~~~~~~~
-
- What's SWISH?
- ~~~~~~~~~~~~~
-
- SWISH stands for Simple Web Indexing System for Humans -
- it's a program that allows you to index your Web site and search
- for files using keywords in a fast and easy manner.
- For instance, to index your Web site, just type:
-
- swish -i /usr/local/www
-
- This creates a single file called "index.swish" that you
- can search. Then, to search for files with certain words, such as
- "john and doe or jane", just type:
-
- swish -w john and doe or jane
-
- What's New
- ~~~~~~~~~~
-
- * You can search for words specifically in HTML
- <BODY>, <HEAD>, <TITLE>, header, comment, and
- emphasized (<B>, <I>, <EM>, <STRONG>) tags, or
- use any combination of these criteria.
-
- * You can now use NOT booleans, parentheses, and
- wildcards to match the beginnings of words, so
- you can make searches like:
-
- "((t* and that) or (not apples and oran*))"
-
- * You can index and search words with 8-bit HTML
- entities in them. Entities can be converted to
- their 7-bit ASCII equivalents, if they exist, so
- a word such as "resumé" can be searched
- as "resume".
-
- * Searching is about three times faster on average.
-
- * You can specify multiple directories and files to
- index, and multiple index files to search.
-
- * Stopwords can be automatically found and included in
- index files.
-
- * You can merge index files larger than available
- memory. Merging removes all redundant information.
-
- * Many other features and bug fixes.
-
- SWISH is not as fully-featured as many indexing programs, but
- it has been customized for indexing Web sites containing HTML files
- and images. Search results are ranked, words in title and header tags
- are indexed and weighted accordingly, and simple boolean searches are
- supported. Using a SWISH configuration file, you can specify the files
- to index using a number of criteria. SWISH indexes are typically
- 50% of the size of WAIS indexes and 1 to 5% of the size of the original
- HTML data, indexing is faster, and searching is typically as fast as
- most WAIS searches.
- With WWWWAIS acting as a gateway, you can use SWISH just like
- waisq and similar programs.
-
- You can see how it all looks at:
-
- http://www.eit.com/cgi-bin/wwwwais
-
- Enjoy,
-
- -- Kevin
-