- Path: sparky!uunet!olivea!decwrl!access.usask.ca!jester.usask.ca!fogel
- From: fogel@jester.usask.ca (Earl Fogel)
- Newsgroups: alt.gopher
- Subject: Re: index for all of gopherspace
- Message-ID: <1992Jul29.154103.26402@access.usask.ca>
- Date: 29 Jul 92 15:41:03 GMT
- References: <1992Jul27.163746.974@nstn.ns.ca> <1992Jul27.180509.27470@mercury.unt.edu> <98ANB22K@cc.swarthmore.edu>
- Sender: news@access.usask.ca (USENET News System)
- Organization: University of Saskatchewan
- Lines: 22
- Nntp-Posting-Host: jester.usask.ca
-
- The problem with the Wais directory-of-servers is that the database
- descriptions are woefully incomplete. Searches for "Mac" and "Macintosh",
- for example, produce very different results.
-
- And I'm not convinced we can rely on gopher server administrators to
- maintain up-to-date, keyword-rich site descriptions, as has been suggested.
-
- Instead, I wonder if a site description database could be created
- automatically, by waisindexing the information at each site, and
- compiling a list of the top-scoring words. The directory-of-servers
- would then maintain a database of the most frequently occurring 1%
- of the words in each database.
-
- Presumably, "Mac" and "Macintosh" would make the most-frequent list
- for Mac-related databases, while "dessert" and "cake" would make the
- list for recipe databases.
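
The idea above can be sketched roughly as follows. This is only an illustration, not how waisindex or the directory-of-servers works: the names `top_words` and `describe_sites`, the tokenizing regular expression, and the optional `stop_words` set are all assumptions made for the example. It counts the distinct words in each site's text and keeps the most frequent fraction of them (1% by default, always at least one word).

```python
import math
import re
from collections import Counter


def top_words(text, fraction=0.01, stop_words=frozenset()):
    """Return the most frequent `fraction` of the distinct words in `text`.

    Hypothetical helper: the tokenization and the stop-word filter are
    assumptions for this sketch, not part of any WAIS tool.
    """
    # Lowercase the text and split it into simple alphanumeric tokens.
    words = [w for w in re.findall(r"[a-z0-9']+", text.lower())
             if w not in stop_words]
    counts = Counter(words)
    if not counts:
        return []
    # Keep the top `fraction` of distinct words, but never fewer than one.
    keep = max(1, math.ceil(len(counts) * fraction))
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [word for word, _ in ranked[:keep]]


def describe_sites(site_texts, fraction=0.01, stop_words=frozenset()):
    """Map each site name to its automatically generated keyword list."""
    return {site: top_words(text, fraction, stop_words)
            for site, text in site_texts.items()}
```

For example, `describe_sites({"recipes": "cake cake dessert"}, fraction=1.0)` keeps both words for the hypothetical "recipes" site. Without a stop-word list, very common words such as "the" would probably crowd out the useful ones, so a real version would likely need some filter of that kind.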
-
- Earl Fogel
- -------------------------------------------------------------------
- fogel@sask.usask.ca Computing Services, Room 56 Physics
- Phone: (306) 966-4861 University of Saskatchewan
- Fax: (306) 966-4938 Saskatoon, Sask. CANADA, S7N 0W0
-