home *** CD-ROM | disk | FTP | other *** search
- Newsgroups: comp.unix.questions
- Path: sparky!uunet!mcsun!news.funet.fi!hydra!klaava!widenius
- From: widenius@klaava.Helsinki.FI (Risto Widenius)
- Subject: Re: dictionary search
- Message-ID: <1993Jan6.054639.23826@klaava.Helsinki.FI>
- Keywords: dictionary search
- Organization: University of Helsinki
- References: <1993Jan5.223437.16029@mnemosyne.cs.du.edu>
- Date: Wed, 6 Jan 1993 05:46:39 GMT
- Lines: 38
-
- sscrivan@nyx.cs.du.edu (steve scrivano) writes:
-
- >Could someone who has access to a site with a dictionary of 200,000 English
- >words or more be so kind as to run this script and e-mail me the "out.file".
-
- [horror sed script deleted]
-
- I don't know how intelligent sed implementations generally are in
- parsing regular expressions, but shorter seems better to me in any case.
-
- This is the same script that Jean-Pierre Radley posted; with some excess
- dots stripped to make it work like expected. Otherwise I have to stress
- that this is jpr's excellent work:
-
- </usr/dict/words tr '[A-Z]' '[a-z]' |
- sed '
- /^.$/d
- /^..$/d
- /^...$/d
- /[abdjkmqv-z0-9]/d
- /c.*c/d
- /e.*e.*e/d
- /f.*f/d
- /g.*g/d
- /h.*h/d
- /i.*i.*i/d
- /l.*l.*l/d
- /n.*n.*n/d
- /o.*o/d
- /p.*p.*p/d
- /r.*r.*r.*r/d
- /s.*s.*s/d
- /t.*t.*t.*t/d
- /u.*u/d
- ' > out.file
-
- --
- widenius@cc.helsinki.fi "But now there was worse." -Bernard Levin
-