home *** CD-ROM | disk | FTP | other *** search
- Newsgroups: comp.unix.questions
- Path: sparky!uunet!mnemosyne.cs.du.edu!nyx!sscrivan
- From: sscrivan@nyx.cs.du.edu (steve scrivano)
- Subject: dictionary search
- Message-ID: <1993Jan5.223437.16029@mnemosyne.cs.du.edu>
- Keywords: dictionary search
- Sender: usenet@mnemosyne.cs.du.edu (netnews admin account)
- Organization: Nyx, Public Access Unix @ U. of Denver Math/CS dept.
- Date: Tue, 5 Jan 93 22:34:37 GMT
- Lines: 60
-
- Could someone who has access to a site with a dictionary of 200,000 English
- words or more be so kind as to run this script and e-mail me the "out.file".
- If your dictionary is not in "/usr/dict/words" just modify the program to
- change the name. I have been told that several such dictionaries exist,
- however, even I could download them, my machine storage is just too small
- and I really don't need a whole dictionary of this size anyway but just the
- results of this script. To save space, you can send the out.file in
- compressed/uuencoded format.
-
- </usr/dict/words tr '[A-Z]' '[a-z]' |
- sed '
- /^.$/d
- /^..$/d
- /^...$/d
- /[abdjkmqv-z0-9]/d
- /c.*c/d
- /cc/d
- /e.*.e.*e/d
- /e.*.ee/d
- /ee.*e/d
- /f.*f/d
- /ff/d
- /g.*g/d
- /gg/d
- /h.*h/d
- /hh/d
- /i.*.i.*i/d
- /i.*.ii/d
- /ii.*i/d
- /l.*.l.*l/d
- /l.*.ll/d
- /ll.*l/d
- /n.*.n.*n/d
- /n.*.nn/d
- /nn.*n/d
- /o.*o/d
- /oo/d
- /p.*.p.*p/d
- /p.*.pp/d
- /pp.*p/d
- /r.*r.*r.*r/d
- /rr.*r.*r/d
- /r.*r.*rr/d
- /r.*rr.*r/d
- /s.*.s.*s/d
- /ss.*s/d
- /s.*.ss/d
- /t.*t.*t.*t/d
- /tt.*t.*t/d
- /t.*t.*tt/d
- /t.*tt.*t/d
- /u.*u/d
- /uu/d
- ' > out.file
-
-
- Thanks so much for your help!
-
- Steve Scrivano
- sscrivan@nyx.cs.du.edu
-