home *** CD-ROM | disk | FTP | other *** search
- Newsgroups: comp.ai,news.answers,comp.answers
- Path: senator-bedfellow.mit.edu!bloom-beacon.mit.edu!usc!howland.reston.ans.net!noc.near.net!das-news.harvard.edu!cantaloupe.srv.cs.cmu.edu!crabapple.srv.cs.cmu.edu!mkant
- From: mkant+@cs.cmu.edu (Mark Kantrowitz)
- Subject: FAQ: Artificial Intelligence FTP Resources 5/6 [Monthly posting]
- Message-ID: <ai-faq-5.text_745225830@cs.cmu.edu>
- Followup-To: poster
- Summary: FTP Resources for AI
- Sender: news@cs.cmu.edu (Usenet News System)
- Supersedes: <ai-faq-5.text_742546968@cs.cmu.edu>
- Nntp-Posting-Host: a.gp.cs.cmu.edu
- Reply-To: mkant+ai-faq@cs.cmu.edu
- Organization: School of Computer Science, Carnegie Mellon University
- Date: Fri, 13 Aug 1993 07:11:16 GMT
- Approved: news-answers-request@MIT.Edu
- Expires: Fri, 24 Sep 1993 07:10:30 GMT
- Lines: 511
- Xref: senator-bedfellow.mit.edu comp.ai:18228 news.answers:11329 comp.answers:1590
-
- Archive-name: ai-faq/part5
- Last-Modified: Mon Jul 19 19:32:40 1993 by Mark Kantrowitz
- Version: 1.9
-
- ;;; ****************************************************************
- ;;; Answers to Questions about Artificial Intelligence *************
- ;;; ****************************************************************
- ;;; Written by Mark Kantrowitz
- ;;; ai-faq-5.text -- 21742 bytes
-
- If you think of questions that are appropriate for this FAQ, or would
- like to improve an answer, please send email to mkant+ai-faq@cs.cmu.edu.
-
- Please note that the FTP Resources are now split across parts 4 and 5
- of the AI FAQ.
-
- Part 5 (FTP Resources):
- [5-1] AI Bibliographies available by FTP
- [5-2] AI Technical Reports available by FTP
- [5-3] Where can I get a machine readable dictionary, thesaurus, and
- other text corpora?
- [5-4] List of Smalltalk implementations.
-
- Search for [#] to get to question number # quickly.
-
- ----------------------------------------------------------------
- Subject: [5-1] AI Bibliographies available by FTP
-
- The Computer Science Department at the University of Saarbruecken, Germany,
- maintains a large bibliographic database of articles pertaining to the
- field of Artificial Intelligence. Currently the database contains more
- than 25,000 references, which can be retrieved by electronic mail from
- the LIDO mailserver at lido@cs.uni-sb.de. Send a mail message with
- subject line "lidosearch help info" to get instructions on using the
- mail server. A variety of queries based on author names, title and
- year of publication are possible. The references can be provided in
- BibTeX or Refer formats. The entire bibliographic database can be
- obtained for a fee by ftp or on tape. Questions may be directed to
- bib-1@cs.uni-sb.de.
-
- A variety of AI-related bibliographies are located on nexus.yorku.ca
- in the directory /pub/bibliographies.
-
- For information on a fairly complete bibliography of computational
- linguistics and natural language processing work from the 1980s, send
- mail to clbib@csli.stanford.edu with the subject HELP.
-
- Stanford University (SUMEX-AIM) has a large BibTeX bibliography of
- Artificial Intelligence papers and technical reports. Available by
- anonymous ftp from aim.stanford.edu:/pub/ai{1,2,3}.bib
-
- A BibTeX database of references addressing neuro-fuzzy issues can be
- obtained by anonymous ftp from ftp.tu-bs.de (134.169.34.15) in the
- directory local/papers as the (ascii) file fuzzy-nn.bib.
-
- Robert Dale's Natural Language Generation (NLG) bibliography is
- available by anonymous ftp from scott.cogsci.ed.uk [129.215.144.3] in
- the directory pub/nlg. Note that it is formatted for A4 paper. For
- further information, write to Robert Dale, University of Edinburgh,
- Centre for Cognitive Science, 2 Buccleuch Place, Edinburgh EH8 9LW
- Scotland, or <R.Dale@edinburgh.ac.uk>.
-
- ----------------------------------------------------------------
- Subject: [5-2] Technical Reports available by FTP
-
- This section lists the anonymous ftp sites for technical reports from
- several universities and other organizations. Some of the sites
- provide only an online catalog of technical reports, while the rest
- make the actual reports available online. The email address listed is
- that of the appropriate person to contact with questions about
- ordering technical reports.
-
- When ftping compressed .Z files, remember to set the transfer type to
- binary first, using the command
- ftp> binary
-
- Other general locations for technical reports from several
- universities include:
- wuarchive.wustl.edu:/doc/techreports/ [128.252.135.4]
- cs-archive.uwaterloo.edu:cs-archive (see Index for an index)
- AKA watdragon.uwaterloo.ca [129.97.140.24]
- The uwaterloo archive includes tech reports from the Logic Programming
- and Artificial Intelligence Group (LPAIG) of the University of Waterloo.
-
- There is also a WAIS server containing tech report abstracts that can be
- searched. To use, create the file ~/wais-sources/cs-techreport-abstracts.src
- containing
- (:source
- :version 3
- :ip-address "130.194.74.201"
- :ip-name "daneel.rdt.monash.edu.au"
- :tcp-port 210
- :database-name "cs-techreport-abstracts"
- :cost 0.00
- :cost-unit :free
- :maintainer "wais@daneel.rdt.monash.edu.au")
- and invoke your local wais client. To add to it, email abstracts of
- your papers to wais@rdt.monash.edu.au in the following format:
- %TI Title
- %AU Author (use multiple %AU lines for multiple authors)
- %PU Published In (citation information)
- %AV Availability (e.g., ftp reports.adm.cs.cmu.edu:1992/CMU-CS-92-101.ps)
- %OR Organization (see cs-techreport-archives.src for institution codes)
- %LT Local title (e.g., tech report number)
- %DA Date (and, if you want, %MN Month, %YR Year)
- %AB Abstract
- If your papers are not available by FTP, you can use a %AV line such as:
- %AV mail harry.bovik@cs.cmu.edu
- Further instructions are available from
- daneel.rdt.monash.edu.au:/pub/techreports/reports/README
- [Based on a post by Ashwin Ram.]
-
- An archive of linguistics papers and preprints is available from
- linguistics.archive.umich.edu:linguistics/papers/. Contact John Lawler
- (jlawler@umich.edu) or linguistics-archivist@umich.edu for more
- information.
-
- The newsgroup comp.doc.techreports is devoted to distributing lists of
- tech reports and their abstracts.
-
- MIT Artificial Intelligence Laboratory:
- ftp -- ftp.ai.mit.edu:ai-pub/{bibliography,general-info,publications}
- email -- publications@ai.mit.edu
- browse -- telnet reading-room.lcs.mit.edu
-
- A full catalog of MIT AI Lab technical reports (and a listing of recent
- updates) may be obtained from the above location, by writing to
- Publications, Room NE43-818, M.I.T. Artificial Intelligence Laboratory,
- 545 Technology Square, Cambridge, MA 02139, USA, or by calling
- 1-617-253-6773. The catalog lists the technical reports ("AI Memos")
- with a short abstract and their current prices. There is also a charge
- for shipping. Some recent tech reports are available in the
- publications/ subdirectory; older technical reports are NOT
- available by ftp.
-
- Sandiway Fong's 1991 PhD thesis, ``The Computational Properties of
- Principle-Based Grammatical Theories,'' may be found in the
- directory pub/sandiway/.
-
- CMU School of Computer Science:
- ftp -- reports.adm.cs.cmu.edu
- email -- Technical.Reports@cs.cmu.edu
-
- CMU Software Engineering Institute:
- ftp -- ftp.sei.cmu.edu:/pub/documents
- email -- bjz@sei.cmu.edu
-
- Yale:
- ftp -- dept.cs.yale.edu:/pub/TR/
-
- University of Washington CSE Tech Reports:
- ftp -- june.cs.washington.edu:/tr
- email -- tr-request@cs.washington.edu
-
- ================
-
- AT&T Bell Laboratories:
- ftp -- research.att.com:/netlib/research/cstr
-
- bib.Z contains short bibliography, including all the technical
- reports contained in this directory.
-
- ftp -- research.att.com:/dist/ai
-
- Argonne National Laboratory:
- ftp -- anagram.mcs.anl.gov:pub/tech_reports
- email -- wright@mcs.anl.gov
-
- Contains MCS Division preprints and technical memoranda,
- available as either .dvi or .ps files. For descriptions of the
- contents, see the subdirectory pub/tech_reports/abstracts; for
- the files themselves see the subdirectory pub/tech_reports/reports.
-
- Boston University:
- ftp -- cs.bu.edu:techreports/
- email -- techreports@cs.bu.edu
-
- Brown University:
- ftp -- wilma.cs.brown.edu:techreports/
- email -- techreports@cs.brown.edu
-
- Cambridge University: Speech, Vision & Robotics Group
- ftp -- svr-ftp.eng.cam.ac.uk:reports/
-
- Columbia University:
- ftp -- cs.columbia.edu:/pub/reports
- email -- tech-reports@cs.columbia.edu
-
- DEC Cambridge Research Lab:
- ftp -- crl.dec.com:/pub/DEC/CRL/{abstracts,tech-reports}
-
- DEC Paris Research Lab:
- email -- doc-server@prl.dec.com
- Put commands in Subject: line of the message.
- To get a list of articles, use
- send index articles
- To get a list of tech reports, use
- send index reports
-
- DEC WRL:
- email -- wrl-techreports@decwrl.dec.com
- To get a helpfile, send a message with
- help
- in the subject line.
-
- DFKI:
- ftp -- duck.dfki.uni-sb.de:/pub/papers
- email -- Martin Henz (henz@dfki.uni-sb.de)
-
- Duke University:
- ftp -- cs.duke.edu:/dist/{papers,theses}
- email -- techreport@cs.duke.edu
-
- Edinburgh:
- A list of available reports can be sent via email. Send requests
- for information about reports from the Center for Cognitive Science
- to cogsci%ed.ac.uk@nsfnet-relay.ac.uk, and from the Human Communication
- Research Center to HCRC%ed.ac.uk@nsfnet-relay.ac.uk.
-
- Electrotechnical Laboratory, Japan:
- Reports from the Cooperative Architecture project (half AI, half
- software engineering).
- ftp -- etlport.etl.go.jp:pub/kyocho/Papers [192.31.197.99]
- See file Index.English.
- email -- Hideyuki Nakashima <nakashim@etl.go.jp>.
-
- Georgia Tech College of Computing, AI Group:
- ftp -- ftp.cc.gatech.edu:pub/ai (130.207.3.245)
- email -- Professor Ashwin Ram <ashwin@cc.gatech.edu>
-
- Illinois:
- email -- Erna Amerman <erna@uiuc.edu>
-
- Illinois Genetic Algorithms Laboratory (IlliGAL):
- email -- Eric Thompson <library@gal1.ge.uiuc.edu>
- phone -- 217-333-2346 (9AM to 5PM CT, M-F)
- mail -- Illinois Genetic Algorithms Laboratory
- Department of General Engineering
- 117 Transportation Building
- 104 South Mathews Avenue
- Urbana, IL 61801-2996
- ftp -- coming soon.
-
- Indiana:
- ftp -- cogsci.indiana.edu:pub [129.79.238.12]
- ftp -- cs.indiana.edu:pub/techreports [129.79.254.191]
-
- INRIA, France:
- ftp -- ftp.inria.fr:INRIA/publication/
-
- Institute for Learning Sciences at Northwestern University:
- ftp -- aristotle.ils.nwu.edu:/pub/papers/
-
- New York University (NYU):
- ftp -- cs.nyu.edu:/pub/tech-reports
-
- OGI:
- ftp -- cse.ogi.edu:/pub/tech-reports
- email -- csedept@cse.ogi.edu
-
- Ohio State University, Laboratory for AI Research
- ftp -- nervous.cis.ohio-state.edu:/pub/papers
- email -- lair-librarian@cis.ohio-state.edu
-
- OSU Neuroprose:
- ftp -- archive.cis.ohio-state.edu:/pub/neuroprose (128.146.8.52)
-
- This directory contains technical reports as a public service to the
- connectionist and neural network scientific community which has an
- organized mailing list (for info: connectionists-request@cs.cmu.edu)
- Includes several bibliographies.
-
- Stanford:
- ftp -- elib.stanford.edu:/cs
-
- Very spotty collection.
-
- SUNY Buffalo:
- ftp -- ftp.cs.buffalo.edu:/pub/tech-reports/
-
- SUNY at Stony Brook:
- ftp -- sbcs.sunysb.edu:/pub/TechReports
- email -- rick@cs.sunysb.edu or stark@cs.sunysb.edu
-
- The /pub/sunysb directory contains the SB-Prolog implementation
- of the Prolog language. Contact warren@sbcs.sunysb.edu for more
- information.
-
- TCGA (The Clearinghouse for Genetic Algorithms):
- email -- Robert Elliott Smith <rob@comec4.mh.ua.edu>
- Department of Engineering of Mechanics
- Room 210 Hardaway Hall
- The University of Alabama
- PO Box 870278
- Tuscaloosa, AL 35487
- 205-348-1618, fax 205-348-6419
-
- Thinking Machines:
- ftp -- ftp.think.com:think/techreport.list
-
- This file contains a list of Thinking Machines technical reports.
- Orders may be placed by email (limit 5) to t-rex@think.com, or by US
- Mail to Thinking Machines Corporation, Attn: Technical reports, 245
- First Street, Cambridge, MA 01241. In addition, the directories
- cm/starlisp and cm/starlogo contain code for the *Lisp and *Logo
- simulators.
-
- Tulane University:
- ftp -- rex.cs.tulane.edu:pub/tech/ [129.81.132.1]
-
- University of Arizona:
- ftp -- cs.arizona.edu:reports/
- email -- tr_libr@cs.arizona.edu
-
- The directory /japan/kahaner.reports contains reports on AI in
- Japan, among other things, written by Dr. David Kahaner, a
- numerical analyst on sabbatical to the Office of Naval
- Research-Asia (ONR Asia) in Tokyo from NIST. The reports are not
- written in any sort of official capacity, but are quite interesting.
-
- University of California/Santa Cruz:
- ftp -- ftp.cse.ucsc.edu:/pub/{bib,tr}
- email -- jean@cs.ucsc.edu
-
- University of Colorado:
- ftp -- ftp.cs.colorado.edu:/pub/cs/techreports
-
- University of Florida:
- ftp -- bikini.cis.ufl.edu:/cis/tech-reports
-
- University of Illinois at Urbana:
- ftp -- a.cs.uiuc.edu:/pub/dcs
- email -- erna@a.cs.uiuc.edu
-
- University of Indiana, Center for Research on Concepts and Cognition:
- ftp -- cogsci.indiana.edu:pub/
- email -- helga@cogsci.indiana.edu
-
- University of Kaiserslautern, Germany:
- ftp -- ftp.uni-kl.de:reports_uni-kl/computer_science/
-
- University of Kentucky:
- ftp -- ftp.ms.uky.edu:ftp/pub/tech-reports/UK/cs/
-
- University of Massachusetts at Amherst:
- email -- techrept@cs.umass.edu
-
- University of Michigan:
- ftp -- ftp.eecs.umich.edu:/techreports
-
- University of North Carolina:
- ftp -- ftp.cs.unc.edu:/pub/technical-reports/
-
- University of Pennsylvania:
- ftp -- ftp.cis.upenn.edu:/pub/papers/
- email -- publications@upenn.edu
-
- USC/Information Sciences Institute:
- email -- Sheila Coyazo <scoyazo@isi.edu> is the contact.
-
- University of Toronto:
- ftp -- ftp.cs.toronto.edu:/pub/reports
- email -- tech-reports@cs.toronto.edu
-
- University of Virginia:
- ftp -- uvacs.cs.virginia.edu:/pub/techreports/cs
-
- University of Wisconsin:
- ftp -- ftp.cs.wisc.edu:/tech-reports
- email -- tech-reports-archive@cs.wisc.edu
-
-
- Some AI authors have set up repositories of their own papers:
-
- Matthew Ginsberg: t.stanford.edu:/u/ftp/papers
-
- ----------------------------------------------------------------
- Subject: [5-3] Where can I get a machine readable dictionary, thesaurus, and
- other text corpora?
-
- Free:
-
- Roget's 1911 Thesaurus is available by anonymous FTP from the
- Consortium for Lexical Research (clr.nmsu.edu, [128.123.1.12]).
- The pathname is /pub/lexica/thesauri/roget-1911.
- It is also available from
- src.doc.ic.ac.uk:/literary/collections/project_gutenberg/roget11.txt.Z
- An old Webster's dictionary is in /text/dict/{DICT.Z,DICT.INDEX.Z}.
- Project Gutenberg also has Roget's 1911 Thesaurus. The Project
- Gutenberg archive is at mrcnext.cso.uiuc.edu:/pub/etext/. The
- Project Gutenberg archive collects public domain electronic books. For more
- information, write to Michael S. Hart, Professor of Electronic Text,
- Executive Director of Project Gutenberg Etext, Illinois Benedictine
- College, Lisle, IL 60532 or send email to hart@vmd.cso.uiuc.edu.
-
- For people without FTP, Austin Code Works sells floppy disks
- containing Roget's 1911 Thesaurus for $40.00. This money helps support
- the production of other useful texts, such as the 1913 Webster's dictionary.
-
- The Online Book Initiative maintains a text repository on
- world.std.com (a public access UNIX system, 617-739-WRLD). See the
- README file on obi.std.com:/obi/. For more information, send email to
- obi@world.std.com, write to Software Tool & Die, 1330 Beacon Street,
- Brookline, MA 02146, or call 617-739-0202.
-
- The CHILDES project at Carnegie Mellon University has a lot of data of
- children speaking to adults, as well as the adult written and adult
- spoken corpora from the CORNELL project. Contact Brian MacWhinney
- <brian@andrew.cmu.edu> for more information.
-
- The Association for Computational Linguistics (ACL) has a Data
- Collection Initiative. For more information, contact Donald Walker at
- Bellcore, walker@flash.bellcore.com.
-
- Two lists of common female first names (4967 names) and male first
- names (2924 names) are available for anonymous ftp from ftp.cs.cmu.edu
- in the directory user/ai/software/nlp/corpora/names/. Read
- the file README first. [Note that you must cd to this directory in one
- atomic operation, as superior directories are protected during an
- anonymous ftp.] Send mail to mkant@cs.cmu.edu for more information.
-
- A list of 110,000 English words (one per line, in ASCII) is
- available in the PD1:<MSDOS.LINGUISTICS> directory on SIMTEL20 as the
- files WORDS1.ZIP, WORDS2.ZIP, WORDS3.ZIP, and WORDS4.ZIP. Although the
- list is in MS-DOS files, it can easily be used on other machines (but
- first you'll have to unzip the files on a DOS machine). The list
- includes inflected forms of the words, such as plural nouns and the
- -s, -ed, and -ing forms of verbs; thus the number of lexical stems in
- the list is considerably smaller than the total number of word forms.
- These files are available via FTP from WSMR-SIMTEL20.ARMY.MIL
- [192.88.110.20]. SIMTEL20 files are mirrored on wuarchive.wustl.edu.
-
- The Collins English Dictionary encoded as a Prolog fact base is
- available from the Oxford Text Archive by anonymous ftp from
- black.ox.ac.uk:ota/dicts/1192/ [129.67.1.165]
- The Oxford Text Archive includes many other texts, dictionaries,
- thesauri, word lists, and so on, most of which are available for
- scholarly use and research only. See the files
- black.ox.ac.uk:ota/textarchive.{form,info,list,sgml}
- for more information, or write to archive@ox.ac.uk, Oxford Text Archive,
- Oxford University Computing Services, 13 Banbury Road, Oxford OX2
- 6NN, UK, call 44-865-273238 or fax 44-865-273275.
-
- Chuck Wooters <wooters@icsi.berkeley.edu> has extracted the most
- likely pronunciation for each of about 6100 words in the hand-labeled
- TIMIT database, and made them available by anonymous ftp from
- ftp.icsi.berkeley.edu:pub/speech/TIMIT.mostlikely.Z.
-
- A list of homophones from general American English is available by
- anonymous ftp from svr-ftp.eng.cam.ac.uk:comp.speech/data/ as the file
- homophones-1.01.txt. To receive the list by email, send mail to
- Evan.Antworth@sil.org. The list was compiled by Tony Robinson.
-
- Commercial:
-
- Illumind publishes the Moby Thesaurus (25,000 roots/1.2 million
- synonyms), Moby Words (560,000 entries), Moby Hyphenator (155,000
- entries), and the Moby Part-of-Speech (214,000 entries) and Moby
- Pronunciator (167,000 entries with IPA encoding, syllabification, and
- primary, secondary, and tertiary stress marks) lexical databases. All
- databases are supplied in pure ASCII, royalty-free, in both Macintosh
- and MS-DOS disk formats (also in .Z file formats). Both commercial (to
- resell derived structures as part of commercial applications) and
- educational/research licenses are available. For more information,
- write to Illumind, Attn: Grady Ward, 3449 Martha Court, Arcata, CA
- 95521, call 707-826-7715, or send email to grady@netcom.com.
-
- The Oxford Text Archive has hundreds of online texts in a wide variety
- of languages, including a few dictionaries (the OED, Collins, etc.).
- The Lancaster-Oslo-Bergen (LOB), Brown, and London-Lund corpii are also
- available from them. For more information, write to Oxford Electronic
- Publishing, Oxford University Press, 200 Madison Avenue, New York, NY
- 10016, call 212-889-0206, or send mail to archive@vax.oxford.ac.uk.
- (Their contact information in England is Oxford Text Archive, Oxford
- University Computing Service, 13 Banbury Road, Oxford OX2 6NN, UK, +44
- (865) 273238.)
-
- Mailing Lists:
-
- CORPORA is a mailing list for Text Corpora. It welcomes information
- and questions about text corpora such as availability, aspects of
- compiling and using corpora, software, tagging, parsing, and
- bibliography. To be added to the list, send a message to
- corpora-request@x400.hd.uib.no. Contributions should be sent to
- corpora@x400.hd.uib.no.
-
- Linguistic Data Consortium:
-
- The Linguistic Data Consortium was established to broaden the collection
- and distribution of speech and natural language data bases for the
- purposes of research and technology development in automatic speech
- recognition, natural language processing, and other areas where large
- amounts of linguistic data are needed. Information about the LDC is
- available by anonymous ftp from ftp.cis.upenn.edu:/pub/ldc [130.91.6.8].
- Documents available in this directory include a paper on the background,
- rationale and goals of the LDC, a brief list of available data bases,
- and some tables summarizing these corpora. For further information,
- contact Elizabeth Hodas, <ehodas@walnut.ling.upenn.edu>, Mark Liberman
- <myl@unagi.cis.upenn.edu>, or Jack Godfrey <jgodfrey@unagi.cis.upenn.edu>.
-
- ----------------------------------------------------------------
- Subject: [5-4] List of Smalltalk implementations.
-
- Little Smalltalk -- Tim Budd's version of Smalltalk
- cs.orst.edu: /pub/budd/small.v3.tar
-
- GNU Smalltalk
- prep.ai.mit.edu:/pub/gnu/smalltalk-1.1.1.tar.Z
-
- ----------------------------------------------------------------
- ;;; *EOF*
-