home *** CD-ROM | disk | FTP | other *** search
- Comments: Gated by NETNEWS@AUVM.AMERICAN.EDU
- Path: sparky!uunet!paladin.american.edu!auvm!!HELSINKI,
- X-Envelope-to: NOTABENE@TAUNIVM.BITNET
- X-VMS-To: IN%"NOTABENE@TAUNIVM.BITNET"
- X-VMS-Cc: JTAKALA
- MIME-version: 1.0
- Content-type: TEXT/PLAIN; CHARSET=US-ASCII
- Content-transfer-encoding: 7BIT
- Message-ID: <01GSH5XGTF4IAC2XG2@hylk.Helsinki.FI>
- Date: Fri, 18 Dec 92 22:31:38 IST
- Sender: Nota Bene List <NOTABENE@TAUNIVM.BITNET>
- From: "J-P Takala, University of Helsinki,
- Sociology" <JTAKALA@FINUHA.BITNET>
- Subject: separators, NB, Orbis
- Newsgroups: bit.listserv.notabene
- Lines: 30
-
-
- Got a familiar looking table for
- Separators for Words, Sentences, and Paragraphs
- SE:3
- ....etc
- from Mervyn. (Was it really supposed to be in nb3's DEFAULT.SET? Why
- haven't I been punished--or in which way have I been punished--for
- not having it, at least for quite some time now?)
-
- Anyway, it did not seem to affect the way Orbis (format 1) treated
- hyphens. I added the separator definitions to NBCUSTOM.SET, restarted
- NB and created a brand new test Orbis textbase. The result was the
- same as I described earlier. "Eeva riitta" would not find "Eeva-Riitta"
- but "eeva-riitta" would, and that's just the way I like it, and I hope
- this is taken as a feature and not a bug by those who are fixing this
- thing until it's perfect and flawless.
-
- BUT. I guess that this hyphen thing (which I like) is _of a piece_
- with commas and (as I now notice) even hard spaces being treated as
- parts of words (which I would rather not see). And now that I'm
- looking at it, I see that also the period gets treated as part of
- the keyword, as in a Finnish date format: "18.12.1992", which I also
- rather like than dislike. Periods I can only spot in cases where a
- regular alphanumeric immediately follows.
-
- I've heard nobody confirm or disconfirm these things about commas and
- hyphens. Mervyn, does your Orbis treat hyphens as separators?
-
- j-p takala
- jtakala@cc.helsinki.fi
-