NetNews Usenet Archive 1993 #1

home *** CD-ROM | disk | FTP | other *** search

/ NetNews Usenet Archive 1993 #1 / NN_1993_1.iso / spool / comp / unix / bsd / 10906 < prev next >

Wrap

Internet Message Format | 1993-01-07 | 2.2 KB

Xref: sparky comp.unix.bsd:10906 comp.std.internat:1023 Newsgroups: comp.unix.bsd,comp.std.internat Path: sparky!uunet!psinntp!ficc!peter From: peter@ferranti.com (peter da silva) Subject: Re: Dumb Americans (was INTERNATIONALIZATION: JAPAN, FAR EAST) Message-ID: <id.E1FW.PX5@ferranti.com> Followup-To: comp.std.internat Keywords: Han Kanji Katakana Hirugana ISO10646 Unicode Codepages Organization: Xenix Support, FICC References: <1ht8v4INNj7i@rodan.UU.NET> <1993Jan1.094759.8021@fcom.cc.utah.edu> <1i2k09INN4hl@rodan.UU.NET> Date: Mon, 4 Jan 1993 15:35:27 GMT Lines: 34 In article <1i2k09INN4hl@rodan.UU.NET> avg@rodan.UU.NET (Vadim Antonov) writes: > You omitted one small "detail" -- you need to know the language of the word > the letter belongs to to make a conversion. Yes. > Since Unicode does not > provide for specifying the language it is obvious that is should be > obtained from user or kept somewhere off the text. In both cases > as our program ALREADY knows the language from the environment it knows > the particular (small) alphabet -- no need to use multibyte encodings! Unless you want your document to contain multilingual data. *your* solution is only useful for documents containing a single language, in which case why bother with ISO8859.*... a separate character code table for every language is quite acceptable. For that matter, you can take the next step and say "why standardise character sets when every application has specific needs? Financial packages need dozens of currency symbols, for example, and mathematics needs a whole host of its own symbols... each application knows the set it needs to use... and for most text documents a 6-bit code is quite adequate"... You have identified two problems with Unicode and ISO 10646: case conversion and lexical ordering. > See how Unicode renders itself useless? Unless you want to work on multilingual documents, yes. It could be better, certainly, but to say it's *useless* is hyperbole. -- Peter da Silva `-_-' Ferranti International Controls Corporation 'U` Sugar Land, TX 77487-5012 USA +1 713 274 5180 "Zure otsoa besarkatu al duzu gaur?"