NetNews Usenet Archive 1992 #20

home *** CD-ROM | disk | FTP | other *** search

/ NetNews Usenet Archive 1992 #20 / NN_1992_20.iso / spool / comp / std / internat / 645 < prev next >

Wrap

Text File | 1992-09-09 | 1.5 KB | 35 lines

Newsgroups: comp.std.internat From: gtoal@pizzabox.demon.co.uk (Graham Toal) Path: sparky!uunet!pipex!demon!pizzabox.demon.co.uk!gtoal Subject: Latin unification in ISO 10646 Distribution: world References: <1992Sep9.163417.8803@corax.udac.uu.se> Organization: Cuddlehogs Anonymous Lines: 22 Date: Thu, 10 Sep 1992 12:21:37 +0000 Message-ID: <716144296snx@pizzabox.demon.co.uk> Sender: usenet@gate.demon.co.uk In article <1992Sep9.163417.8803@corax.udac.uu.se> andersa@Riga.DoCS.UU.SE writes: >Programmers are used to being able to perform case conversion on >letters of the ASCII table by simply adding or subtracting a certain >constant to the character code, given that the code is within a >particular range (A-Z or a-z). With later ISO standards, this is >not quite such a simple task due to the sometimes ad-hoc layout of >lowercase letters with respect to corresponding uppercase letters >(examples available upon request), but it would still be possible >using tables showing the relationship. The upper/lower conversions were a fortunate hack that the original 7-bit character set allowed. These should nowadays be done by the application, along with similar functions to add or remove accents. The invariance of tolower(toupper(x)) has been broken since ESZET -> "SS" -> "ss" first happened. Even if you want to use the simple facilities of the language, rather than writing a custom function, you're still hampered by the <locale.h> stuff being the trivial implementation in just about every Ansi library around :-( G