home *** CD-ROM | disk | FTP | other *** search
- Newsgroups: comp.std.internat
- From: gtoal@pizzabox.demon.co.uk (Graham Toal)
- Path: sparky!uunet!pipex!demon!pizzabox.demon.co.uk!gtoal
- Subject: Latin unification in ISO 10646
- Distribution: world
- References: <1992Sep9.163417.8803@corax.udac.uu.se>
- Organization: Cuddlehogs Anonymous
- Lines: 22
- Date: Thu, 10 Sep 1992 12:21:37 +0000
- Message-ID: <716144296snx@pizzabox.demon.co.uk>
- Sender: usenet@gate.demon.co.uk
-
-
- In article <1992Sep9.163417.8803@corax.udac.uu.se> andersa@Riga.DoCS.UU.SE writes:
- >Programmers are used to being able to perform case conversion on
- >letters of the ASCII table by simply adding or subtracting a certain
- >constant to the character code, given that the code is within a
- >particular range (A-Z or a-z). With later ISO standards, this is
- >not quite such a simple task due to the sometimes ad-hoc layout of
- >lowercase letters with respect to corresponding uppercase letters
- >(examples available upon request), but it would still be possible
- >using tables showing the relationship.
-
- The upper/lower conversions were a fortunate hack that the original 7-bit
- character set allowed. These should nowadays be done by the application,
- along with similar functions to add or remove accents. The invariance
- of tolower(toupper(x)) has been broken since ESZET -> "SS" -> "ss" first
- happened.
-
- Even if you want to use the simple facilities of the language, rather
- than writing a custom function, you're still hampered by the <locale.h> stuff
- being the trivial implementation in just about every Ansi library around :-(
-
- G
-