home *** CD-ROM | disk | FTP | other *** search
- Path: sparky!uunet!dtix!darwin.sura.net!wupost!waikato.ac.nz!canterbury.ac.nz!phys169
- From: phys169@csc.canterbury.ac.nz
- Newsgroups: comp.os.msdos.programmer
- Subject: Sorting extended character sets (was Re: The Box-Drawing Characters
- Message-ID: <1992Sep14.113059.805@csc.canterbury.ac.nz>
- Date: 13 Sep 92 23:30:58 GMT
- References: <5265@krafla.rhi.hi.is> <1992Aug31.184726.1583@mits.mdata.fi> <la5apvINN8kb@exodus.Eng.Sun.COM> <Bu9Lv8.6pL@ireq.hydro.qc.ca>
- Organization: University of Canterbury, Christchurch, New Zealand
- Lines: 18
-
- In article <Bu9Lv8.6pL@ireq.hydro.qc.ca>, beaurega@ireq.hydro.qc.ca (Denis Beauregard) writes:
- > There is a "natural" method of sorting according to which language
- > (i.e. French, not Basic :-) ). DOS sort in recent versions takes care
- > of that. The method is quite simple (no idea if they do that, but I
- > do that in my own qsort-based psort program) : if under 0x80, keep it
- > as is (lowercase or lowercase if needed), if over, using a table lookup,
- > remap the character...
-
- I've always wonderred what the "proper" way is to sort letters with accent
- marks (grave, acute, umlaut, etc). Could somebody please explain the order of,
- for example, the standard IBM PC extended characters: 131, 132, 133, 134 and
- 160, which, presumably, come after 97 (a) but before 98 (b)?
-
- I presume all this changes when different country pages are selected, and when
- you are running MS-Windows, and when outputting to a variety of printers all
- with their own characters sets (and so collating sequences).
-
- Mark Aitchison, University of Canterbury, New Zealand.
-