home *** CD-ROM | disk | FTP | other *** search
- Path: sparky!uunet!zaphod.mps.ohio-state.edu!cs.utexas.edu!sun-barr!sh.wide!wnoc-tyo-news!cs.titech!titccy.cc.titech!necom830!mohta
- From: mohta@necom830.cc.titech.ac.jp (Masataka Ohta)
- Newsgroups: comp.std.internat
- Subject: Re: Dumb Americans (was INTERNATIONALIZATION: JAPAN, FAR EAST)
- Keywords: Han Kanji Katakana Hirugana ISO10646 Unicode Codepages
- Message-ID: <2676@titccy.cc.titech.ac.jp>
- Date: 10 Jan 93 03:34:26 GMT
- References: <1993Jan7.065611.15193@fcom.cc.utah.edu> <1993Jan8.094119.6795@prl.dec.com> <1993Jan9.031217.27425@fcom.cc.utah.edu> <1in56mINNnhq@life.ai.mit.edu>
- Sender: news@titccy.cc.titech.ac.jp
- Organization: Tokyo Institute of Technology
- Lines: 18
-
- In article <1in56mINNnhq@life.ai.mit.edu>
- glenn@wheat-chex.ai.mit.edu (Glenn A. Adams) writes:
-
- >>>Tell me how I sort on stroke count in Unicode without ``locale-specific
- >>>algorithms or hard-coded data''?
- >
- >If it is Han characters that are being sorted, then Unicode already orders
- >them according to KangXi radicals (214) and (additional) stroke count.
-
- Different characters are mapped to a single code point in 10646/Unicode.
-
- Different Characters with different stroke counts are mapped to a
- single code point in 10646/Unicode.
-
- Thus, if you use 10646/Unicode, stroke count is context dependent. Don't
- use it.
-
- Masataka Ohta
-