NetNews Usenet Archive 1993 #1

home *** CD-ROM | disk | FTP | other *** search

/ NetNews Usenet Archive 1993 #1 / NN_1993_1.iso / spool / comp / std / internat / 1126 < prev next >

Wrap

Internet Message Format | 1993-01-09 | 1.3 KB

Path: sparky!uunet!zaphod.mps.ohio-state.edu!cs.utexas.edu!sun-barr!sh.wide!wnoc-tyo-news!cs.titech!titccy.cc.titech!necom830!mohta From: mohta@necom830.cc.titech.ac.jp (Masataka Ohta) Newsgroups: comp.std.internat Subject: Re: Dumb Americans (was INTERNATIONALIZATION: JAPAN, FAR EAST) Keywords: Han Kanji Katakana Hirugana ISO10646 Unicode Codepages Message-ID: <2676@titccy.cc.titech.ac.jp> Date: 10 Jan 93 03:34:26 GMT References: <1993Jan7.065611.15193@fcom.cc.utah.edu> <1993Jan8.094119.6795@prl.dec.com> <1993Jan9.031217.27425@fcom.cc.utah.edu> <1in56mINNnhq@life.ai.mit.edu> Sender: news@titccy.cc.titech.ac.jp Organization: Tokyo Institute of Technology Lines: 18 In article <1in56mINNnhq@life.ai.mit.edu> glenn@wheat-chex.ai.mit.edu (Glenn A. Adams) writes: >>>Tell me how I sort on stroke count in Unicode without ``locale-specific >>>algorithms or hard-coded data''? > >If it is Han characters that are being sorted, then Unicode already orders >them according to KangXi radicals (214) and (additional) stroke count. Different characters are mapped to a single code point in 10646/Unicode. Different Characters with different stroke counts are mapped to a single code point in 10646/Unicode. Thus, if you use 10646/Unicode, stroke count is context dependent. Don't use it. Masataka Ohta