NetNews Usenet Archive 1993 #1

home *** CD-ROM | disk | FTP | other *** search

/ NetNews Usenet Archive 1993 #1 / NN_1993_1.iso / spool / comp / std / internat / 1093 < prev next >

Wrap

Internet Message Format | 1993-01-07 | 1.5 KB

Path: sparky!uunet!olivea!mintaka.lcs.mit.edu!ai-lab!wheat-chex!glenn From: glenn@wheat-chex.ai.mit.edu (Glenn A. Adams) Newsgroups: comp.std.internat Subject: Re: Dumb Americans (was INTERNATIONALIZATION: JAPAN, FAR EAST) Keywords: Han Kanji Katakana Hirugana ISO10646 Unicode Codepages Message-ID: <1ii0n7INN6ig@life.ai.mit.edu> Date: 7 Jan 93 19:36:07 GMT References: <1993Jan1.114158.17149@prl.dec.com> <1i2emiINN2td@rodan.UU.NET> <1993Jan7.065611.15193@fcom.cc.utah.edu> Organization: MIT Artificial Intelligence Laboratory Lines: 16 NNTP-Posting-Host: wheat-chex.ai.mit.edu In article <1993Jan7.065611.15193@fcom.cc.utah.edu> terry@cs.weber.edu (A Wizard of Earth C) writes: >This example becomes more of a problem when translated to one of a glyph >variant between Chinese and Japanese. I agree that the problem is one >of words, not characters -- however, in ideographic languages, words *are* >characters. The example is not as artificial as you make out. I'm afraid this is incorrect. Ideographic characters in Chinese, Japanese, Korean, and Vietnamese (chu+" no>m va! chu+" ha'n) *are not* equivalent to words. Rather, they are mostly equivalent to morphemes (although there are a number of multiple character morphemes). Many words in these languages are composed from multiple morphemes, e.g., modern Chinese has about 2.2 morphemes (characters) per word on average. For example, in Chinese "zhong1guo2" 'China' and "ri4ben3" 'Japan' are single words but contain two morphemes each. Glenn Adams