home *** CD-ROM | disk | FTP | other *** search
- Path: sparky!uunet!olivea!mintaka.lcs.mit.edu!ai-lab!wheat-chex!glenn
- From: glenn@wheat-chex.ai.mit.edu (Glenn A. Adams)
- Newsgroups: comp.std.internat
- Subject: Re: Dumb Americans (was INTERNATIONALIZATION: JAPAN, FAR EAST)
- Keywords: Han Kanji Katakana Hirugana ISO10646 Unicode Codepages
- Message-ID: <1ii0n7INN6ig@life.ai.mit.edu>
- Date: 7 Jan 93 19:36:07 GMT
- References: <1993Jan1.114158.17149@prl.dec.com> <1i2emiINN2td@rodan.UU.NET> <1993Jan7.065611.15193@fcom.cc.utah.edu>
- Organization: MIT Artificial Intelligence Laboratory
- Lines: 16
- NNTP-Posting-Host: wheat-chex.ai.mit.edu
-
- In article <1993Jan7.065611.15193@fcom.cc.utah.edu> terry@cs.weber.edu (A Wizard of Earth C) writes:
- >This example becomes more of a problem when translated to one of a glyph
- >variant between Chinese and Japanese. I agree that the problem is one
- >of words, not characters -- however, in ideographic languages, words *are*
- >characters. The example is not as artificial as you make out.
-
- I'm afraid this is incorrect. Ideographic characters in Chinese, Japanese,
- Korean, and Vietnamese (chu+" no>m va! chu+" ha'n) *are not* equivalent
- to words. Rather, they are mostly equivalent to morphemes (although there are
- a number of multiple character morphemes). Many words in these languages
- are composed from multiple morphemes, e.g., modern Chinese has about 2.2
- morphemes (characters) per word on average. For example, in Chinese
- "zhong1guo2" 'China' and "ri4ben3" 'Japan' are single words but contain
- two morphemes each.
-
- Glenn Adams
-