home *** CD-ROM | disk | FTP | other *** search
- Newsgroups: sci.lang.japan
- Path: sparky!uunet!gatech!paladin.american.edu!howland.reston.ans.net!sol.ctr.columbia.edu!The-Star.honeywell.com!umn.edu!csus.edu!netcom.com!kschwarz
- From: kschwarz@netcom.com (Kenneth L. Schwarz)
- Subject: Sorting Japanese
- Message-ID: <1993Jan23.204214.29441@netcom.com>
- Organization: Netcom - Online Communication Services (408 241-9760 guest)
- Date: Sat, 23 Jan 1993 20:42:14 GMT
- Lines: 26
-
- Does anyone know how sorts are implemented on lists of Japanese words?
- For example, let's say we have a database with 10 fields per record, and
- these records contain Roman characters, kanji, or kana, or any combination
- thereof. If we want to be able to sort this database with any of the fields as
- the sort key, what do we do?
-
- I know that in Japan telephone books are sorted by aiueo jun (that is, by the
- phonetic order of a kana matrix.) And CDs are sorted in a similar way
- in record stores. So it seems likely that a phonetic sort is the most
- useful organization for human's to deal with.
-
- But that would not be so for computers. Sorting kanji by stroke count is
- easy: there is one and only one stroke count for any particular glyph.
- But to do a phonetic sort would require that the algorithm recognize the
- particular reading of the character in context. This is a bear of a
- problem.
-
- Still, there must be ways that Japanese databases handle such problems.
- Anybody know how it's done?
-
- - Ken
- kschwarz@netcom.com
-
-
-
-
-