NetNews Usenet Archive 1993 #3

home *** CD-ROM | disk | FTP | other *** search

/ NetNews Usenet Archive 1993 #3 / NN_1993_3.iso / spool / sci / lang / japan / 5175 < prev next >

Wrap

Text File | 1993-01-24 | 1.4 KB | 36 lines

Newsgroups: sci.lang.japan Path: sparky!uunet!gatech!paladin.american.edu!howland.reston.ans.net!sol.ctr.columbia.edu!The-Star.honeywell.com!umn.edu!csus.edu!netcom.com!kschwarz From: kschwarz@netcom.com (Kenneth L. Schwarz) Subject: Sorting Japanese Message-ID: <1993Jan23.204214.29441@netcom.com> Organization: Netcom - Online Communication Services (408 241-9760 guest) Date: Sat, 23 Jan 1993 20:42:14 GMT Lines: 26 Does anyone know how sorts are implemented on lists of Japanese words? For example, let's say we have a database with 10 fields per record, and these records contain Roman characters, kanji, or kana, or any combination thereof. If we want to be able to sort this database with any of the fields as the sort key, what do we do? I know that in Japan telephone books are sorted by aiueo jun (that is, by the phonetic order of a kana matrix.) And CDs are sorted in a similar way in record stores. So it seems likely that a phonetic sort is the most useful organization for human's to deal with. But that would not be so for computers. Sorting kanji by stroke count is easy: there is one and only one stroke count for any particular glyph. But to do a phonetic sort would require that the algorithm recognize the particular reading of the character in context. This is a bear of a problem. Still, there must be ways that Japanese databases handle such problems. Anybody know how it's done? - Ken kschwarz@netcom.com