- ;ENGLISH.RUL 2-May-90
- ;Rules file for Englex.
-
- ;Copyright (C) 1991, Summer Institute of Linguistics, Inc.
- ;All rights reserved.
- ;See ENGLEX.DOC for fair use policy.
-
- ;edit history
- ;This description of English is based on the article "A two-level
- ;morphological analysis of English," by Lauri Karttunen and
- ;K. Wittenburg, Texas Linguistic Forum 22:217-228 (1983).
- ;See appendix A of the PC-KIMMO book for an exposition of the rules
- ;in this file.
- ;This version of the English rules differs slightly from the version
- ;supplied on the original PC-KIMMO release diskette.
- ;Note carefully that * is used as the NULL symbol (since 0 is an alphabetic character).
- ;Tables generated by KGEN 2-May-91
- ; 5-Nov-91 version 0.9
- ;25-Nov-91 deleted s:0 and i:y rules (moved after END keyword)
- ;26-Nov-91 version 1.0
-
- ;CONTENTS
- ;Defaults
- ;Epenthesis
- ;y:i-spelling
- ;Elision
- ;Gemination
- ;END
- ;s-deletion
- ;i:y-spelling
-
- ;' = apostrophe
- ;- = hyphen
- ;` = stress
- ;+ = morpheme break
-
- ALPHABET
- b c d f g h j k l m n p q r s t v w x y z a e i o u ' - ` +
- 0 1 2 3 4 5 6 7 8 9
- NULL *
- ANY @
- BOUNDARY #
- SUBSET C b c d f g h j k l m n p q r s t v w x y z
- SUBSET Csib s x z
- SUBSET Cpal c g
- SUBSET V a e i o u
- SUBSET Vbk a o u
-
- RULE "Defaults" 1 31
- b c d f g h j k l m n p q r s t v w x y z a e i o u ' - + ` @
- b c d f g h j k l m n p q r s t v w x y z a e i o u ' - - * @
- 1: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
-
- RULE "Defaults" 1 26
- + y * e * * * * * * * * * * * 0 1 2 3 4 5 6 7 8 9 @
- * i e * b d f g l m n p r s t 0 1 2 3 4 5 6 7 8 9 @
- 1: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
-
- ;Epenthesis
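- ;Illustrative lexical:surface pair (an assumed example, not taken from
- ;the lexicon files; 0 marks a null, as in the listings after END):
-
- ; fox+0s
- ; fox0es
-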
- RULE " 0:e <= [Csib|ch|sh|y:i] +:0_s [+:0|#]" 6 8
- Csib + s # c h y @
- Csib * s # c h i @
- 1: 2 1 6 1 5 1 2 1
- 2: 2 3 6 1 5 1 2 1
- 3: 2 1 4 1 5 1 2 1
- 4: 2 0 6 0 5 2 2 1
- 5: 2 1 6 1 5 2 2 1
- 6: 2 3 6 1 5 2 2 1
-
- RULE " 0:e => [Csib|ch|sh|y:i|o] +:0_s [+:0|#]" 7 10
- * Csib + s # c h y o @
- e Csib * s # c h i o @
- 1: 0 2 1 7 1 6 1 2 2 1
- 2: 0 2 3 7 1 6 1 2 2 1
- 3: 4 2 1 7 1 6 1 2 2 1
- 4. 0 0 0 5 0 0 0 0 0 0
- 5. 0 0 3 0 1 0 0 0 0 0
- 6: 0 2 1 7 1 6 2 2 2 1
- 7: 0 2 3 7 1 6 2 2 2 1
-
- ;y:i-spelling
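- ;Illustrative pairs (assumed examples; 0 marks a null). The => rule
- ;licenses y:i in the first pair; the /<= rule blocks y:i before i or ',
- ;so the second pair keeps its y:
-
- ; spy+ed spy+ing
- ; spi0ed spy0ing
-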
- RULE " y:i => @:C (+:0)_+:0" 4 5
- y @ e + @
- i C * * @
- 1: 0 2 1 1 1
- 2: 3 2 4 4 1
- 3. 0 0 0 1 0
- 4: 3 2 1 4 1
-
- RULE " y:i /<= @:C (+:0)_+:0 [i|']" 5 7
- @ y e + ' i @
- C i * * ' i @
- 1: 2 1 1 1 1 1 1
- 2: 2 3 5 5 1 1 1
- 3: 2 1 1 4 1 1 1
- 4: 2 1 1 1 0 0 1
- 5: 2 3 1 1 1 1 1
-
- ;Elision
- ;three Elision rules
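- ;Illustrative pair for the first context, VCC*___+:0 V (an assumed
- ;example; 0 marks a null):
-
- ; move+ed
- ; mov00ed
-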
- RULE "e:0 => VCC*___+:0 V | C[V|y]___+:0 e | Cu___+:0 V" 11 10
- e @ C + e y u i ` @
- * V C * e y u i * @
- 1: 0 2 6 1 2 6 2 2 1 1
- 2: 0 2 3 1 2 3 2 2 2 1
- 3: 4 7 3 1 7 3 11 7 3 1
- 4. 0 0 0 5 0 0 0 0 4 0
- 5. 0 2 0 0 2 0 2 2 5 0 ;prohibits e:0 before y suffix
- 6: 0 7 6 1 7 10 11 7 6 1
- 7: 8 2 3 1 2 3 2 2 7 1
- 8. 0 0 0 9 0 0 0 0 8 0
- 9. 0 0 0 0 2 0 0 0 9 0
- 10: 8 7 6 1 7 10 11 7 10 1
- 11: 4 2 3 1 2 3 2 2 11 1
-
- ;Elision prohibited after Cpal and before back vowel
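- ;Illustrative pair (an assumed example; 0 marks a null): the e after
- ;Cpal c is kept before the back vowel a.
-
- ; trace+able
- ; trace0able
-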
- RULE " e:0 /<= V C* Cpal___+:0 Vbk" 5 7
- V C Cpal e + Vbk @
- V C Cpal * * Vbk @
- 1: 2 1 1 1 1 2 1
- 2: 2 2 3 1 1 2 1
- 3: 2 1 1 4 1 2 1
- 4: 2 1 1 1 5 2 1
- 5: 2 1 1 1 1 0 1
-
- ;Gemination
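- ;Illustrative pair (an assumed example; 0 marks a null). The stress
- ;mark ` is deleted and g is doubled before the vowel-initial suffix:
-
- ; `big0+er
- ; 0bigg0er
-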
- RULE " 0:{b,d,f,g,l,m,n,p,r,s,t} <= `:0 C* V {b,d,f,g,l,m,n,p,r,s,t}_+:0[V|y:@]" 5 18
- ` C V b + y d f g l m n p r s t y @
- * C V b * @ d f g l m n p r s t y @
- 1: 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
- 2: 2 2 3 2 1 1 2 2 2 2 2 2 2 2 2 2 2 1
- 3: 2 1 1 4 1 1 4 4 4 4 4 4 4 4 4 4 1 1
- 4: 2 1 1 1 5 1 1 1 1 1 1 1 1 1 1 1 1 1
- 5: 2 1 0 1 1 0 1 1 1 1 1 1 1 1 1 1 0 1
-
- RULE " 0:{b,d,f,g,l,m,n,p,r,s,t} => C* V {b,d,f,g,l,m,n,p,r,s,t}_+:0[V|y:@]" 15 28
- * * * * * * * * * * * C V b + y d f g l m n p r s t y @
- b d f g l m n p r s t C V b * @ d f g l m n p r s t y @
- 1: 0 0 0 0 0 0 0 0 0 0 0 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
- 2: 0 0 0 0 0 0 0 0 0 0 0 1 2 3 1 1 6 7 8 9 10 11 12 13 14 15 1 1
- 3: 4 0 0 0 0 0 0 0 0 0 0 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
- 4. 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0
- 5. 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 1 0 0 0 0 0 0 0 0 0 0 1 0
- 6: 0 4 0 0 0 0 0 0 0 0 0 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
- 7: 0 0 4 0 0 0 0 0 0 0 0 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
- 8: 0 0 0 4 0 0 0 0 0 0 0 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
- 9: 0 0 0 0 4 0 0 0 0 0 0 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
- 10: 0 0 0 0 0 4 0 0 0 0 0 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
- 11: 0 0 0 0 0 0 4 0 0 0 0 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
- 12: 0 0 0 0 0 0 0 4 0 0 0 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
- 13: 0 0 0 0 0 0 0 0 4 0 0 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
- 14: 0 0 0 0 0 0 0 0 0 4 0 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
- 15: 0 0 0 0 0 0 0 0 0 0 4 1 2 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
-
- END
-
- ;These s-deletion rules handle lexical:surface pairs such as:
-
- ; boy+s+'s dallas+'s
- ; boy0s0'0 dallas0'0
-
- ;If you want to use these rules, move them before the END keyword and
- ;comment out the +' entry in LEXICON GENITIVE in the file english.lex.
-
- ;s-deletion
- RULE " s:0 <= +:0 (0:e) s +:0 '_" 6 7
- s s + * s ' @
- * @ * e s ' @
- 1: 1 1 2 1 1 1 1
- 2: 1 4 2 3 4 1 1
- 3: 1 4 2 1 4 1 1
- 4: 1 1 5 1 1 1 1
- 5: 1 4 2 3 4 6 1
- 6: 1 0 2 1 0 1 1
-
- RULE " s:0 => s +:0 '_" 4 5
- s s + ' @
- * s * ' @
- 1: 0 2 1 1 1
- 2: 0 2 3 1 1
- 3: 0 2 1 4 1
- 4: 1 2 1 1 1
-
- ;These i:y rules account for lexical:surface pairs such as:
-
- ;die+ing tie+ing
- ;dy00ing ty00ing
-
- ;If you want to use these rules, move the two i:y rules before the END keyword,
- ;replace the Elision rule with the one below, and in the file verb.lex
- ;comment out entries for `dying, `tying, `lying, be`lying, and `vying.
-
- ;i:y-spelling
- ;e:@ instead of e:0 to force Elision to apply
- RULE " i:y <= ___e:@ +:0 i" 4 6
- i i e + i @
- y @ @ * i @
- 1: 1 2 1 1 2 1
- 2: 1 2 3 1 2 1
- 3: 1 2 1 4 2 1
- 4: 1 0 1 1 0 1
-
- RULE " i:y => ___e:0 +:0 i" 4 5
- i e + i @
- y * * i @
- 1: 2 1 1 1 1
- 2. 0 3 0 0 0
- 3. 0 0 4 0 0
- 4. 0 0 0 1 0
-
- ;Elision
- ;three Elision rules plus i:y-spelling
- RULE "e:0 => VCC*___+:0 V | C[V|y]___+:0 e | Cu___+:0 V | i:y___+:0 i" 14 11
- e @ C + e y u i i ` @
- * V C * e y u y i * @
- 1: 0 2 6 1 2 6 2 12 2 1 1
- 2: 0 2 3 1 2 3 2 12 2 2 1
- 3: 4 7 3 1 7 3 11 12 7 3 1
- 4. 0 0 0 5 0 0 0 0 0 4 0
- 5. 0 2 0 0 2 0 2 0 2 5 0 ;prohibits e:0 before y suffix
- 6: 0 7 6 1 7 10 11 12 7 6 1
- 7: 8 2 3 1 2 3 2 12 2 7 1
- 8. 0 0 0 9 0 0 0 0 0 8 0
- 9. 0 0 0 0 2 0 0 0 0 9 0
- 10: 8 7 6 1 7 10 11 12 7 10 1
- 11: 4 2 3 1 2 3 2 12 2 11 1
- 12: 13 2 6 1 2 6 2 12 2 12 1
- 13. 0 0 0 14 0 0 0 0 0 13 0
- 14. 0 0 0 0 0 0 0 0 2 14 0
-
-