If you read this file _as_is_, just ignore the funny characters you
see. It is written in the POD format (see perlpod manpage) which is
specially designed to be readable as is.

The original of this documentation is written in Simplified Chinese,
in the EUC-CN encoding.

=head1 NAME

perlcn - Simplified Chinese Perl guide

=head1 DESCRIPTION

Welcome to the world of Perl!

Starting with version 5.8.0, Perl has extensive Unicode support, and
with it support for many encodings beyond the Latin-based ones; CJK
(Chinese, Japanese, Korean) is one of them. Unicode is an international
standard that aims to cover all the characters of the world: the West,
the East, and everything in between (Greek, Syriac, Arabic, Hebrew,
Indic, Native American scripts, and so on). It also accommodates
characters specific to various operating systems and platforms (such as
PC and Macintosh).

Perl itself works in Unicode. This means that string data inside Perl
can be represented as Unicode, and that Perl's functions and operators
(regular expression matching, for example) operate on Unicode as well.
For input and output, to handle data stored in encodings that predate
Unicode, Perl provides the Encode module, which lets you easily read
and write data in those legacy encodings.
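
As a minimal sketch of what the Encode module does, the following
decodes EUC-CN bytes into a Perl character string and encodes that
string again. The byte string (the EUC-CN form of a two-character
Chinese word) and the variable names are assumptions chosen for
illustration.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Assumed sample input: four EUC-CN bytes that spell a two-character word.
my $euc_bytes = "\xC2\xE6\xCD\xD5";

# Bytes in a legacy encoding -> Perl character string.
my $chars = decode('euc-cn', $euc_bytes);

# Character string -> bytes, either as UTF-8 or back to EUC-CN.
my $utf8_bytes = encode('UTF-8', $chars);
my $round_trip = encode('euc-cn', $chars);

printf "%d bytes in, %d characters, %d UTF-8 bytes out\n",
    length($euc_bytes), length($chars), length($utf8_bytes);
# prints "4 bytes in, 2 characters, 6 UTF-8 bytes out"
```

Decoding at input and encoding at output keeps everything in between
in character terms, which is what the rest of this guide relies on.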

The Encode extension supports the following Simplified Chinese
encodings ('gb2312' means 'euc-cn'):

    euc-cn      Unix extended character set, commonly known as the GB (Guobiao) code
    gb2312-raw  The raw (low-bit) GB2312 character map
    gb12345     The raw Traditional Chinese encoding used in China
    iso-ir-165  GB2312 + GB6345 + GB8565 + additional characters
    cp936       Code Page 936; can also be specified as 'GBK' (Extended GB code)
    hz          7-bit escaped GB2312 encoding

For example, to convert a file in the EUC-CN encoding to Unicode
(UTF-8), you only need to type the following command:

    perl -Mencoding=euc-cn,STDOUT,utf8 -pe1 < file.euc-cn > file.utf8

Perl also comes with "piconv", a character-set conversion utility
written entirely in Perl. It is used as follows:

    piconv -f euc-cn -t utf8 < file.euc-cn > file.utf8
    piconv -f utf8 -t euc-cn < file.utf8 > file.euc-cn
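
The same conversion can be sketched inside a program with PerlIO
encoding layers. The helper name convert_file and its argument order
are hypothetical, introduced only for this illustration; the
":encoding(...)" layer itself is standard Perl.

```perl
use strict;
use warnings;

# Hypothetical helper: copy $src to $dst, decoding from $from and
# encoding to $to as the data passes through.
sub convert_file {
    my ($src, $from, $dst, $to) = @_;
    open my $in,  "<:encoding($from)", $src or die "Cannot read $src: $!";
    open my $out, ">:encoding($to)",   $dst or die "Cannot write $dst: $!";
    print {$out} $_ while <$in>;
    close $in;
    close $out or die "Cannot close $dst: $!";
    return;
}

# Example call: script.pl file.euc-cn euc-cn file.utf8 UTF-8
convert_file(@ARGV) if @ARGV == 4;
```

The layers decode each line as it is read and encode it as it is
written, so the loop body only ever sees character strings.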
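
In addition, with the encoding module you can easily write code that
works in units of characters, as shown below:

    #!/usr/bin/env perl
    # Parse string literals as euc-cn (the script itself must be saved in
    # EUC-CN); standard input and standard output are also set to euc-cn
    use encoding 'euc-cn', STDIN => 'euc-cn', STDOUT => 'euc-cn';
    print length("骆驼");            #  2 (double quotes mean characters)
    print length('骆驼');            #  4 (single quotes mean bytes)
    print index("谆谆教诲", "蛔唤"); # -1 (the substring is not present)
    print index('谆谆教诲', '蛔唤'); #  1 (match starts at the second byte)

In the last line of the example, the second byte of the first "谆"
combines with the first byte of the second "谆" to form the EUC-CN
character "蛔"; the second byte of the second "谆" in turn combines
with the first byte of "教" to form "唤". Matching in units of
characters, as the double-quoted version does, avoids this false match,
which used to be a common problem when searching EUC-CN text.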
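
The same false match can be reproduced without the encoding pragma,
which newer Perl releases deprecate. The following is a sketch that
assumes explicit Encode::decode calls instead; the byte strings are the
EUC-CN forms of the text and pattern used in the example, and the
variable names are illustrative.

```perl
use strict;
use warnings;
use Encode qw(decode);

# EUC-CN bytes of the four-character text and the two-character pattern.
my $text_bytes    = "\xD7\xBB\xD7\xBB\xBD\xCC\xBB\xE5";
my $pattern_bytes = "\xBB\xD7\xBB\xBD";

# Byte semantics: the pattern straddles character boundaries and
# "matches" starting at the second byte (offset 1).
my $byte_pos = index($text_bytes, $pattern_bytes);

# Character semantics: decode both sides first, then search.
my $char_pos = index(decode('euc-cn', $text_bytes),
                     decode('euc-cn', $pattern_bytes));

print "bytes: $byte_pos, characters: $char_pos\n";
# prints "bytes: 1, characters: -1"
```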

=head2 Extra Chinese encodings

If you need more Chinese encodings, you can download the
Encode::HanExtra module from CPAN (L<http://www.cpan.org/>). It
currently provides the following encoding:

    gb18030     Extended GB code, including Traditional Chinese

In addition, the Encode::HanConvert module provides two encodings for
converting between Simplified and Traditional Chinese:

    big5-simp   Big5 Traditional Chinese to and from Unicode Simplified Chinese
    gbk-trad    GBK Simplified Chinese to and from Unicode Traditional Chinese

To convert between GBK and Big5, see the b2g.pl and g2b.pl programs
bundled with that module, or use the following in your own program:

    use Encode::HanConvert;
    $euc_cn = big5_to_gb($big5); # convert from Big5 to GBK
    $big5 = gb_to_big5($euc_cn); # convert from GBK to Big5

=head2 Further information

Please see the extensive documentation that comes with Perl
(unfortunately all of it written in English) to learn more about Perl
and about how to use Unicode. External resources are also plentiful:

=head2 Web sites with Perl resources

=over 4

=item L<http://www.perl.com/>

The Perl home page (maintained by O'Reilly)

=item L<http://www.cpan.org/>

The Comprehensive Perl Archive Network

=item L<http://lists.perl.org/>

A directory of Perl mailing lists

=back

=head2 Web sites for learning Perl

=over 4

=item L<http://www.oreilly.com.cn/html/perl.html>

O'Reilly's Perl books in Simplified Chinese

=back

=head2 Perl user groups

=over 4

=item L<http://www.pm.org/groups/asia.shtml#China>

A list of Perl Mongers groups in China

=back

=head2 Unicode-related web sites

=over 4

=item L<http://www.unicode.org/>

The Unicode Consortium (the maker of the Unicode standard)

=item L<http://www.cl.cam.ac.uk/%7Emgk25/unicode.html>

UTF-8 and Unicode FAQ for Unix/Linux

=back

=head1 SEE ALSO

L<Encode>, L<Encode::CN>, L<encoding>, L<perluniintro>, L<perlunicode>

=head1 AUTHORS

Jarkko Hietaniemi E<lt>jhi@iki.fiE<gt>

Autrijus Tang (唐宗汉) E<lt>autrijus@autrijus.orgE<gt>

=cut