perlcn − XXXXXXXXX Perl XXXXXX |
XXXXXXXXX Perl XXXXXXX ! XX 5.8.0 XXXXXXX , Perl XXXXXXXXXXXXXXX Unicode ( XXXXXXX ) XXXXXX , XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX ; CJK ( XXXXXXX ) XXXXXXXXXXXXXXXXX . Unicode XXXXXXXXXXXXXXXXXXX , XXXXXXXXXXXXXXXXXXXXXXXXXXXXX: XXXXXXXXX , XXXXXXXXXX , XXXXXXXXXXXXXXXXXX ( XXXXXXXX , XXXXXXXXXXX , XXXXXXXXXX , XXXXXXXXX , XXXXXXX , XXXXXXXXXX , XXXX ). XXXXXXXXXXXXXXXXXXXXXXXXXXX ( XX PC XXXXXXXXXX ). Perl XXXXXXX Unicode XXXXXXXXX . XXXXX Perl XXXXXXXXXXXXXXXXXXXXXXXX Unicode XXX ; Perl XXXXXXXXXXXXXX ( XXXXXXXXXXXXXXXXX ) XXXXXXX Unicode XXXXXXXXX . XXXXXXXXXXXXXXXXX , XXXXXXXXXXX Unicode XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX , Perl XXXXXXX Encode XXXXXXXX , XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX . Encode XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX (’gb2312’ XXX ’euc−cn’): euc−cn Unix XXXXXXXXXX, XXXXXXXXXXXXXXXXXXXX gb2312−raw XXXXXXXXXXXXX (XXXXXXX) GB2312 XXXXX gb12345 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX iso−ir−165 GB2312 + GB6345 + GB8565 + XXXXXXXX cp936 XXXXXXX 936, XXXXXXXXX 'GBK' (XXXXXXXXXXXXX) XXXXXX hz 7 XXXXXXXXXXX GB2312 XXXXX XXXXXXXXX , XX EUC-CN XXXXXXXXXXXXXXXXX Unicode, XXXXXXXXXXXXXXXXXXXXX: perl −Mencoding=euc−cn,STDOUT,utf8 −pe1 < file.euc−cn > file.utf8 Perl XXXXXXXXXX "piconv", XXXXXXXXXXX Perl XXXXXXXXXXXXXXXXXXXXXXXXXX , XXXXXXXXX: piconv −f euc−cn −t utf8 < file.euc−cn > file.utf8 piconv −f utf8 −t euc−cn < file.utf8 > file.euc−cn XXXXX , XXXXX encoding XXXX , XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX , XXXXXXXXX: #!/usr/bin/env perl # XXXXXX euc−cn XXXXXXXX; XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX euc−cn XXXXX use encoding 'euc−cn', STDIN => 'euc−cn', STDOUT => 'euc−cn'; print length("XXXX"); # 2 (XXXXXXXXXXXXXXX) print length('XXXX'); # 4 (XXXXXXXXXXXXXXXX) print index("XXXXXXXXX", "XXXXXX"); # −1 (XXXXXXXXXXXXXXXXXX) print index('XXXXXXXXX', 'XXXXXX'); # 1 (XXXXXXXXXXXXXXXXX) XXXXXXXXXXXXXXXXXXXX , " XX " XXXXXXXXXXXXXXXX " XX " XXXXXXXXXXXXXXXXXXXXXX EUC-CN XXXXXX " XXX "; " XX " XXXXXXXXXXXXXXXXXXX " XXX " XXXXXXXXXXXXXXXXXXXXXX " XXX ". XXXXXXXXXXXX EUC-CN XXXXXXXXXXXXXXXXXXXXXXXXXX . |
XXXXXXXXXXXXXXXXX |
XXXXXXXXXXXXXXXXXXXXXXXXXXXX , XXXXXXX CPAN (<http://www.cpan.org/>) XXX Encode::HanExtra XXXX . XXXXXXXXXXXXXXXXXXXXXXXXXXXX: gb18030 XXXXXXXXXXXXXXXXXX, XXXXXXXXXXXXXXX XXXXX , Encode::HanConvert XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX: big5−simp Big5 XXXXXXXXXXX Unicode XXXXXXXXXXXXX gbk−trad GBK XXXXXXXXXXX Unicode XXXXXXXXXXXXX XXXXXXX GBK XX Big5 XXXXXXXXX , XXXXXXXXXXXXXXXXXXXXXX b2g.pl XX g2b.pl XXXXXXXX , XXXXXXXXXXXXXXXXXXXXXXXXXXX: use Encode::HanConvert; $euc_cn = big5_to_gb($big5); # XX Big5 XXXX GBK $big5 = gb_to_big5($euc_cn); # XX GBK XXXX Big5 |
XXXXXXXXXXXX |
XXXXXXX Perl XXXXXXXXXXXXXXXXXXXXXX ( XXXXXXXXXXXXXXXXXXXXX ), XXXXXXXXXXXXXXXXX Perl XXXXXXXX , XXXX Unicode XXXXXXXXXXX . XXXX , XXXXXXXXXXXXXXXXXXXX: |
XXXXX Perl XXXXXXXXXXXX |
<http://www.perl.com/> |
Perl XXXXXXX ( XXXXXXXXXXXXXXXXXX ) |
<http://www.cpan.org/> |
Perl XXXXXXXXXXX (Comprehensive Perl Archive Network) |
<http://lists.perl.org/> |
Perl XXXXXXXXXXXXXX |
XXXX Perl XXXXXXXX |
<http://www.oreilly.com.cn/html/perl.html> |
XXXXXXXXXXXXXXXXXXXX Perl XXXXX |
Perl XXXXXXXXXXX |
<http://www.pm.org/groups/asia.html> |
XXXXX Perl XXXXXXXXXX |
Unicode XXXXXXXXXX |
<http://www.unicode.org/> |
Unicode XXXXXXXXX (Unicode XXXXXXXXXXXXXXXXX ) |
<http://www.cl.cam.ac.uk/%7Emgk25/unicode.html> |
Unix/Linux XXXXX UTF−8 XXX Unicode XXXXXX |
Encode, Encode::CN, encoding, perluniintro, perlunicode |
Jarkko Hietaniemi <jhi@iki.fi> Audrey Tang ( XXXXXX ) <audreyt@audreyt.org> |