![]() |
![]()
| ![]() |
![]()
NAMELingua::Han::Utils - The utility tools of Chinese character(HanZi) SYNOPSISuse Lingua::Han::Utils qw/Unihan_value csplit cdecode csubstr clength/; # cdecode # the same as decode('cp936', $word) in ASCII editing mode # and decode('utf8', $word) in Unicode editing mode my $word = cdecode($word); # Unihan_value # return the first field of Unihan.txt on unicode.org my $word = "我"; my $unihan = Unihan_value($word); # return '6211' my $words = "爱你"; my @unihan = Unihan_value($word); # return (7231, 4F60) my $unihan = Unihan_value($word); # return 72314F60 # csplit # split the Chinese characters into an array my $words = "我爱你"; my @words = csplit($words); # return ("我", "爱", "你") # csubstr # treat the Chinese characters as one # so it's the same as splice(csplit($words), $offset, $length) my $words = "我爱你啊"; my @words = csubstr($words, 1, 2); # return ("爱", "你") my @words = csubstr($words, 1); # return ("爱", "你", "啊") my $words = csubstr($words, 1, 2); # 爱你 # clength # treat the Chinese character as one my $words = "我爱你"; print clength($words); # 3 EXPORTNothing is exported by default. EXPORT_OK
DOCUMENTa Chinese version of document can be found @ <http://www.fayland.org/journal/Lingua-Han-Utils.html> AUTHORFayland Lam, "<fayland at gmail.com>" BUGSPlease report any bugs or feature requests to "bug-lingua-han-utils at rt.cpan.org", or through the web interface at <http://rt.cpan.org/NoAuth/ReportBug.html?Queue=Lingua-Han-Utils>. I will be notified, and then you'll automatically be notified of progress on your bug as I make changes. SUPPORTYou can find documentation for this module with the perldoc command. perldoc Lingua::Han::Utils You can also look for information at:
ACKNOWLEDGEMENTSthe wonderful Encode::Guess COPYRIGHT & LICENSECopyright 2005 Fayland Lam, all rights reserved. This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
|