NAME

Word::Segmenter::Chinese::Lite - Split Chinese into words

SYNOPSIS

use Word::Segmenter::Chinese::Lite qw(wscl_seg);
my @result = wscl_seg("中华人民共和国成立了oyeah");
print @result;

METHODS

wscl_seg($chinese_article, $max_word_length)

Main method.

Input a chinese article which want to de splited.

Output a list.

$chinese_article -- must be utf8 encoding

$max_word_length -- Optional

EXPORT

no method will be exported by default.

TODOS

1. Support for custom dictionary.

2. Add overlapping-bigram,bigram,1gram algorithm.

AUTHOR

Chen Gang, <yikuyiku.com@gmail.com>

COPYRIGHT AND LICENSE

Copyright (C) 2014 by Chen Gang

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.16.2 or, at your option, any later version of Perl 5 you may have available.

1 POD Error

The following errors were encountered while parsing the POD:

Around line 75:

Non-ASCII character seen before =encoding in 'wscl_seg("中华人民共和国成立了oyeah");'. Assuming UTF-8