NAME
Novel::Robot::Parser - get novel / bbs content from website
小说站点解析引擎
INIT
site
support novel website 支持小说站点
support txt file 支持处理txt
txt 指定解析txt文件
support raw file 支持处理序列化数据
raw 指定解析经MessagePack压缩的二进制文件
support forum website 支持论坛站点
hjj 红晋江
tieba 百度贴吧
new
init funtion, need set sitename,or url 初始化解析模块,需指定站点名称或网址
#sitename : 直接指定站点
my $parser = Novel::Robot::Parser->new( site => 'jjwxc' );
#url : 通过url自动检测站点
my $url = 'http://www.jjwxc.net/onebook.php?novelid=2456';
my $parser = Novel::Robot::Parser->new( site => $url );
get_item_ref
get novel / forum thread data hash ref
获取小说/贴子内容,返回一个hash引用
my $r = $parser->get_item_ref($url, %opt);
get_item_info
get novel index info / forum thread first page info
获取小说/贴子首页信息
my $r = $parser->get_item_info($url);
NOVEL FUNCTION
get_novel_ref
get novel data, 获取小说内容
my $r = $parser->get_novel_ref($url, %opt);
get_index_ref
get novel index data, 获取目录页信息
my $index_ref = $parser->get_index_ref($index_url, %opt);
get_chapter_ref
get novel chapter data, 获取章节页信息
my $chapter_url = 'http://m.jjwxc.net/book2/2456/2';
my $chapter_ref = $parser->get_chapter_ref($chapter_url, 2);
TIEZI FUNCTION
get_tiezi_ref
get forum thread data, 获取贴子内容
my $r = $parser->get_tiezi_ref($url, %opt);
BOARD FUNCTION
writer -> multi books
forum board -> multi threads
get_board_ref
get writer / board info, 获取版块信息
my $r = $parser->get_board_ref($url, %opt);
QUERY FUNCTION
get_query_ref
query info, 获取查询结果
my $query_type = '作者';
my $query_keyword = '顾漫';
my ($info, $items_ref) = $parser->get_query_ref( $query_keyword,
query_type => $query_type );