NAME

Novel::Robot::Parser - get novel / bbs content from website

小说站点解析引擎

INIT

site

support novel website 支持小说站点

asxs

qywx

ddshu

dingdian

hkslg

jjwxc

kanshuge

kanunu

lwxs

luoqiu

qidian

qqxs

shunong

snwx

tadu

ttzw

yanqingji

ybdu

zhonghuawuxia

zilang

yssm

xiaoxiaoshuwu

biquge

support txt file 支持处理txt

txt 指定解析txt文件

support raw file 支持处理序列化数据

raw 指定解析经MessagePack压缩的二进制文件

support forum website 支持论坛站点

hjj 红晋江

tieba 百度贴吧

new

init funtion, need set sitename,or url 初始化解析模块,需指定站点名称或网址

#sitename : 直接指定站点
my $parser = Novel::Robot::Parser->new( site => 'jjwxc' );
 
#url : 通过url自动检测站点
my $url = 'http://www.jjwxc.net/onebook.php?novelid=2456';
my $parser = Novel::Robot::Parser->new( site => $url );

get_item_ref

get novel / forum thread data hash ref

获取小说/贴子内容,返回一个hash引用

my $r = $parser->get_item_ref($url, %opt);

get_item_info

get novel index info / forum thread first page info

获取小说/贴子首页信息

my $r = $parser->get_item_info($url);

NOVEL FUNCTION

get_novel_ref

get novel data, 获取小说内容

my $r = $parser->get_novel_ref($url, %opt);

get_index_ref

get novel index data, 获取目录页信息

my $index_ref = $parser->get_index_ref($index_url, %opt);

get_chapter_ref

get novel chapter data, 获取章节页信息

my $chapter_url = 'http://m.jjwxc.net/book2/2456/2';
my $chapter_ref = $parser->get_chapter_ref($chapter_url, 2);

TIEZI FUNCTION

get_tiezi_ref

get forum thread data, 获取贴子内容

my $r = $parser->get_tiezi_ref($url, %opt);

BOARD FUNCTION

writer -> multi books

forum board -> multi threads

get_board_ref

get writer / board info, 获取版块信息

my $r = $parser->get_board_ref($url, %opt);

QUERY FUNCTION

get_query_ref

query info, 获取查询结果

my $query_type = '作者';
my $query_keyword = '顾漫';

my ($info, $items_ref) = $parser->get_query_ref( $query_keyword, 
    query_type => $query_type );