NAME
Fancazzista::Scrap - Perl module for scraping Reddit posts, dev.to posts, and website content.
It only scrapes each article or post link and its link text.
SYNOPSIS
use Fancazzista::Scrap;
my %config = (
'websites' => [
{
name => "Korben",
url => "https://korben.info",
selector => ".status-publish .entry-title",
linkSelector => "a",
textSelector => "a",
limit => 10 # optional, 5 by default
}
],
'subreddits' => [
{
"name" => "javascript",
"limit" => 10 # optional, 5 by default
}
],
'devto' => [
{
"tag" => "perl",
"limit" => 10 # optional, 5 by default
}
]
);
my @scrapped = Fancazzista::Scrap::scrapContent(\%config);
Contents of @scrapped:
[
{
name => '<name>',
url => '<url>',
articles => [
{ link => '<article-url>', text => '<article-title>' }
],
from_devto => 1, # if source is dev.to
from_website => 1, # if source is a website
from_reddit => 1 # if source is reddit
}
]
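The result structure above can be walked with ordinary Perl data-structure code. The following is a minimal sketch, not part of the module: it assumes the result is a list of hash references shaped as documented, and it uses made-up sample data instead of calling scrapContent so that it runs without network access. The helper names source_label and format_sources are hypothetical.

```perl
use strict;
use warnings;

# Hypothetical sample shaped like the documented result. Real data would come
# from Fancazzista::Scrap::scrapContent(\%config).
my @scrapped = (
    {
        name     => 'perl',
        url      => 'https://example.com/perl',
        articles => [
            { link => 'https://example.com/post-1', text => 'First post' },
            { link => 'https://example.com/post-2', text => 'Second post' },
        ],
        from_devto => 1,
    },
);

# Map the documented from_* flags to a human-readable label.
sub source_label {
    my ($source) = @_;
    return 'dev.to'  if $source->{from_devto};
    return 'reddit'  if $source->{from_reddit};
    return 'website' if $source->{from_website};
    return 'unknown';
}

# Build one line per source, followed by one line per article.
sub format_sources {
    my (@sources) = @_;
    my @lines;
    for my $source (@sources) {
        push @lines, sprintf('%s (%s) <%s>',
            $source->{name}, source_label($source), $source->{url});
        push @lines, sprintf('  - %s: %s', $_->{text}, $_->{link})
            for @{ $source->{articles} || [] };
    }
    return @lines;
}

print "$_\n" for format_sources(@scrapped);
```

If scrapContent turns out to return an array reference rather than a list, dereference it first, for example format_sources(@{ $scrapped[0] }).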
DESCRIPTION
Fancazzista::Scrap scrapes website articles, subreddit posts, and dev.to posts according to a configuration hash.
SEE ALSO
AUTHOR
Antoine MICELI <https://miceli.click>
COPYRIGHT AND LICENSE
Copyright (C) 2020 by Antoine MICELI
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.18.4 or, at your option, any later version of Perl 5 you may have available.