NAME

Fancazzista::Scrap - Perl module for scrap reddit post, dev.to post, website content.

It only scrap article/post link and link text.

SYNOPSIS

use Fancazzista::Scrap;

my %config = (
    'websites' => [
        {
            name         => "Korben",
            url          => "https://korben.info",
            selector     => ".status-publish .entry-title",
            linkSelector => "a",
            textSelector => "a"
            limite       => 10 # optionnal 5 by default
        }
    ],
    'subreddits' => [
        {
            "name" => "javascript",
            "limit" => 10 # optionnal 5 by default
        }
    ],
    'devto' => [
        {
            "tag" => "perl",
            "limit" => 10 # optionnal 5 by default
        }
    ]
);

my @scrapped = Fancazzista::Scrap::scrapContent(\%config);

@scrapped :
[
    { 
        name => '<name>', 
        url => '<url'>, 
        articles => [
            { link => '<article-url>', text => '<article-title>' }
        ],
        from_devto => 1 # if source is dev.to
        from_website => 1 # if source if a website
        from_reddit => 1 # if source if reddit
    }
]   

DESCRIPTION

Fancazzista::Scrap allows to scrap website articles or subreddit posts with a config.

SEE ALSO

AUTHOR

Antoine MICELlthttps://miceli.click<gt>

COPYRIGHT AND LICENSE

Copyright (C) 2020 by Antoine MICELI

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.18.4 or, at your option, any later version of Perl 5 you may have available.