NAME
Perlanet - A program for creating programs that aggregate web feeds (both RSS and Atom).
SYNOPSIS
my $perlanet = Perlanet->new;
$perlanet->run;
DESCRIPTION
Perlanet is a program for creating programs that aggregate web feeds (both RSS and Atom). Web pages like this are often called "Planets" after the Python software which originally popularised them. Perlanet is a planet builder written in Perl - hence "Perlanet".
You are probably interested in Perlanet::Simple to get started straight out of the box, batteries included style.
Perlanet itself is the driving force behind everything, however. Perlanet reads a series of web feeds (filtering only those that are valid), sorts and selects entries from these web feeds, and then creates a new aggregate feed and renders this aggregate feed. Perlanet allows the user to customize all of these steps through subclassing and roles.
For most uses, you probably don't want to use the Perlanet module. The perlanet command line program is far more likely to be useful.
CONSTRUCTOR
new
my $perlanet = Perlanet->new
The constructor method. Can be passed a hashref of initialisers.
See "ATTRIBUTES" below for details of the key/value pairs to pass in.
ATTRIBUTES
- config
-
A hash reference that contains the complete contents of the configuration file.
- ua
-
An instance of LWP::UserAgent. Defaults to a simple agent using
$config->{agent}
as the user agent name, orPerlanet/$VERSION
. - cutoff
-
An instance of DateTime which represents the earliest date for which feed posts will be fetched/shown.
- feeds
-
An arrayref of Perlanet::Feed objects representing the feeds to collect data from.
METHODS
fetch_page
Attempt to fetch a web page and a returns a URI::Fetch::Response object.
fetch_feeds
Called internally by "run" and passed the list of feeds in "feeds".
Attempt to download all given feeds, as specified in the feeds
attribute. Returns a list of Perlanet::Feed objects, with the actual feed data loaded.
NB: This method also modifies the contents of "feeds".
select_entries
Called internally by "run" and passed the list of feeds from "fetch_feeds".
Returns a combined list of Perlanet::Entry objects from all given feeds.
sort_entries
Called internally by "run" and passed the list of entries from "select_entries".
Sort the given list of entries into created/modified order for aggregation, and filters them if necessary.
Takes a list of Perlanet::Entrys, and returns an ordered list.
cutoff_entries
Called internally by "run" and passed the list of entries from "sort_entries".
Removes any entries that were published earlier than the cut-off date for this feed.
build_feed
Called internally by "run" and passed the list of entries from "sort_entries".
Takes a list of Perlanet::Entrys, and returns a Perlanet::Feed that is the actual feed for the planet.
clean_html
Clean a HTML string so it is suitable for display.
Takes a HTML string and returns a "cleaned" HTML string.
clean_entries
Clean all entries for the planet.
Takes a list of entries, runs them through clean
and returns a list of cleaned entries.
render
Called internally by "run" and passed the feed from "build_feed".
This is the hook where you generate some type of page to display the result of aggregating feeds together (ie, inserting the posts into a database, running a HTML templating library, etc)
Takes a Perlanet::Feed as input (as generated by build_feed.
run
The main method which runs the perlanet process.
TO DO
See https://github.com/davorg/perlanet/issues
SUPPORT
To report bugs in Perlanet, please use the ticket queue at https://github.com/davorg/perlanet/issues.
SEE ALSO
AUTHOR
Dave Cross, <dave@perlhacks.com>
COPYRIGHT AND LICENSE
Copyright (c) 2010 by Magnum Solutions Ltd.
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.10.0 or, at your option, any later version of Perl 5 you may have available.