SYNOPSIS
use WebService::LOC::CongRec::Crawler;
use Log::Log4perl;
Log::Log4perl->init_once('log4perl.conf');
my $crawler = WebService::LOC::CongRec::Crawler->new();
$crawler->congress(107);
$crawler->oldest(1);
$crawler->goForth();
ATTRIBUTES
- congress
    The numbered congress to be fetched. If this is not given, the current congress is fetched.
- issuesRoot
    The root page for Daily Digest issues.
    Breadcrumb path: Library of Congress > THOMAS Home > Congressional Record > Browse Daily Issues
- issues
    A hash of issues, keyed as %issues{year}{month}{day}{section}; see the sketch after this list.
- mech
    A WWW::Mechanize object with state that we can use to grab pages from THOMAS.
- oldest
    Boolean attribute specifying that pages are visited from earliest to most recent.
    The default is 0, i.e. visit the most recent first.
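As a sketch of how these attributes fit together: the example below sets attributes at construction time and then walks the issues hash. Both the constructor-argument form (assuming the usual Moose-style new()) and the assumption that issues() returns a hash reference go beyond what this documentation guarantees, so verify against the source.

    use strict;
    use warnings;
    use WebService::LOC::CongRec::Crawler;
    use Log::Log4perl;

    Log::Log4perl->init_once('log4perl.conf');

    # Assumes a Moose-style constructor, so attributes can be set here
    # as well as via the accessors described above.
    my $crawler = WebService::LOC::CongRec::Crawler->new(
        congress => 110,    # the 110th Congress; omit for the current one
        oldest   => 1,      # visit earliest issues first
    );
    $crawler->goForth();

    # Walk the %issues{year}{month}{day}{section} structure; this assumes
    # the issues accessor returns a hash reference.
    my $issues = $crawler->issues;
    for my $year (sort keys %$issues) {
        for my $month (sort { $a <=> $b } keys %{ $issues->{$year} }) {
            for my $day (sort { $a <=> $b } keys %{ $issues->{$year}{$month} }) {
                my @sections = sort keys %{ $issues->{$year}{$month}{$day} };
                print "$year-$month-$day: @sections\n";
            }
        }
    }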
METHODS
goForth()
$crawler->goForth();
$crawler->goForth(process => \&process_page);
$crawler->goForth(start => $x);
$crawler->goForth(end => $y);
Start crawling from the Daily Digest issues page, i.e. http://thomas.loc.gov/home/Browse.php?&n=Issues
Also, for a specific congress, where NUM is the congress number: http://thomas.loc.gov/home/Browse.php?&n=Issues&c=NUM
Returns the total number of pages grabbed.
Accepts an optional processing function to run on each page.
Accepts optional page counter start and end values. If these are not given, or are given as zero, crawling starts from the beginning and continues until all pages have been visited.
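A sketch of a bounded crawl with a per-page callback. The arguments that goForth() passes to the process function are an assumption here, so check the source for the exact signature:

    # Hypothetical callback: what goForth() actually passes to the
    # process function is an assumption - verify against the source.
    sub process_page {
        my ($page) = @_;
        # e.g. extract and store the page text here
    }

    my $total = $crawler->goForth(
        process => \&process_page,
        start   => 10,    # skip pages before the 10th
        end     => 20,    # stop after the 20th page
    );
    print "Grabbed $total pages\n";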
parseRoot(Str $content)
Parse the root issues page and fill our hash of available issues.
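For example, parseRoot() could be driven by hand with the crawler's own mech object (a sketch; goForth() normally does this internally):

    my $crawler = WebService::LOC::CongRec::Crawler->new();
    $crawler->mech->get($crawler->issuesRoot);
    $crawler->parseRoot($crawler->mech->content);
    # $crawler->issues should now describe the available issues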