NAME

WWW::Crawler::Mojo::Job - Single crawler job

SYNOPSIS

my $job = WWW::Crawler::Mojo::Job->new;

DESCRIPTION

This class represents a single crawler job.

ATTRIBUTES

literal_uri

A Mojo::URL instance of the literal URL that has appeared in the referrer document.

resolved_uri

A Mojo::URL instance of the resolved URL.

referrer

A job instance that has referred the URL.

redirect_history

An array reference that contains URLs of redirect history.

method

HTTP request method such as get or post.

tx_params

A hash reference that contains params for Mojo::Transaction.

METHODS

clone

Clones the job.

child

Initiate a child job by parent job. The parent uri is set to child referrer.

my $job1 = WWW::Crawler::Mojo::Job->new(resolved_uri => 'http://a/1');
my $job2 = $job1->child(resolved_uri => 'http://a/2');
say $job2->referrer # 'http://a/1'

depth

Counts the depth of job in referrer series.

redirect

Replaces the resolved URI and history at once.

my $job = WWW::Crawler::Mojo::Job->new;
$job->resolved_uri($url1);
$job->redirect($url2, $url3);
say $job->resolved_uri # $url2
say $job->redirect_history # [$url1, $url3]

original_uri

Returns the original URI of redirected job.

AUTHOR

Sugama Keita, <sugama@jamadam.com>

COPYRIGHT AND LICENSE

Copyright (C) Sugama Keita.

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.