NAME

HTML::Grabber

SYNOPSIS

use feature 'say';    # needed for say() below
use HTML::Grabber;
use LWP::Simple;

# Fetch the page and build a DOM from its HTML
my $dom = HTML::Grabber->new( html => get('http://twitter.com/ned0r') );

# Print the body, date, and permalink of each status entry
$dom->find('li.status')->each(sub {
    my $body = $_->find('.entry-content')->text;
    my $when = $_->find('.entry-date')->text;
    my $link = $_->find('a[rel="bookmark"]')->attr('href');
    say "$body $when (link: $link)";
});

DESCRIPTION

HTML::Grabber provides a jQuery-style interface to HTML documents, which makes parsing and manipulating HTML straightforward for anyone familiar with jQuery (http://jquery.com).

It uses XML::LibXML for DOM parsing/manipulation and HTML::Selector::XPath for converting CSS expressions into XPath.

AUTHOR

Martyn Smith <martyn@dollyfish.net.nz>

SELECTORS

All selectors are CSS selectors. Internally they are converted to XPath using HTML::Selector::XPath. If a more creative selector isn't working as expected, check that module's documentation to see whether it's supported.
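To see what XPath a given selector turns into, HTML::Selector::XPath can be called directly. The following is a minimal sketch using that module's exported selector_to_xpath function; the selectors are arbitrary examples, and the exact XPath text may vary between versions of the module.

```perl
use strict;
use warnings;
use HTML::Selector::XPath qw(selector_to_xpath);

# Illustrative selectors only; substitute whatever you plan to pass to find().
my @selectors = ( 'li.status', 'a[rel="bookmark"]', 'div > p' );

my %xpath_for;
for my $selector (@selectors) {
    # Wrap the call in eval so that a selector the module cannot
    # parse is reported instead of aborting the script.
    my $xpath = eval { selector_to_xpath($selector) };
    $xpath_for{$selector} = $xpath;
    print "$selector => ", ( defined $xpath ? $xpath : "unsupported: $@" ), "\n";
}
```

If a selector comes back as unsupported here, find() is unlikely to handle it either.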

METHODS

BUILD

find( $selector )

Get descendants of each element in the current set of matched elements, filtered by a selector.
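Because find() searches the descendants of the current set, calls can be chained to narrow a search step by step. The sketch below uses made-up markup (the ids and class names are assumptions for illustration) and relies only on the new, find, and text methods described in this document.

```perl
use strict;
use warnings;
use HTML::Grabber;

# Hypothetical markup, invented for this example.
my $html = <<'HTML';
<ul id="news">
  <li class="status"><span class="entry-content">First post</span></li>
  <li class="status"><span class="entry-content">Second post</span></li>
</ul>
<p class="entry-content">Outside the list</p>
HTML

my $dom = HTML::Grabber->new( html => $html );

# Narrow to the list first, then search only within it, so the
# paragraph outside the list is not matched.
my $in_list = $dom->find('#news')->find('.entry-content');
my $text    = $in_list->text;

print $text, "\n";
```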

filter( $match )

Filter the current set of matched elements to those that contain the text specified by $match. $match can be either a plain string or a regular expression (a Regexp such as qr/.../).
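A short sketch of both forms of $match follows. The markup is invented for the example, and the behaviour shown for the plain-string form is assumed to be a simple "contains" check, as the description above suggests.

```perl
use strict;
use warnings;
use HTML::Grabber;

# Hypothetical markup, invented for this example.
my $html = <<'HTML';
<ul>
  <li>Perl 5.36 released</li>
  <li>Weather report</li>
  <li>perl tips and tricks</li>
</ul>
HTML

my $dom   = HTML::Grabber->new( html => $html );
my $items = $dom->find('li');

# Plain string: keep items whose text contains "Perl".
my $by_string = $items->filter('Perl')->text;

# Regexp: keep items whose text matches, here ignoring case.
my $by_regexp = $items->filter(qr/perl/i)->text;

print "string match: $by_string\n";
print "regexp match: $by_regexp\n";
```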

parent()

Get the parent of each element in the current set of matched elements.
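One common use of parent() is to select containers by what they contain. The following sketch, on invented markup, finds the posts that carry at least one tag; it makes no assumption about whether a parent shared by several matches is returned once or several times.

```perl
use strict;
use warnings;
use HTML::Grabber;

# Hypothetical markup, invented for this example.
my $html = <<'HTML';
<div class="post" id="a"><span class="tag">perl</span><span class="tag">xml</span></div>
<div class="post" id="b"><span class="tag">perl</span></div>
<div class="post" id="c"><span class="note">no tags</span></div>
HTML

my $dom = HTML::Grabber->new( html => $html );

# Walk from each tag up to the element that contains it and
# record that element's id.
my @ids;
$dom->find('span.tag')->parent->each(sub {
    push @ids, $_->attr('id');
});

print join(', ', @ids), "\n";
```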

text()

Get the combined text contents of each element in the set of matched elements, including their descendants.

html()

Return the HTML of the currently matched elements.

remove()

Remove the matched nodes from the DOM tree and return them.
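remove() and html() combine naturally when cleaning up a document before serialising it again. This is a sketch on invented markup; whether html() includes the matched element's own tags is not stated here, so the example only checks which text survives.

```perl
use strict;
use warnings;
use HTML::Grabber;

# Hypothetical markup, invented for this example.
my $html = '<div id="main"><p>Keep me</p><p class="ad">Buy now</p></div>';

my $dom = HTML::Grabber->new( html => $html );

# Detach every element with class "ad" from the tree.
$dom->find('.ad')->remove;

# Serialise what is left of the container.
my $cleaned = $dom->find('#main')->html;

print $cleaned, "\n";
```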

attr( $attribute )

Get the value of an attribute for the first element in the set of matched elements.
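Since attr() only looks at the first matched element, collecting an attribute from every match takes a loop. The sketch below, on invented markup, shows both cases using each(); it assumes matches are visited in document order.

```perl
use strict;
use warnings;
use HTML::Grabber;

# Hypothetical markup, invented for this example.
my $html = '<p><a href="/first">One</a> <a href="/second">Two</a></p>';

my $dom   = HTML::Grabber->new( html => $html );
my $links = $dom->find('a');

# Called on the whole set, attr() reports the first link only.
my $first_href = $links->attr('href');

# To get every href, visit the links one at a time.
my @hrefs;
$links->each(sub { push @hrefs, $_->attr('href') });

print "first: $first_href\n";
print "all:   @hrefs\n";
```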

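each( $sub )

Execute $sub once for each matched node. Inside the sub, $_ holds the current node wrapped in an HTML::Grabber object, so methods such as find() can be called on it (see the SYNOPSIS).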

CLASS METHODS

uniq( @nodes )

Internal method that takes a list of XML::LibXML::Element objects and returns the list with duplicates removed.

