NAME

HTML::Rebase - rewrite HTML links to be relative to a given URL

SYNOPSIS

use HTML::Rebase qw(rebase_html rebase_css);
my $html = <<HTML;
<html>
<head>
<link rel="stylesheet" src="http://localhost:5000/css/site.css" />
</head>
<body>
<a href="http://perlmonks.org">Go to Perlmonks.org</a>
<a href="http://localhost:5000/index.html">Go to home page/a>
</body>
</html>
HTML

my $local_html = rebase_html( "http://localhost:5000/about.html", $html );
print $local_html;
__END__
<html>
<head>
<link rel="stylesheet" src="css/site.css" />
</head>
<body>
<a href="http://perlmonks.org">Go to Perlmonks.org</a>
<a href="index.html">Go to home page/a>
</body>
</html>

rebase_html

Rewrites all HTML links to be relative to the given URL. This only rewrites things that look like src= and href= attributes. Unquoted attributes will not be rewritten. This should be fixed.

rebase_css

Rewrites all CSS links to be relative to the given URL. This only rewrites things that look like url( ... ) .

CAVEATS

Does handle the <base> tag in a specific way

If the HTML contains a <base> tag, it's href= attribute is used as the page URL relative to which links are rewritten.

Uses regular expressions to do all parsing

Instead of parsing the HTML into a DOM, performing the modifications and then writing the DOM back out, this module uses a simplicistic regular expressions to recognize href= and src= attributes and to rewrite them.

REPOSITORY

The public repository of this module is https://github.com/Corion/html-rebase.

SUPPORT

The public support forum of this module is https://perlmonks.org/.

BUG TRACKER

Please report bugs in this module via the RT CPAN bug queue at https://rt.cpan.org/Public/Dist/Display.html?Name=HTML-Rebase or via mail to html-rebase-Bugs@rt.cpan.org.

AUTHOR

Max Maischein corion@cpan.org

COPYRIGHT (c)

Copyright 2015-2018 by Max Maischein corion@cpan.org.

LICENSE

This module is released under the same terms as Perl itself.