NAME

CSS::Inliner - Library for converting CSS <style> blocks to inline styles.

SYNOPSIS

use CSS::Inliner;

my $inliner = new CSS::Inliner();

$inliner->read_file({ filename => 'myfile.html' });

print $inliner->inlinify();

DESCRIPTION

Library for converting CSS style blocks into inline styles in an HTML document. Specifically this is intended for the ease of generating HTML emails. This is useful as certain email clients don't support top level <style> declarations despite it being 2017.

NOTE: The development of the this module is currently suspended as the sponsoring entity, MailerMailer LLC, has been sold to j2 Global.

METHODS

new

Instantiates the Inliner object. Sets up class variables that are used during file parsing/processing. Possible options are:

html_tree - (optional) Pass in a fresh unparsed instance of HTML::Treebuilder

NOTE: Any passed references to HTML::TreeBuilder will be substantially altered by passing it in here...

strip_attrs - (optional) Remove all "id" and "class" attributes during inlining

leave_style - (optional) Leave style/link tags alone within <head> during inlining

relaxed - (optional) Relaxed HTML parsing which will attempt to interpret non-HTML4 documents.

NOTE: This argument is not compatible with passing an html_tree.

agent - (optional) Pass in a string containing a preferred user-agent, overrides the internal default provided by the module for handling remote documents

fetch_file

Fetches a remote HTML file that supposedly contains both HTML and a style declaration, properly tags the data with the proper charset as provided by the remote webserver (if any). Subsequently calls the read method automatically.

This method expands all relative urls, as well as fully expands the stylesheet reference within the document.

This method requires you to pass in a params hash that contains a url argument for the requested document. For example:

$self->fetch_file({ url => 'http://www.example.com' });

Note that you can specify a user-agent to override the default user-agent of 'Mozilla/4.0' within the constructor. Doing so may avoid certain issues with agent filtering related to quirky webserver configs.

Input Parameters: url - the desired url for a remote asset presumably containing both html and css charset - (optional) programmer specified charset for the pass url

read_file

Opens and reads an HTML file that supposedly contains both HTML and a style declaration, properly tags the data with the proper charset if specified. It subsequently calls the read() method automatically.

This method requires you to pass in a params hash that contains a filename argument. For example:

$self->read_file({ filename => 'myfile.html' });

Additionally you can specify the character encoding within the file, for example:

$self->read_file({ filename => 'myfile.html', charset => 'utf8' });

Input Parameters: filename - name of local file presumably containing both html and css charset - (optional) programmer specified charset of the passed file

read

Reads passed html data and parses it. The intermediate data is stored in class variables.

The <style> block is ripped out of the html here, and stored separately. Class/ID/Names used in the markup are left alone.

This method requires you to pass in a params hash that contains scalar html data. For example:

$self->read({ html => $html });

NOTE: You are required to pass a properly encoded perl reference to the html data. This method does *not* do the dirty work of encoding the html as utf8 - do that before calling this method.

Input Parameters: html - scalar presumably containing both html and css charset - (optional) scalar representing the original charset of the passed html

detect_charset

Detect the charset of the passed content.

The algorithm present here is roughly based off of the HTML5 W3C working group document, which lays out a recommendation for determining the character set of a received document, which can be seen here under the "determining the character encoding" section: http://www.w3.org/TR/html5/syntax.html

NOTE: In the event that no charset can be identified the library will handle the content as a mix of UTF-8/CP-1252/8859-1/ASCII by attempting to use the Encoding::FixLatin module, as this combination is relatively common in the wild. Finally, if Encoding::FixLatin is unavailable the content will be treated as ASCII.

Input Parameters: content - scalar presumably containing both html and css charset - (optional) programmer specified charset for the passed content ctcharset - (optional) content-type specified charset for content retrieved via a url

decode_characters

Implement the character decoding algorithm for HTML as outlined by the various working groups

Basically apply best practices for determining the applied character encoding and properly decode it

It is expected that this method will be called before any calls to read()

Input Parameters: content - scalar presumably containing both html and css charset - known charset for the passed content

inlinify

Processes the html data that was entered through either 'read' or 'read_file', returns a scalar that contains a composite chunk of html that has inline styles instead of a top level <style> declaration.

query

Given a particular selector return back the applicable styles

specificity

Given a particular selector return back the associated selectivity

content_warnings

Return back any warnings thrown while inlining a given block of content.

Note: content warnings are initialized at inlining time, not at read time. In order to receive back content feedback you must perform inlinify first

AUTHOR

Kevin Kamel <kamelkev@underprint.com>

CONTRIBUTORS

Dave Gray <cpan@doesntsuck.com>
Vivek Khera <vivek@khera.org>
Michael Peters <wonko@cpan.org>
Chelsea Rio <chelseario@gmail.com>

LICENSE

This module is Copyright 2017 Khera Communications, Inc. It is licensed under the same terms as Perl itself.