NAME

SQL::Interpolate - Interpolate Perl variables into SQL statements

SYNOPSIS

use SQL::Interpolate qw(:all);

# Some sample data to interpolate:
my $s = 'blue'; my @v = (5, 6);

# Variable references are transformed into bind parameters.
# The most basic usage involves scalarrefs (as well as arrayrefs
# preceeded by "IN").
my ($sql, @bind) = sql_interp
  'SELECT * FROM table WHERE x = ', \$s, 'AND y IN', \@v;
# RESULT:
#   $sql  = 'SELECT * FROM mytable WHERE x = ? AND y IN (?, ?)'
#   @bind = ($s, @v);

# In certain contexts, an arrayref or hashref acts as a single tuple:
my ($sql, @bind) = sql_interp
  'INSERT INTO table', {x => $s, y => 1};
# RESULT:
#   $sql  = 'INSERT INTO mytable (x, y) VALUES(?, ?)';
#   @bind = ($s, 1);
my ($sql, @bind) = sql_interp
  'UPDATE table SET', {x => $s, y => 1}, 'WHERE y <> ', \2;
# RESULT:
#   $sql  = 'UPDATE mytable SET x = ?, y = ? WHERE y <> ?';
#   @bind = ($s, 1, 2);

# In general, a hashref provides a shortcut for specifying
# a logical-AND construction:
my ($sql, @bind) = sql_interp
  'SELECT * FROM table WHERE', {x => $s, y => \@v};
# RESULT:
#   $sql  = 'SELECT * FROM mytable WHERE (x = ? AND y IN (?, ?))';
#   @bind = ($s, @v);

# In general, an arrayref acts as a result set or reference to
# a temporary table:
my ($sql, @bind) = sql_interp
  [[1, 2], [4, 5]], 'UNION', [{x => 2, y => 3}, {x => 5, y => 6};
# RESULT:
#   $sql  = '(SELECT ?, ? UNION ALL SELECT ?, ?) UNION
#            (SELECT ? AS x, ? AS y UNION ALL SELECT ?, ?)';
#   @bind = (1,2,4,5, 2,3,5,6);
my ($sql, @bind) = sql_interp
  'SELECT * FROM', [[1, 2], [4, 5]]
# RESULT:
#   $sql  = 'SELECT * FROM (SELECT ?, ? UNION ALL SELECT ?, ?) AS tbl0';
#   @bind = (1,2,4,5);

# Each result above is suitable for passing to DBI:
my $res = $dbh->selectall_arrayref($sql, undef, @bind);

# Besides these simple techniques shown, SQL-Interpolate includes
# various optional modules to further integrate SQL::Interpolate with
# DBI and streamline the syntax with source filtering and macros (see
# the L</SEE ALSO> section):

use DBIx::Interpolate FILTER => 1;
...
my $rows = $dbx->selectall_arrayref(sql[
    SELECT thid, date, title, subject
    FROM threads
    WHERE date > $x AND subject IN @subjects
]);

DESCRIPTION

Purpose

SQL::Interpolate interpolates Perl variables into SQL statements in a simplified manner. It converts a list of intermixed SQL fragments and variable references into a conventional SQL string and list of bind values suitable for passing onto DBI.

When using plain DBI, one traditionally interpolates reliably with bind values, which can become unwieldy:

$dbh->do(qq(
    INSERT INTO table (color, shape, width, height, length)
                VALUES(?,     ?,     ?,     ?,      ?     )
), undef, $c, $s, $w, $h, $l);

This can be ameliorated somewhat with "SQL building techniques," but SQL::Interpolate eliminates much of this need with a terse Perl-like syntax:

my ($sql, @bind) = sql_interp 'INSERT INTO table',
    {color => $c, shape => $s, width => $w, height => $h, length => $l};
$dbh->do($sql, undef, @bind);

Security notes

SQL::Interpolate properly binds or escapes variables. This recommended practice safeguards against "SQL injection" attacks. The DBI documentation has several links on the topic.

INTERFACE

The central function of this module is sql_interp(). The rest of this distribution provides alternative interfaces, supporting functionality, and wrappers for sql_interp().

sql_interp

($sql, @bind) = sql_interp @params;            # functional interface
($sql, @bind) = $interp->sql_interp(@params);  # OO interface

sql_interp() strings together the given list of elements to interpolate (@params), called the "interpolation list," and returns both an SQL string ($sql) with placeholders ("?") and a corresponding list of bind values (@bind) suitable for passing to DBI.

The interpolation list can contain elements of these types (the first two are most often used):

* SQL - string containing a raw SQL fragment such as 'SELECT * FROM mytable WHERE'.

* variable reference - scalarref, arrayref, hashref, or "sql_var" object referring to data to interpolate between the SQL.

* macro - object that may be further expanded into previous three types of elements. Some strings may expand into macro objects. Macros are explained in SQL::Interpolate::Macro.

* other interpolation list - an interpolation list can be nested inside another interpolation list. This is possible with the sql() function.

In addition, the first element in the interpolation list may represent state data in the form of a database handle or (for the OO interface) an instance of SQL::Interpolate. sql_interp and macros may use this state data to customize the output, such as for a dialect of SQL.

The basic interpolation process is as follows. Strings are appended to the output SQL ($sql), possibly with some context-dependent tweaking. For each scalar variable reference, a corresponding placeholder ("?") and other SQL is appended to $sql, and the de-referenced value is pushed onto @bind. Nested interpolation lists are generally flattened.

Interpolation Examples

# sample data to interpolate
my $s  = \3;                   # scalarref
my $v  = [1, 2];               # arrayref (vector)
my $h  = {m => 1, n => undef}; # hashref
my $hv = {v => $v, s => $$s};  # hashref containing arrayref
my $vv = [$v, $v];             # arrayref of arrayref
my $vh = [$h, $h];             # arrayref of hashref
Let $x stand for \$s, $v, $h, $hv, $vv, or $vh.

Default scalarref behavior -- single bind value

INPUT:  'foo', $s, 'bar'
OUTPUT: 'foo ? bar', $$s

Default hashref behavior -- logical-AND

INPUT:  'WHERE', $x
OUTPUT: 'WHERE (m=? AND n IS NULL)', $h->{m},      # $x = $h
OUTPUT: 'WHERE (v IN (?, ?) AND s = ?)', @$v, $$s  # $x = $hv

Default arrayref behavior -- invalid

# Arrayrefs do not have a default behavior.  They cannot
# be used except in the special contexts described later.
INPUT:  'SELECT', [1, 2, 3]  # INVALID

Context ('IN', $x) -- list

INPUT:  'WHERE x IN', $x
OUTPUT: 'WHERE x IN (?, ?)', @$v                   # $x = $v
OUTPUT: 'WHERE x IN (?)', $$s                      # $x = $s
OUTPUT: 'WHERE 1=0'                                # $x = []
# Note: An arrayref of length 0 is handled specially
# because 'WHERE x IN ()' can be invalid SQL (e.g. MySQL/PostgreSQL).

Context ('INSERT INTO tablename', $x) -- tuple

INPUT:  'INSERT INTO mytable', $x
OUTPUT: 'INSERT INTO mytable (m, n) VALUES(?, ?)', # $x = $h
        $h->{m}, $h->{n}
OUTPUT: 'INSERT INTO mytable VALUES(?, ?)', @$v;   # $x = $v
OUTPUT: 'INSERT INTO mytable VALUES(?)', $$s;      # $x = $s

Context ('SET', $x) -- tuple

INPUT:  'UPDATE mytable SET', $h
OUTPUT: 'UPDATE mytable SET m = ?, n = ?', $h->{m}, $h->{n}

Default arrayref of (hashref or arrayref) behavior -- result set

INPUT:  $x
OUTPUT: '(SELECT ?, ? UNION ALL SELECT ?, ?)',
        map {@$_} @$v                           # $x = $vv
OUTPUT: '(SELECT ? as m, ? as n UNION ALL
          SELECT ?, ?)',
        $vh->[0]->{m}, $vh->[0]->{n},
        $vh->[1]->{m}, $vh->[1]->{n}            # $x = $vh

# Typical usage:
INPUT: $x
INPUT: $x, 'UNION [ALL|DISTINCT]', $x
INPUT: 'INSERT INTO mytable', $x
INPUT: 'SELECT * FROM mytable WHERE x IN', $x

Context ('FROM | JOIN', $x) -- table reference

IN:  'SELECT * FROM', $x
OUT: 'SELECT * FROM
     (SELECT ?, ? UNION ALL SELECT ?, ?) as t001',
     map {@$_} @$v                                     # $x = $vv
OUT: 'SELECT * FROM
     (SELECT ? as m, ? as n UNION ALL SELECT ?, ?) as temp001',
     $vh->[0]->{m}, $vh->[0]->{n},
     $vh->[1]->{m}, $vh->[1]->{n}                      # $x = $vh

IN:  'SELECT * FROM', $vv, 'AS t'
OUT: 'SELECT * FROM
     (SELECT ?, ? UNION ALL SELECT ?, ?) AS t',
     map {@$_} @$v

# Example usage (where $x and $y are table references):
'SELECT * FROM', $x, 'JOIN', $y

Other rules

Whitespace is automatically added between parameters:

INPUT:  'UPDATE', 'mytable SET', {x => 2}, 'WHERE y IN', \@colors;
OUTPUT: 'UPDATE mytable SET x = ? WHERE y in (?, ?)', 2, @colors

Variables must be passed as references (possibly using the sql// operator when source filtering is enabled); otherwise, they will processed as SQL fragments and interpolated verbatim into the result SQL string, negating the security and performance benefits of binding values.

In contrast, any scalar values inside an arrayref or hashref are by default treated as binding variables, not SQL. The contained elements may be also be sql_var(), sql(), or macro objects.

sql_interp will Do The Right Thing(TM) on trivial cases:

INPUT:  'SELECT * FROM table WHERE color IN', []
OUTPUT: 'SELECT * FROM table WHERE 1=0'
# invalid to MySQL: SELECT * FROM table WHERE color IN ()

SQL::Interpolate does not attempt to further optimize away such expressions. Databases are designed to do query optimization, without loss of generality.

Variable interpolation is context-sensitive. The same variable references can generate different SQL sub-expressions depending on context:

INPUT:  'INSERT INTO mytable', $h
OUTPUT: 'INSERT INTO mytable (m, n) VALUES(?, ?)', ...

INPUT:  'UPDATE mytable SET', $h
OUTPUT: 'UPDATE mytable SET m = ?, n = ?', ...

Error handling: On error, sql_interp will croak with a string message.

Other notes

An interesting property is that result sets allow passing DBI results back into the interpolation list:

# DBIx::Interpolate example (note: $vv2 is identical to $vv)
my $vv2 = $dbx->selectall_arrayref($dbx->selectall_arrayref($vv));

In certain cases (e.g. for INSERT and IN), a result set can be used as an alternative to a single tuple:

# (tuple)
INPUT:  'SELECT * FROM mytable WHERE x IN', $v
OUTPUT: 'SELECT * FROM mytable WHERE x IN (?, ?)', @$v
# (subselect)
INPUT:  'SELECT * FROM mytable WHERE x IN', [$v]
OUTPUT: 'SELECT * FROM mytable WHERE x IN (SELECT ?, ?)', @$v

# (tuple)
INPUT:  'INSERT INTO mytable', $v
OUTPUT: 'INSERT INTO mytable VALUES (?, ?)', @$v
# (subselect)
INPUT:  'INSERT INTO mytable', [$v]
OUTPUT: 'INSERT INTO mytable (SELECT ?, ?)', @$v

sql

my $sqlobj = sql(@params);

sql creates an object of type SQL::Interpolate::SQL representing an interpolation list of the elements in @params.

sql() is useful only in exceptional cases. For example, if you want insert raw SQL as a value in an arrayref or hashref, it cannot be done with a plain string because any scalar value in an arrayref or hashref is interpreted as a binding variable. An sql object must be used explicitly for force a change in context:

sql_interp 'INSERT INTO mytable',
    {x => $x, y => sql('CURRENT_TIMESTAMP')};
# OUTPUT: 'INSERT INTO mytable (x, y) VALUES(?, CURRENT_TIMESTAMP)', $x

sql_interp 'INSERT INTO mytable', [$x, sql(\$y, '*', \$z)];
# OUTPUT: 'INSERT INTO mytable VALUES(?, ? * ?)', $x, $y, $z

sql_var

my $sqlvar = sql_var($value_ref, type => $sql_type, %params);

sql_var() create an sql_var object (of type SQL::Interpolate::Variable), which provides a general way to represent a binding variable along with metadata.

$value_ref - variable reference contained

$sql_type - any DBI SQL_DATA_TYPE (e.g. SQL_INTEGER). Optional. Default is undef.

Any other named parameters (%params) passed in will be saved into the object as attributes.

sql_var objects are useful only in special cases where additional information should be tagged onto the variable. For example, DBI allows bind variables to be given an explicit type:

my ($sql, @bind) = sql_interp 'SELECT * FROM mytable WHERE',
    'x=', \$x, 'AND y=', sql_var(\$y, SQL_VARCHAR), 'AND z IN',
    sql_var([1, 2], SQL_INTEGER);
# RESULT: @bind =
#   ([$x, sql_var(\$x)], [$y, sql_var(\$y, type => SQL_VARCHAR)],
#    [1, sql_var([1, 2], type => SQL_INTEGER)],
#    [2, sql_var([1, 2], type => SQL_INTEGER)]);

If the interpolation list contains at least one sql_var object, then all the variable references are transparently converted into sql_var objects, and the elements of @bind take a special form: an arrayref consisting of the bind value and the sql_var object that generated the bind value. Note that a single sql_var holding an aggregate (arrayref or hashref) may generate multiple bind values. See "ADDITIONAL EXAMPLES" for example usage.

new

my $interp = SQL::Interpolate->new([$dbh|$filter]...);
@result = $interp->sql_interp([$dbh|$filter]...);

Creates a new SQL::Interpolate object, which can configure the interpolation process.

The arguments can be

- $dbh - zero or one DBI database handle.

- $filter - zero or more SQL filters (derived from SQL::Interpolate::SQLFilter).

The OO interface often is not needed, but it is useful if you need to configure the behavior of many calls to sql_interp, such as when using some macros.

make_sql_interp

my $sql_interp = make_sql_interp(@params);          # functional
my $sql_interp = $interp->make_sql_interp(@params); # OO

Creates a closure that wraps the sql_interp function such that the parameters passed to the sql_interp consist of @params following by the parameters passed to the closure. This function is typically used to eliminate a need to always pass in a database handle into sql_interp:

my $interp = make_sql_interp($dbh);

my ($sql, @bind) = $interp->(...);

Exports and use parameters

TRACE_SQL

To enable tracing on sql_interp, do

use SQL::Interpolate TRACE_SQL => 1;

The generated SQL statements and bind values of all sql_interp calls will be sent to STDERR.

DEBUG:interp[sql=INSERT INTO mytable VALUES(?),bind=5]
EXPORTS
use SQL::Interpolate qw(:all);

':all' exports these functions: make_sql_interp, sql_interp, sql_var, and sql.

DEPENDENCIES

This module has no major dependencies. If using the optional modules, see their DEPENDENCIES sections.

ADDITIONAL EXAMPLES

These are more advanced examples.

Preparing and reusing a statement handle

my $sth;
for my $href (@array_of_hashrefs) {
   my @list = ('SELECT * FROM mytable WHERE', $href);
   my ($sql, @bind) = sql_interp @list;
   die 'ASSERT' if $sth && $sth->{Statement} ne $sql;
   $sth = $dbh->prepare($sql) unless $sth;
   $sth->execute(@list);
   $sth->fetchall_arrayref();
}

The above code requires that $sql never changes. If $sql does change, you would have to prepare a new statement handle. DBIx::Interpolate implements a streamlined solution that caches statement handles.

Binding variables types (DBI bind_param)

my ($sql, @bind) = sql_interp 'SELECT * FROM mytable WHERE',
    'x=', \$x, 'AND y=', sql_var(\$y, SQL_VARCHAR), 'AND z IN',
    sql_var([1, 2], SQL_INTEGER);
# RESULT:
#   @bind = ([$x, sql_var(\$x)], [$y, sql_var(\$y, type => SQL_VARCHAR)],
#            [1, sql_var([1, 2], type => SQL_INTEGER)],
#            [2, sql_var([1, 2], type => SQL_INTEGER)]);
die 'ASSERT' if ref $bind[0] ne 'ARRAY';
my $sth = $dbh->prepare($sql);
my $idx = 1;
for my $var (@bind) {
    $sth->bind_param($idx++, $var->[0], $var->[1]->{type});
}
$sth->execute();
my $ret = $sth->selectall_arrayref();

This kludge is similar to the approach in SQL::Abstract's bindtype. DBIx::Interpolate provides a simpler way of handling bind_type.

DESIGN NOTES

The section covers the design choices used in this module.

Philosophy and requirements

These principles have guided the design of SQL-Interpolate.

The core module (SQL::Interpolate) should be simple and not try to do too much. (Mark Stosberg) SQL-Interpolate has one central function, sql_interp, which is relatively simple and reusable in itself such that all other functionality is built upon it. Complicated (macro) and less robust (source filtering) capabilities have been extracted out of the core module and into optional modules. Source filtering, for example, is optional and off by default since many distrust source filtering and fear it will cause bugs that are especially difficult to debug because "it's not Perl anymore."

The bottleneck is the database rather than Perl. This module necessarily imposes some overhead, largely due to the added string and regex processing. The author has not quantified this overhead but expects it to be low compared to database concerns such as disk access and query processing and network concerns such as latency. It may be possible to avoid rerunning sql_interp when only the binding variables change (e.g. my ($sql, $bindobj) = sql_interp(...); @bind = $bindobj->(x => 1); @bind = $bindobj->(x => 2)), but this is probably does not provide much benefit.

The query language is SQL. There are other modules (such as SQL::Abstract) that abstract the SQL language behind either object-oriented (OO) method calls and/or Perl data structures (hashes and arrays). The former may be undesirable in some cases since it replaces one language with another and hides the full capabilities and expressiveness of your database's native SQL language. The latter may load too much meaning into the syntax of "{, "[" and "\" thereby rendering the meaning less clear:

SQL::Abstract example:
%where = (lname => {like => '%son%'},
          age   => [-and => {'>=', 10}, {'<=', 20}])
Plain SQL:
"lname LIKE '%son' AND (age >= 10 AND age <= 20)"

In contrast, SQL::Interpolate does not abstract away your SQL but rather makes it easier to interpolate Perl variables into your SQL. Now, SQL::Interpolate does load some meaning into "{, "[" and "\", but we try to limit its use to obvious cases as justified below. Since your raw SQL is exposed, you can use your particular dialect of SQL, assuming the variable interpolation rules are relatively normal. Database independence is a worthy goal, but it can be quite difficult to achieve and is beyond the scope of SQL::Interpolate (though you might wish to build such features on-top-of SQL::Interpolate).

Do-what-I-mean (DWIM) and satisfy the most common case. The syntax is intended to be natural and terse for the most common cases. This is demonstrated in the examples.

Now, it may be a bit inconsistent that a hashref has two meanings. The hashref in ('WHERE', \%hash) represents a logical AND-equal construction, whereas the hashref in ('INSERT INTO mytable', \%hash) and ('UPDATE mytable SET', \%hash) represents a tuple or portion of it. However, there is little ambiguity since a swap of the two meanings results in illogical statements. There is a limited number of plausible meanings and these constructions, and these two are the most common and useful ones in practice. Admittedly, the former case might alternately be understood as an logical OR (rather than AND) construction, but it the AND construction is more common in a WHERE clause and a natural "Do What I Mean." (Similarly, a query "Perl MySQL" posed to a search engine usually implies "Perl AND MySQL" not "Perl OR MySQL.) In the latter interpretation of \%hash, the hashref very well models a tuple that is more named rather than ordered.

Using an arrayref [x => $x, y => $y] rather than a hashref for the AND'ed construction could work just as well, and it allows duplicate keys and non-scalar keys. However, SQL::Interpolate reserves [...] for future use. SQL::Interpolate interprets an arrayref inside a hashref such as {x => \@y, ...} as an 'x IN y' construction. This was independently suggested by a number of people and unlikely to be confused with the 'x = y' since and x and y have different dimensions (one is scalar and the other is a vector).

It may be a bit inconsistent that scalars inside hashrefs and arrayrefs are interpreted as binding variables rather than SQL as is the case outside.

'WHERE', 'x = ', \$x  # variables outside must be referenced
'WHERE', {x => $x}    # variables inside should not be referenced
'WHERE', [$x,$y]      # variables inside should not be referenced

However, this not too much a stretch of logicality, and the alternatives are not pretty and do not satisfy the commonest case. Consider:

'WHERE', {x => \$x, y => \$y, z => 'CURRENT_TIMESTAMP'}
'WHERE x IN', [\1, \2, \3]
'WHERE x IN', \\@colors  # double referencing

Exceptions from the commonest case require sql().

Limitations / characteristics

This module is still a bit under development, so interfaces could change some, particularly in the more esoteric features. Still, it is expected you will find this module quite stable, robust, tested, simple, flexible, and well documented.

If you're new to this module, it's a good idea to examine the generated SQL (e.g. the TRACE_SQL option) to ensure you're getting what you think you're getting. Be careful to reference the variables you interpolate to prevent SQL injection (see discussion in "sql_interp").

This module does not parse your SQL fragments except to the extent required for variable interpolation, so it does not guarantee that the generated SQL is valid but leaves that responsibility to your database. This is intentional so that it will works well even with non-standard SQL dialects. Some types of interpolation are context-sensitive and involve examination of your SQL fragments. The examination could fail on obscure syntax, but it is often simple and robust. Look at the examples to see the types of interpolation that are accepted, and if doubt, examine the SQL output yourself (use Data::Dumper to examine the output of sql_interp or enable the TRACE_SQL option) or look at the source code of sql_interp. If needed, you can disable context sensitivity by inserting a null-string before a variable.

A few things are just not possible with the ('WHERE', \%hashref) syntax, so in such case, use a more direct syntax:

# ok--direct syntax
sql_interp '...WHERE', {x => $x, y => $y}, 'AND y = z';
# bad--trying to impose a hashref but keys must be scalars and be unique
sql_interp '...WHERE',
    {sql_var(\$x) => sql('x'), y => $y, y => sql('z')};

In the cases where this module parses or generates SQL fragments, this module should work for many databases, but its been tested mostly on MySQL and PostgreSQL. Please information the author of any incompatibilities.

Proposed enhancements

The following enhancements to SQL::Interpolate have been proposed (not all may be implemented). The most important suggestions are generally listed at top.

A tutorial could be useful. (Wojciech)

This might be useful:

IN:  'bla IN', [undef, 2]
OUT: '(blah IS NULL OR bla IN (?))', 2
# possilby also: {bla => [undef, 2]}

Possibly support result sets inside the hashref logical-AND construct:

IN:  {x => [[1],[2]]}
OUT: 'x IN (SELECT ? UNION ALL SELECT ?)', 1, 2

Possibly support naming columns on table references when expressed as an arrayref of arrayrefs. Denoting this might require a typed object:

IN:  'SELECT * FROM', sql_table($vv, ['x', 'y'])
OUT: 'SELECT * FROM
     (SELECT ? AS x, ? AS y UNION ALL SELECT ?, ?) as t001',
     map {@$_} @$v

The following additional type of INSERT might be supported (markt):

# possible syntax
IN:  'INSERT INTO temp (id,val)', [1,2]
IN:  'INSERT INTO temp', sql_row([1, 2], ['id', 'val'])
IN:  'INSERT INTO temp', sql_row(id => 1, val => 2)
# note: an equivalent is currently possible with result sets:
IN:  'INSERT INTO temp (id,val)', [[1,2]]
OUT: 'INSERT INTO temp (id,val) (SELECT 1, 2)'

Possibly support MySQL and DB2 (any other databases?) style multi-row INSERT syntax (unlikely):

# MySQL
INSERT INTO table (a,b,c) VALUES (1,2,3),(4,5,6)
# But this might not be needed since it can be done already
# with result sets (e.g. MySQL and PostgreSQL):
INSERT INTO table (SELECT 1,2,3 UNION SELECT 4,5,6)

Support for placeholders might be added for cases when placing the variable references in-line is inconvenient or not desired (similar to SQL::String):

IN:  'SELECT * FROM mytable WHERE x = ? and y = ?', \$x, \$y
OUT: 'SELECT * FROM mytable WHERE x = ? and y = ?', $x, $y

IN:  'SELECT * FROM mytable WHERE x = :x and y = :y',
     {x => $x, y => $y}
OUT: 'SELECT * FROM mytable WHERE x = ? and y = ?', $x, $y

Similarly, named placeholders might be supported (possibly via sql_var()):

'SELECT * FROM mytable WHERE',
  'x=', sql_var('color'), 'and', {val => sql_var('weight')},
  sql_val(weight => 100), sql_val(color => 'blue')

Another idea to to add some methods to the sql() objects that allow sql_interp to be called in a more object-oriented manner (like in SQL::String):

my @bind = sql('WHERE x=', \$x)->get_bind();
my $sql  = sql('WHERE x=', \$x)->get_sql();

However, it's not certain that those methods would be used that much.

Possibly allow context to be explicitly specified:

IN:  'TABLEREF', [[1]]
OUT: '(SELECT ?) as tbl0', 1
IN:  'RESULTSET', [[1]]
OUT: '(SELECT ?)', 1
IN:  'ANDCLAUSE', {x => 1}
OUT: '(x = ?)', 1
IN:  'TUPLE', {x => 1}
OUT: 'x = ?', 1
# or possibly use macros
IN:  sql_tableref( [[1]] )
OUT: '(SELECT ?) as tbl0', 1

sql_and and sql_or macros might support hashrefs as well (the former for clarity and the latter for terseness):

'SELECT * FROM mytable WHERE', {x => 2, y => 5}
'SELECT * FROM mytable WHERE', sql_and {x => 2, y => 5} # same as above
'SELECT * FROM mytable WHERE', sql_or {x => 2, y => 5}

Logical operations might be supported as follows (e.g. junctions). This might be rejected due to unintuitive syntax.

'SELECT * FROM mytable WHERE', {
  x => sql_not(2),   # x <> 2
  y => 5
}
'SELECT * FROM mytable WHERE', sql_or { # might be difficult to read
  x => sql_or(2, 3, sql_and(4, 5)),
  y => 5
}
# or
'SELECT * FROM mytable WHERE', {
  sql_not(x => 2),   # x <> 2
  sql_lt(y => 5),    # y <  5
}

Support for tuples (e.g. MySQL/PostgreSQL) might be added, but this is probably too uncommon to be worthwhile to implement:

INPUT:  'SELECT * FROM edge WHERE', {'(sid, did)' => [5, 1]}
OUTPUT: 'SELECT * FROM edge WHERE (sid, did) = (?, ?)', 5, 1

INPUT:  'SELECT * FROM edge WHERE', {'(sid, did)' => [[5, 1], [2, 3]]}
OUTPUT: 'SELECT * FROM edge WHERE (sid, did) IN ((5,1),(2,3))',
        5, 1, 2, 3  # MySQL (not PostgreSQL)
OUTPUT: 'SELECT * FROM edge WHERE (sid, did) IN (SELECT ?,? UNION ALL ?,?)',
        5, 1, 2, 3  # MySQL or PostgreSQL

Implementation notes

Oracle (unlike MySQL) does not allow 'WHERE id = 5 and 1' nor 'WHERE id = 5 or 0'. SQL::Interpolate therefore generates the more portable 'WHERE id = 5 and 1=1' and 'WHERE id = 5 or 1=0'.

CONTRIBUTORS

David Manura (http://math2.org/david/contact) (author). Feedback incorporated from Mark Stosberg (http://mark.stosberg.com/) (recommended simplifying the core module, simplified the docs, and provided a bunch of other highly useful feedback), Mark Tiefenbruck (syntax), Wojciech Pietron (Oracle compat), Jim Chromie (DBIx::Interpolate idea), Juerd Waalboer, Terrence Brannon (early feedback), and others.

FEEDBACK

Bug reports and comments on the design are most welcome. rt.cpan.org can be used for specific bug reporting, but you may also use the discussion list (http://www.cpanforum.com/dist/SQL-Interpolate) or contact me directly (http://math2.org/david/contact). See also the project page at http://math2.org/sql_interpolate.

LEGAL

Copyright (c) 2003-2005, David Manura. This module is free software. It may be used, redistributed and/or modified under the same terms as Perl itself. See http://www.perl.com/perl/misc/Artistic.html.

SEE ALSO

Other modules in this distribution

DBIx::Interpolate extends this module slightly, allowing certain DBI methods to accept an sql_interp-like interpolation list rather than the traditional ($statement, \%attr, @bind_values)-like parameter list.

SQL::Interpolate::Filter streamlines the the SQL::Interpolate syntax with direct SQL interpolation analogous to Perl string interpolation.

SQL::Interpolate::Macro provides an assortment of macros and SQL filters, which allow you to write simpler, more robust, and possibly more portable queries.

SQL::Abstract shares with SQL::Interpolate the purpose of making SQL generation easier. SQL::Abstract differs in that it expresses queries in terms of OO method calls. It's syntax may impair readability because it uses the subtle difference between a brace and bracket to denote the difference between AND and OR in a query (the user can change whether a bracket implies "AND" or "OR"). Some complex where clauses are difficult or impossible with SQL::Abstract. SQL::Interpolate gives the author more direct access to the underlying SQL. This permits using the full expressivity of the database query language.

DBIx::Simple strives to simplify SQL generation as well as the data structures returned from DBI. SQL::Interpolate and the related modules don't try to handle DBI's results at all. DBIx::Simple currently can use SQL::Abstract to help generate SQL, and it may be possible in the future for the same to be done with SQL::Interpolate.

Class::DBI is a popular "complete" solution for abstract database access through an OO interface. It currently has a plugin called Class::DBI::AbstractSearch that allows it to use SQL::Abstract to generate SQL. It's possible that SQL::Interpolate could be integrated with it as well.

SQL::Preproc provides an "embedded SQL" approach where the Perl and SQL languages are extended (via source filtering) to support interwoven Perl and SQL. The syntax supports interpolating Perl variables into SQL and passing query results back into Perl variables. In contrast, SQL::Interpolate extends neither SQL nor Perl (except with the optional source filtering module), and it deals only with interpolating Perl variables into queries, whereas returning variables from queries is the job of something like DBI, DBIx::Interpolate, or DBIx::Simple.

SQL::String shares a number of similiarities to SQL::Interpolate but it is more rudimentary. Both let you combine "chunks" of SQL that have their parameters attached to them and then transform it into an SQL string and list of bind parameters suitable for passing to DBI.

SQL Interpolate Project Page: http://math2.org/sql_interpolate .

Full example code - Meset::MessageBoard in Meset (http://math2.org/meset).