NAME
Test::LectroTest::Generator - Random value generators and combinators
SYNOPSIS
use Test::LectroTest::Generator qw(:common :combinators);
my $int_gen = Int;
my $pct_gen = Int( range=>[0,100] );
my $flt_gen = Float( range=>[0,1] );
my $bln_gen = Bool;
my $chr_gen = Char( charset=>"a-z" );
my $str_gen = String( charset=>"A-Z0-9", length=>[3,] );
my $ary_gen = List( Int(sized=>0) );
my $hsh_gen = Hash( $str_gen, $pct_gen );
my $uni_gen = Unit( "e" ); # always returns "e"
my $elm_gen = Elements("e1", "e2", "e3", "e4");
for my $sizing_guidance (1..100) {
my $i = $int_gen->generate( $sizing_guidance );
print "$i ";
}
print "\n";
# generates single digits
my $digit_gen = Elements( 0..9 ); # or Int(range=>[0,9],sized=>0)
# generates SSNs like "910-77-2236"
my $ssn_gen = Paste( Paste( ($digit_gen) x 3 ),
Paste( ($digit_gen) x 2 ),
Paste( ($digit_gen) x 4 ),
glue => "-" );
# print 10 SSNs
print( map {$ssn_gen->generate($_)."\n"} 1..10 );
my $english_dist_vowel_gen =
Frequency( [8.167,Unit("a")], [12.702,Unit("e")],
[6.996,Unit("i")], [ 7.507,Unit("o")],
[2.758,Unit("u")] );
# Source: http://www.csm.astate.edu/~rossa/datasec/frequency.html
DESCRIPTION
This module provides random value generators for common types and provides an interface and tools for creating your own generators. It also provides generator combinators that can be used to create more-complex generators by combining simple ones.
A generator is an object having a method generate
, which takes a single argument, size and returns a new random value. The generator interprets the size argument as guidance about the largest value it should create. Typically, smaller size values result in smaller generated values. Some generators ignore sizing guidance or can be told to ignore it via the sized modifier.
GENERATORS
The following functions create fully-formed generators, ready to use. Each generator has a generate
method that you can call to extract a new, random value from the generator.
- Int
-
my $gen = Int( range=>[0,9], sized=>0 );
Creates a generator for integer values, by default in the range [-32768,32767], inclusive, but this can be changed via the optional range modifier.
- Int( range=>[low, high] )
-
Causes the generated values to be constrained to the range [low, high], inclusive. By default, the range is [-32768, 32767].
- Int( sized=>bool )
-
If true (the default), constrains the absolute value of the generated integers to the sizing guidance provided to the
generate
method. Otherwise, the generated values are constrained only by the range.
- Float
-
my $gen = Float( range=>[-2.0,2.0], sized=>1 );
Creates a generator for floating-point values, by default in the range [-32768,32768), but this can be changed via the optional range modifier. By default Float generators are sized.
- Float( range=>[low, high] )
-
Causes the generated values to be constrained to the range [low, high). By default, the range is [0, 1). (Note that the high value itself can never be generated, but values infinitesimally close to it can.)
- Float( sized=>bool )
-
If true (the default), constrains the absolute value of the generated values to the sizing guidance provided to the
generate
method. Otherwise, the generated values are constrained only by the range.
- Bool
-
my $gen = Bool;
Creates a generator for boolean values: 0 for false, 1 for true. The generator ignores sizing guidance.
- Char
-
my $gen = Char( charset=>"A-Za-z0-9_" );
Creates a generator for characters. By default the characters are in the ASCII range [0,127], inclusive, but this behavior can be changed with the charset modifier:
- Char( charset=>[cset] )
-
Characters will be drawn from the character set given by the character-set specification cset. The syntax of cset is similar the Perl
tr
builtin and is a string comprised of characters and character ranges:- c
-
Adds the character c to the set.
- c-d
-
Adds the characters in the range c through d (inclusive) to the set. Note: If c is lexicographically greater than d, the range is empty, and no characters will be added to the set.
Examples:
- charset=>"abcdwxyz"
-
The characters "a", "b", "c", "d", "w", "x", "y", and "z" are in the set.
- charset=>"a-dx-z"
-
Shorter version of the previous example.
- charset=>"\x00-\x7f"
-
The ASCII character set.
- charset=>"-_A-Za-z0-9"
-
The character set contains "-", "_", upper- and lower-case ASCII letters, and the digits 0-9. Notice that the dash must occur first so that it is not misinterpreted as denoting a range of characters.
- List(elemgen)
-
my $gen = List( Bool, length=>[1,10] );
Creates a generator for lists (which are returned as array refs). The elements of the lists are generated by the generator given as elemgen. The lengths of the generated lists are constrained by sizing guidance at the time of generation. You can override the default sizing behavior using the optional length modifier:
When the list generator calls the element generator, it divides the sizing guidance by the length of the list. For example, if the list being generated will have 7 elements, when the list generator calls the element generator to generate each element, it will scale the sizing guidance by 1/7. In this way the sizing guidance provides a rough constraint on the total number of elements produced, regardless of the depth of the list structure being generated.
- List( ..., length=>N )
-
Generated lists are exactly length N.
- List( ..., length=>[M,] )
-
Generated lists are at least length M. (Maximum length is constrained by sizing factor.)
- List( ..., length=>[M,N] )
-
Generated lists are of length between M and N, inclusive. Sizing guidance is ignored.
Advanced Note: If more than one elemgen is given, they will be used in turn to create successive elements. In this case, the length of the list will be multiplied by the number of generators given. For example, providing two generators will create double-length lists.
- Hash(keygen, valgen)
-
my $gen = Hash( String( charset=>"A-Z", length=>3 ), Float( range=>[0.0, 100.0] );
Creates a generator for hashes (which are returned as hash refs). The keys of the hash are generated by the generator given as keygen, and the values are generated by the generator valgen.
The Hash generator takes an optional length modifier that specifies the desired hash length (= number of keys):
- String
-
my $gen = String( length=>[3,], charset=>"A-Z" );
Creates a generator for strings. By default the strings will be drawn from the ASCII character set (0 through 127) and be of length constrained by the sizing factor. Both defaults can be changed using modifiers:
- String( charset=>[cset] )
-
Characters will be drawn from the character set given by the character-set specification cset. The syntax of cset is similar the Perl
tr
builtin and is a string comprised of characters and character ranges. See Char for a full description. - String( length=>length-spec )
-
Specifies the desired length of generated strings, using the same length-spec syntax as for the List generator.
- Elements(e1, e2, ...)
-
my $gen = Elements( "alpha", "beta", "gamma" );
Creates a generator that choses among the given elements e1, e2, ... with equal probability. Each call to the
generate
method will return one of the element values. Sizing guidance has no effect on this generator.Note: This generator builder does not accept modifiers. If you pass any, they will be interpreted as elements to be added to the pool from which the generator randomly selects, which is probably not what you want.
- Unit(e)
-
my $gen = Unit( "alpha" );
Creates a generator that always returns the value e. Not too useful on its own but can be handy as a building block for combinators to chew on. Naturally, sizing guidance has no effect on this generator.
Note: This generator builder does not accept modifiers.
GENERATOR COMBINATORS
The following combinators allow you to build more complicated generators from simpler ones.
- Paste(gens..., glue=>str)
-
my $gen = Paste( (String(charset=>"0-9",length=>4)) x 4, glue => " " ); # make credit-card numbers
Creates a combined generator that generates values by joining the values generated by each of the supplied sub-generators gens. The resulting string is returned. The values are joined using the given glue string str. If no glue modifier is provided, the default glue is the empty string.
The sizing guidance given to the combined generator will be passed unchanged to each of the sub-generators.
- OneOf(gens...)
-
my $gen = OneOf( Unit(0), List(Int,length=>3) );
Creates a combined generator that generates each value by selecting at random (with equal probability) one of the sub-generators in gens and using that generator to generate the output value.
The sizing guidance given to the combined generator will be passed unchanged to the selected sub-generator.
Note: This combinator does not accept modifiers.
- Frequency([freq1, gen1], [freq2, gen2], ...)
-
my $gen = Frequency( [50, Unit("common" )], [35, Unit("less common")], [15, Unit("uncommon" )] );
Creates a combined generator that generates each value by selecting at random one of the generators gen1 or gen2 or ... and using that generator to generate the output value. Each generator is selected with probability proportional to its associated frequency. (If all of the given frequencies are the same, the Frequency combinator effectively becomes OneOf.) The frequencies can be any non-negative numerical values you want and will be normalized to a 0-to-1 scale internally. At least one frequency must be greater than zero.
The sizing guidance given to the combined generator will be passed unchanged to the selected sub-generator.
Note: This combinator does not accept modifiers.
- Sized(BLOCK, gen)
-
my $gen = Sized { 2 * $_[0] + 3 } List(Int);
Creates a generator that adjusts sizing guidance by passing it through the function given in BLOCK and then calls the generator gen using the adjusted guidance.
Note: This combinator does not accept modifiers.
SIMPLE EXAMPLES
use strict;
use Test::LectroTest::Generator qw(:common);
show("Ints (sized by default)", Int);
show("Floats (unsized by default)", Float);
show("Percentages (unsized)",
Int( range=>[0,100], sized=>0 ));
show("Lists (sized by default) of Ints (unsized) in [0,10]",
List( Int( sized=>0, range=>[0,10] ) ));
show("Uppercase-alpha identifiers at least 3 chars long",
String( length=>[3,], range=>[ord'A',ord'Z'] ));
show("Hashes (sized by default) of form AAA=>Digit",
Hash( String( length=>3, range=>[ord'A',ord'Z'] ),
Int( sized=>0, range=>[0,9] ) ));
sub show {
print "\n", shift(), "\n";
my ($gen) = @_;
for (1..10) {
my $val = $gen->generate($_);
printf "Size %2d: ", $_;
if (ref $val eq "HASH") {
my @pairs = map {"$_=>$val->{$_}"} keys %$val;
print "{ @pairs }";
}
elsif (ref $val eq "ARRAY") {
print "[ @$val ]"
}
else {
print $val;
}
print "\n";
}
}
ADVANCED EXAMPLES
For these exmaples we use Data::Dumper
to inspect the data structures we generate. Also, we import not only the common generator constructors (like Int) but also the generic Gen constructor, which lets us build on-the-fly generators out of blocks.
use Data::Dumper;
use Test::LectroTest::Generator qw(:common Gen);
First, here's a recipe for building a list of lists of integers:
my $loloi_gen = List( List( Int(sized=>0) ) );
print Dumper($loloi_gen->generate(10));
You may want to run the example several times to get a feel for the distribution of the generated output.
Now, a more complicated example. Here we build sized trees of random depth using a recursive set of generators.
my $tree_gen = do {
my $density = 0.5;
my $leaf_gen = Int( sized=>0 );
my $tree_helper = \1;
my $branch_gen = List( Gen { $$tree_helper->generate(@_) } );
$tree_helper = \Gen {
my ($size) = @_;
return rand($size) < $density
? $leaf_gen->generate($size)
: $branch_gen->generate($size + 1);
};
$$tree_helper;
};
print Dumper($tree_gen->generate(30));
We define a tree as either a leaf or a branch, and we randomly decide between the two at each node in the growing tree. Leaves are just integers and become more likely when the sizing guidance diminishes (which happens as we go deeper). The code uses $density
as a control knob for leaf density. (Try re-running the above code after changing the value of $density
. Try 0, 1, and 2.) Branches, on the other hand, are lists of trees. Because branches generate trees, and trees generate branches, we use a reference trick to set up the mutually recursive relationship. This we encapsulate within a do block for tidiness.
LECTROTEST HOME
The LectroTest home is http://community.moertel.com/LectroTest. There you will find more documentation, presentations, a wiki, and other helpful LectroTest-related resources. It's also the best place to ask questions.
AUTHOR
Tom Moertel (tom@moertel.com)
INSPIRATION
The LectroTest project was inspired by Haskell's fabulous QuickCheck module by Koen Claessen and John Hughes: http://www.cs.chalmers.se/~rjmh/QuickCheck/.
COPYRIGHT and LICENSE
Copyright 2004 by Thomas G Moertel. All rights reserved.
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.