Why not adopt me?
NAME
Hash::MultiKey - hashes whose keys can be multiple
SYNOPSIS
use Hash::MultiKey;
# tie first
tie %hmk, 'Hash::MultiKey';
# store
$hmk{['foo', 'bar', 'baz']} = 1;
# fetch
$v = $hmk{['foo', 'bar', 'baz']};
# exists
exists $hmk{['foo', 'bar', 'baz']}; # true
# each
while (($mk, $v) = each %hmk) {
@keys = @$mk;
# ...
}
# keys
foreach $mk (keys %hmk) {
@keys = @$mk;
# ...
}
# values
foreach $v (values %hmk) {
$v =~ s/foo/bar/g; # alias, modifies value in %hmk
# ...
}
# delete
$rmed_value = delete $hmk{['foo', 'bar', 'baz']};
# clear
%hmk = ();
# syntactic sugar, but see risks below
$hmk{'foo', 'bar', 'baz', 'zoo'} = 2;
# finally, untie
untie %hmk;
DESCRIPTION
Hash::MultiKey provides hashes that accept arrayrefs of strings as keys.
Two multi-keys are regarded as being equal if their contents are equal, there is no need to use the same reference to refer to the same hash entry:
$hmk{['foo', 'bar', 'baz']} = 1;
exists $hmk{['foo', 'bar', 'baz']}; # different arrayref, but true
A given hash can have multi-keys of different lengths:
$hmk{['foo']} = 1; # length 1
$hmk{['foo', 'bar', 'baz']} = 3; # length 3, no problem
In addition, multi-keys cannot be empty:
$hmk{[]} = 1; # ERROR
The next sections document how hash-related operations work in a multi-key hash. Some parts have been copied from standard documentation, since everything has standard semantics.
tie
Once you have tied a hash variable to Hash::MultiKey as in
tie my (%hmk), 'Hash::MultiKey';
you've got a hash whose keys are arrayrefs of strings. Having that in mind everything works as expected.
store
Assignment is this easy:
$hmk{['foo', 'bar', 'baz']} = 1;
fetch
So is fetching:
$v = $hmk{['foo', 'bar', 'baz']};
exists
Testing for existence works as usual:
exists $hmk{['foo', 'bar', 'baz']}; # true
Only whole multi-keys as they were used in assigments have entries. Sub-chains do not exist unless they were assigned some value.
For instance, ['foo']
is a sub-chain of ['foo', 'bar', 'baz']
, but if it has no entry in %hmk so far
exists $hmk{['foo']}; # false
each
As with everyday each()
, when called in list context returns a 2-element list consisting of the key and value for the next element of the hash, so that you can iterate over it. When called in scalar context, returns only the key for the next element in the hash.
Remember keys are arrayrefs of strings here:
while (($mk, $v) = each %hmk) {
@keys = @$mk;
# ...
}
The order in which entries are returned is guaranteed to be the same one as either the keys()
or values()
function would produce on the same (unmodified) hash.
When the hash is entirely read, a null array is returned in list context (which when assigned produces a false (0) value), and undef
in scalar context. The next call to each()
after that will start iterating again.
There is a single iterator for each hash, shared by all each()
, keys()
, and values()
function calls in the program.
Adding or deleting entries while we're iterating over the hash results in undefined behaviour. Nevertheless, it is always safe to delete the item most recently returned by each()
, which means that the following code will work:
while (($mk, $v) = each %hmk) {
print "@$mk\n";
delete $hmk{$mk}; # this is safe
}
keys
Returns a list consisting of all the keys of the named hash. (In scalar context, returns the number of keys.) The keys are returned in an apparently random order. The actual random order is subject to change in future versions of perl, but it is guaranteed to be the same order as either the values()
or each()
function produces (given that the hash has not been modified). As a side effect, it resets hash's iterator.
Remember keys are arrayrefs of strings here:
foreach $mk (keys %hmk) {
@keys = @$mk;
# ...
}
There is a single iterator for each hash, shared by all each()
, keys()
, and values()
function calls in the program.
The returned values are copies of the original keys in the hash, so modifying them will not affect the original hash. Compare values()
.
values
Returns a list consisting of all the values of the named hash. (In a scalar context, returns the number of values.) The values are returned in an apparently random order. The actual random order is subject to change in future versions of perl, but it is guaranteed to be the same order as either the keys()
or each()
function would produce on the same (unmodified) hash.
Note that the values are not copied, which means modifying them will modify the contents of the hash:
s/foo/bar/g foreach values %hmk; # modifies %hmk's values
s/foo/bar/g foreach @hash{keys %hash}; # same
As a side effect, calling values()
resets hash's internal iterator.
There is a single iterator for each hash, shared by all each()
, keys()
, and values()
function calls in the program.
delete
Deletes the specified element(s) from the hash. Returns each element so deleted or the undefined value if there was no such element.
The following (inefficiently) deletes all the values of %hmk:
foreach $mk (keys %hmk) {
delete $hmk{$mk};
}
And so do this:
delete @hmk{keys %hmk};
But both methods are slower than just assigning the empty list to %hmk:
%hmk = (); # clear %hmk, the efficient way
untie
Untie the variable when you're done:
untie %hmk;
SYNTACTIC SUGAR
Hash::MultiKey supports also this syntax:
$hash{'see', '$;', 'in', 'perldoc', 'perlvar'} = 1;
If the key is a string instead of an arrayref the underlying code splits it using $;
(see why in MOTIVATION) and from then on the key is an arrayref as any true multi-key. Thus, the assigment above is equivalent to
$hash{['see', '$;', 'in', 'perldoc', 'perlvar']} = 1;
once it has been processed.
You don't need to split the string back while iterating with each()
or keys()
, it already comes as an arrayref of strings.
Nevertheless take into account that this is slower, and broken if any of the components contains $;
. It is supported just for consistency's sake.
MOTIVATION
Perl comes already with some support for hashes with multi-keys. As you surely know, if perl sees
$hash{'foo', 'bar', 'baz'} = 1;
it joins ('foo', 'bar', 'baz')
with $;
to obtain the actual key, thus resulting in a string. Then you retrieve the components of the multi-key like this:
while (($k, $v) = each %hash) {
@keys = $k eq '' ? ('') : split /$;/, $k, -1;
# ...
}
Since $;
is \034
by default, a non-printable character, this is often enough.
Sometimes, however, that's not the most convenient way to work with multi-keys. For instance, that magic join doesn't work with arrays:
@array = ('foo', 'bar', 'baz');
$hash{@array} = 1; # WARNING, @array evaluated in scalar context!
You could be dealing with binary data. Or you could be writing a public module that uses user input in such a hash and don't want to rely on input not coming with $;
, or don't want to document such an obscure, gratuitous, and implementation dependent constraint.
In such cases, Hash::MultiKey can help.
AUTHORS
Xavier Noria (FXN), Benjamin Goldberg (GOLDBB).
THANKS
Iain Truskett (SPOON) kindly checked whether this module works in perl 5.005 and found out the use of "/" in pack()
, introduced in perl 5.006, prevents that.
COPYRIGHT and LICENSE
Copyright (C) 2003, Xavier Noria <fxn@cpan.org>. All rights reserved. This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.