NAME
Module::Generic::Scalar - String Manipulation Object Class
SYNOPSIS
my $s = Module::Generic::Scalar->new( "John Doe" );
print( $s->substr( 0, 4 ), "\n" );
# prints: John
$s->append( 'more data' );
# $a is now a Module::Generic::Array object
my $a = $s->as_array;
# $bool is now a Module::Generic::Boolean object
# with value is true or false depending on underlying value of $s
my $bool = $s->as_boolean;
# $a is now a Module::Generic::Number object
my $n = $s->as_number
# raw string
my $str = $s->as_string;
print( "Hello $s\n" );
# Hello John Doe
my $str = Module::Generic::Scalar->new();
$a->callback( add => sub
{
my( $new ) = @_;
return unless( $$new eq $approved_string );
return(1);
});
$str->append( $some_string );
# or
$str .= $some_string;
$s->capitalise;
$s->chomp;
$s->chop;
$s->clone;
my $crypted = $s->crypt( $salt );
$s->defined;
$s->empty;
$s->fc( $that );
my $hex = $s->hex;
my $index = $s->index;
$s->is_alpha;
$s->is_alpha_numeric;
$s->is_empty;
$s->is_lower;
$s->is_numeric;
$s->is_upper;
my $s = Module::Generic::Scalar->new( 'Jack John Paul Peter' );
# Takes the string, split it by space (now an array), join it by comma (now a scalar) and rejoin it with more strings
say $s->split( qr/[[:blank:]]+/ )->join( ', ' )->join( ', ', qw( Gabriel Raphael Emmanuel ) );
# prints: Jack, John, Paul, Peter, Gabriel, Raphael, Emmanuel
$s->lc;
$s->lcfirst;
$s->left;
$s->length;
my $s = Module::Generic::Scalar->new( "I disapprove of what you say, but I will defend to the death your right to say it" );
print( "Matches? ", $s->like( qr/\bapprove[[:blank:]\h]+what\b/ ) ? 'yes' : 'no', "\n" ); # print yes
if( $s->like( qr/[[:blank:]\h]+/ ) )
{
# Do something
}
# or
if( my $rv = $s->like( qr/([[:blank:]\h]+)/ )->result )
{
my $first_match = $rv->capture->first;
# Do something
}
$s->lower;
$s->ltrim;
# Remove all kind of leading whitespaces
$s->ltrim( qr/[[:blank:]\h]+/ );
# $s is "Hello world"
$s->match( 'world' ); # pass
$s->match( qr/WORLD/i ); # pass
$s->match( 'monde' ); # obviously fail
if( $s->match( qr/[[:blank:]\h]+/ ) )
{
# Do something
}
if( my $rv = $s->match( qr/([[:blank:]\h]+)/ )->result )
{
my $first_match = $rv->capture->first;
# Do something
}
my $unpack = $unpack_data->unpack( "A10xA28xA8A*" )->object;
my $fh = $s->open;
$fh->print;
$fh->read;
$s->ord;
$s->pack;
$s->pad( 3, 'X' );
# XXXHello world
$s->padd( -3, 'X' );
# Hello worldXXX
$s->pos;
$s->$s->prepend( join( '', qw( Cogito ergo sum ) ) );
$s->quotemeta;
# $s is Hello world
my $rv = $s->replace( ' ', '_' ); # Hello_world
my $rv = $s->replace( qr/[[:blank:]\h]+/, '_' ); # Hello_world
if( my $rv = $s->replace( qr/([[:blank:]\h]+)/, '_' )->result )
{
my $first_match = $rv->capture->first;
# Do something
}
$s->reset;
$s->reverse;
Module::Generic::Scalar->new( "Hello world" )->right( 5 );
# will produce: world
$s->rindex;
# Remove all kind of trailing whitespaces
$s->rtrim( qr/[[:blank:]\h]+/ );
$s->scalar;
$s->set;
# Returns a Module::Generic::Array object
my $a = $s->split( /\n/ );
my $a = $s->split( qr/\n/ );
$s->sprintf;
$s->substr;
$s->tr;
$s->trim;
$s->uc;
$s->ucfirst;
print( $s->defined ? 'defined' : 'undefined', "\n" );
my $array = $s->unpack( "W" );
# Returns the ord() value as a Module::Generic::Array object
$s->unpack( "W" )->length;
# Object context. Ditto
my( $date, $desc, $income, $expend ) = $s->unpack( "A10xA27xA7A*" );
# Returns a regular list of values
$s->upper;
VERSION
v1.3.4
DESCRIPTION
The purpos of this calss/package is to provide an object-oriented approach to string manipulation.
The object is overloaded, so it returns the embedded string when used as a string.
print( "I met with $s\n" );
Would produce: I met with John Doe
METHODS
new
Provided with scalar reference, an array or just a regular string and this returns a new object.
If an array reference or an array-based object is provided like Module::Generic::Array, this will concatenate all the array elements
append
Provided with a string, and this will add it to the end of the string object.
# $s is 'Hello'
$s->append( ' world' );
# now this is: Hello world
as_array
Returns the current string value as an Module::Generic::Array object.
as_boolean
Returns a Module::Generic::Boolean object with its value set to true or false based on the value of the scalar object.
# $s is 1
$s->as_boolean; # sets to true
# $s is 0
$s->as_boolean; # sets to false
# $s is hello
$s->as_boolean: # sets to true
# $s is undefined or empty
$s->as_boolean: # sets to false
# etc...
Module::Generic::Boolean objects are very useful because they can be used in perl as o or 1 to indicate false or true, but when used in json, they are automatically converted to false
or true
as_number
Returns the current string as an Module::Generic::Number object.
as_string
Returns the object string as a string.
my $s = Module::Generic::Scalar->new( "Mary Jane" );
print( "Hello $s\n" );
# Hello Mary Jane
callback
Sets or gets a callback on the scalar object. You can set 2 types of callback: add
or remove
which will be called respectively when you add something to the scalar or when you remove something from it.
It does this by tieing the scalar to its internal class Module::Generic::Scalar::Tie
that will call the add
or remove
callback when data is being added or removed, no matter how, i.e. be it through the methods of this class, or by directly trying to add data to it:
my $str = Module::Generic::Scalar->new();
$a->callback( add => sub
{
my( $new ) = @_;
return unless( $$new eq $approved_string );
return(1);
});
$str->append( $some_string );
# or
$str .= $some_string;
In both cases above the string will be added, and if it were some other unapproved text, the data would not be added to the string.
So, the callback must return true to allow the data to be added, or undef
to refuse. Returning an empty string or 0 will not work to be considered a rejection even though it is a false value.
Both add
and remove
callbacks receives an hash reference as its sole arguments.
The add
callback receives an hash reference with the following properties:
- added
-
A scalar reference to the new value being set in replacement of the old one, so you can inspect it.
- removed
-
A scalar reference to the old value being replaced, so you can inspect it.
- type
-
The callback type:
add
The remove
callback works the same way, but for any operation that tries to remove data from the string. It receives an hash reference with the following properties:
- removed
- type
-
The callback type:
remove
my $str = Module::Generic::Scalar->new();
$str->callback( remove => sub
{
my( $old, $new ) = @_;
# Do some check to accept or reject
return(1); # always return true to accept
});
$str->substr( 3, 4, $some_other_text ); # Attempt to change the string
This example above would call the callback passing it the old and new string.
To replace a callback, simply add another one. It will replace the previous one.
You can get the callback code associated with a callback by calling callback
with one argument: the callback type.
my $code_ref = $str->callback( 'add' );
To remove a callback, pass undef
:
$str->callback( add => undef );
$str->callback( remove => undef );
By default, a scalar object is not tied to Module::Generic::Scalar::Tie
and no callback are set nor used. Be mindful when using callbacks since it will slow things down a little bit even though it is s very thin layer.
See perltie for more information on tieing data.
capitalise
Returns a new Module::Generic::Scalar object representing the string with the words capitalised, done in a smart way. That is the special words like and
, but
, if
, on
, or
, the
, to
, etc. are not capitalised.
This is based on the work done by John Gruber and Aristotle Pagaltzis
chomp
Just like "chomp" in perlfunc, this removes the trailing new lines in the string object, if any.
As per the perl documentation for "chomp" in perlfunc, "it returns the total number of characters removed from all its arguments."
chop
Just like "chop" in perlfunc, this removes the trailing character in the string object, no matter what it is, and returns the character chopped.
clone
Returns a copy of the object.
crypt
This takes a "salt" and returns an encrypted version of the string. See "crypt" in perlfunc. Note that this does not work well on some BSD systems.
defined
Returns true or false whether the string object contains a defined string, i.e. not undef
empty
Alias for "reset". This empties the scalar object, making it zero-byte in length.
error
Provided with an error message string, or an hash of parameters, and this will instantiate a new exception object and return undef
, or return an Module::Generic::Null object in object context, such as when it is chained.
See "error" in Module::Generic for more information.
fc
Just like "fc" in perlfunc, provided with a string, this enables comparison with casefolding.
To quote from the manual: "Casefolding is the process of mapping strings to a form where case differences are erased".
lc($this) eq lc($that) # Wrong!
# or
uc($this) eq uc($that) # Also wrong!
# or
$this =~ /^\Q$that\E\z/i # Right!
# And now
my $s = Module::Generic::Scalar( $this );
$s->fc( $that );
hex
Returns the hex value of the string.
index
Given a sub string and an optional position, and this returns the position at which the sub string was found. as a Module::Generic::Number object.
is_alpha
Returns true if the string contains only alphabetic characters, or else it returns false.
This uses perl's [[:alpha:]]
to test.
is_alpha_numeric
Returns true if the string contains only alphabetic or numeric characters, or else it returns false.
This uses perl's [[:alnum:]]
to test.
is_empty
Returns true if the string is zero in length, or else it returns false.
is_lower
Returns true if the string contains only lower case characters, or else it returns false.
This uses perl's [[:lower:]]
to test.
is_numeric
Returns true if the string contains only numeric characters, or else it returns false.
This uses "looks_like_number" in Scalar::Util
is_upper
Returns true if the string contains only upper case characters, or else it returns false.
This uses perl's [[:upper:]]
to test.
join
Takes a string as the delimiter and one or more elements and this will join the current string object and the other string elements provided using the string delimiter. For example:
my $s = Module::Generic::Scalar->new( 'Jack John Paul Peter' );
# Takes the string, split it by space (now an array), join it by comma (now a scalar) and rejoin it with more strings
say $s->split( qr/[[:blank:]]+/ )->join( ', ' )->join( ', ', qw( Gabriel Raphael Emmanuel ) );
# prints: Jack, John, Paul, Peter, Gabriel, Raphael, Emmanuel
Note that if, among the element passed to "join", an aray object is provided, it will not be expanded. Thus, if you do something like this:
my $a = Module::Generic::Array->new( [qw( John Peter )] );
say Module::Generic::Scalar->new( 'Paul' )->join( ', ', $a );
It wil lnot produce Paul, John, Peter
, but instead something like: Paul, Module::Generic::Array=ARRAY(0x5646f0116470)
That's because there is no way to guess if the user wanted to pass on a list or just an object and making that kind of assumption is dangerous.
What you woud want to do instead is:
say Module::Generic::Scalar->new( 'Paul' )->join( ', ', $a->list );
Or maybe
say Module::Generic::Scalar->new( 'Paul' )->join( ', ', @$a );
Of course, passing an overloaded object that has the stringification capability will transform it automatically into a string before being joined. Thus:
my $word1 = Module::Generic::Scalar->new( 'Hello' );
my $word2 = Module::Generic::Scalar->new( 'world' );
say $word1->join( ' ', $word2 );
will yield: Hello world
because Module::Generic::Scalar objects have overloaded stringification capability.
It returns the newly formed string as a new Module::Generic::Scalar object.
lc
Takes the current string object and returns a new Module::Generic::Scalar object with the string all in lower case.
lcfirst
Takes the current string object and returns a new Module::Generic::Scalar object with the first character of the string in lower case.
left
Provided with a number and this will get the chunk starting from the left of the string object.
Module::Generic::Scalar->new( "Hello world" )->left( 5 );
# will produce: Hello
See also "right"
length
This returns the length of the string, as a Module::Generic::Number object.
like
Provided with a string or a regular expression and this returns the value of the regular expression evaluation against the object string.
my $s = "I disapprove of what you say, but I will defend to the death your right to say it";
print( "Matches? ", $s->like( qr/\bapprove[[:blank:]\h]+what\b/ ) ? 'yes' : 'no', "\n" ); # print yes
If there is no match, it returns 0, but in object, list or scalar context or when there are matches, it returns a Module::Generic::RegexpCapture object. This object stringifies into the number of matches, so you can do:
if( $s->like( qr/[[:blank:]\h]+/ ) )
{
# Do something
}
or
if( my $rv = $s->like( qr/([[:blank:]\h]+)/ )->result )
{
my $first_match = $rv->capture->first;
# Do something
}
Regular expression can also be one from Regexp::Common
See also "match" and "replace"
lower
Alias for "lc"
ltrim
This removes any new line and space characters, i.e. \r
and \n
at the begining of the string.
It takes an optional argument that can be an alternative string to remove at the end of the sstring or a regular expression, such as one provided with "perlfunc/qr"
$s->ltrim( qr/[[:blank:]\h]+/ ); # Remove all kind of leading whitespaces
It returns the object itself for chaining.
Regular expression can also be one from Regexp::Common
See also "rtrim"
match
Provided with a string or a regular expression like the one created with "qr" in perlfunc and this returns true or false whether the string object matched or not.
# $s is "Hello world"
$s->match( 'world' ); # pass
$s->match( qr/WORLD/i ); # pass
$s->match( 'monde' ); # obviously fail
If there is no match, it returns 0, but in object, list or scalar context or when there are matches, it returns a Module::Generic::RegexpCapture object. This object stringifies into the number of matches, so you can do:
if( $s->match( qr/[[:blank:]\h]+/ ) )
{
# Do something
}
or
if( my $rv = $s->match( qr/([[:blank:]\h]+)/ )->result )
{
my $first_match = $rv->capture->first;
# Do something
}
Regular expression can also be one from Regexp::Common
object
Returns the current object. This is useful when chaining to force an object context. For example, with "unpack". In scalar context, "unpack", as per the "unpack" in perlfunc documentation will return the first element:
my $unpack = $unpack_data->unpack( "A10xA28xA8A*" );
But what if we wanted $unpack
to not be the first element, but the array object? For that, we would need to call "unpack" in object context:
my $unpack = $unpack_data->unpack( "A10xA28xA8A*" )->object;
open
Opens the scalar in read/write mode and returns an object with same capabilities as regular file handle.
ord
This returns the value of "ord" in perlfunc on the string, as a Module::Generic::Number object.
pack
Like "pack" in perlfunc, this takes a template and convert the string object. "The resulting string is the concatenation of the converted values."
It returns a new Module::Generic::Scalar object.
See also "pack" in Module::Generic::Array
pad
Provided with a number n and a string and this will create n instance of the string. If the number is positive, the string will be placed at the begining and if negative, it will be placed at the end
$s->pad( 3, 'X' );
# XXXHello world
$s->padd( -3, 'X' );
# Hello worldXXX
pass_error
If provided with an error object, this will set it in this package and pass it along to the caller.
If no error object is provided, this will merely return undef
, or an Module::Generic::Null object in object context, such as when it is chained.
See "pass_error" in Module::Generic for more information.
pos
This sets or gets the position inside the string object. See "pos" in perlfunc for detail about this.
prepend
Takes some data as its unique argument and prepend it to the scalar object underlying value.
If you want to pass multiple data, you will have to join them first:
$s->prepend( join( '', qw( Cogito ergo sum ) ) );
quotemeta
Given a string, this return a new Module::Generic::Scalar object with the given string characters escapeed with "quotemeta" in perlfunc.
replace
Provided with a string or a regular expression and a replacement string and this will replace all instance of the string or regular expression with the replacement string provided.
# $s is Hello world
my $rv = $s->replace( ' ', '_' ); # Hello_world
my $rv = $s->replace( qr/[[:blank:]\h]+/, '_' ); # Hello_world
If there is no match, it returns 0, but in object, list or scalar context or when there are matches, it returns a Module::Generic::RegexpCapture object. This object stringifies into the number of matches, so you can do:
if( $s->replace( qr/[[:blank:]\h]+/, '_' ) )
{
# Do something
}
or
if( my $rv = $s->replace( qr/([[:blank:]\h]+)/, '_' )->result )
{
my $first_match = $rv->capture->first;
# Do something
}
Regular expression can also be one from Regexp::Common
reset
This empty the string inside the object.
reverse
Given a string, this return a new Module::Generic::Scalar object with the given string cin reverse order.
right
Provided with a number and this will get the chunk starting from the right of the string object.
Module::Generic::Scalar->new( "Hello world" )->right( 5 );
# will produce: world
See also "left"
rindex
Given a sub string and an optional position, and this returns the position at which the sub string was found, starting from the end.
rtrim
This removes any new line and space characters, i.e. \r
and \n
at the end of the string.
It takes an optional argument that can be an alternative string to remove at the end of the sstring or a regular expression, such as one provided with "perlfunc/qr"
$s->rtrim( qr/[[:blank:]\h]+/ ); # Remove all kind of trailing whitespaces
It returns the object itself for chaining.
Regular expression can also be one from Regexp::Common
See also "ltrim"
scalar
Returns the string within this scalar object. This calls "as_string"
set
Provided with a scalar reference or scalar-based object like Module::Generic::Scalar or an array reference and this sets the current string.
This acts the exact same way as for "new", except it acts on the current object string.
split
Provided with a string or an expression and this returns the list in list context or, in scalar context, an array reference as an Module::Generic::Array object.
Be careful that you cannot just do like perl's original split such as:
my $a = $s->split( /\n/ );
Because /\n/
is not passed as an argument, i.e. it results in no argument being passed, so you do need to either provide the expression as "\n"
or as a regular expression:
my $a = $s->split( qr/\n/ );
It will warn you if no argument was provided.
Regular expression can also be one from Regexp::Common
sprintf
Provided with a list of arguments, and this replace the placeholders just like "sprintf" in perlfunc does.
substr
Provided with an offset, an optional length and an optional replacement string, and this modifies in-place the current object string and return a new Module::Generic::Scalar object of the string segment thus altered or removed or replaced.
See "substr" in perlfunc for more information.
$s = 'I disapprove of what you say, but I will defend to the death your right to say it'; # Evelyn Hall, "Friends of Voltaire"
$s->substr( 2, 13 );
# returns a new object containing 'disapprove of'
$s->substr( 2, 13, 'really do not approve' );
# returns a new object containing 'disapprove of' and the current object now contains:
# 'I really do not approve what you say, but I will defend to the death your right to say it'
tr
Provided with a search list and a replacement list and this will perform just like the perl core "tr" in perlfunc function.
It also accepts options like cdsr
and returns the resulting value.
trim
Provided with a target string or a regular expression, and this will remove any occurence of them in the string object.
uc
Takes the current string object and returns a new Module::Generic::Scalar object with the string all in upper case.
ucfirst
Takes the current string object and returns a new Module::Generic::Scalar object with the first character of the string in upper case.
undef
Sets the underlying string object to undef.
This would make
print( $s->defined ? 'defined' : 'undefined', "\n" );
return false, but becareful that you cannot do:
print( $s ? 'defined' : 'undefined', "\n" );
Because $s is the object so it would always return true.
If you stringify it like
print( "$s" ? 'defined' : 'undefined', "\n" );
It would still return as defined, because this would be a defined string, albeit empty
unpack
my $array = $s->unpack( "W" );
# Returns the ord() value as a Module::Generic::Array object
$s->unpack( "W" )->length;
# Object context. Ditto
my( $date, $desc, $income, $expend ) = $s->unpack( "A10xA27xA7A*" );
# Returns a regular list of values
Does the reverse of "pack". Provided with a template, it takes the string object and expands it out into a list of values.
It returns an Module::Generic::Array object in object or scalar context, and a regular list in list context.
upper
Alias for "uc"
SERIALISATION
Serialisation by CBOR, Sereal and Storable::Improved (or the legacy Storable) is supported by this package. To that effect, the following subroutines are implemented: FREEZE
, THAW
, STORABLE_freeze
and STORABLE_thaw
SEE ALSO
Module::Generic::Number, Module::Generic::Array, Module::Generic::Boolean, Module::Generic::Hash, Module::Generic::Dynamic
AUTHOR
Jacques Deguest <jack@deguest.jp>
COPYRIGHT & LICENSE
Copyright (c) 2000-2024 DEGUEST Pte. Ltd.
You can use, copy, modify and redistribute this package and associated files under the same terms as Perl itself.