NAME
XS::Check - Check XS for some common problems
SYNOPSIS
use FindBin '$Bin';
use XS::Check;
my $check = XS::Check->new ();
$check->check_file ("$Bin/synopsis.xs");
produces output
/usr/home/ben/projects/xs-check/examples/synopsis.xs:3: x not a constant type.
/usr/home/ben/projects/xs-check/examples/synopsis.xs:3: len is not a STRLEN variable (unsigned int ).
(This example is included as synopsis.pl in the distribution.)
VERSION
This documents version 0.12 of XS-Check corresponding to git commit 09c996b41cd1b91df7a1bb10a3b5624bb361939f released on Sun May 9 10:44:08 2021 +0900.
DESCRIPTION
This module offers ways to check XS files for some common flaws which we have tripped over.
METHODS
new
my $check = XS::Check->new ();
Make a new XS::Check object. The checks are then run using "check" or "check_file".
Changing where the messages go
The messages from "check" or "check_file" are usually printed using Perl's built-in warn function. If you need to have errors reported some other way, supply a code reference to new with the key reporter, as follows:
my $usercheck = XS::Check->new (reporter => sub { print "help!" });
The function you supply is then called back when "check" or "check_file" finds something to remark on. The function is called with a hash containing the fields
- file
-
the file name of the file where the error occurred, if using "check_file" or if set with "set_file", otherwise the undefined value (undef),
- line
-
the line number where the error occurred, starting from 1,
- message
-
the message from the module, a text string.
The following example demonstrates a user-defined callback using the message and line fields:
use XS::Check;
my $rchecker = XS::Check->new (reporter => \& reporter);
$rchecker->check ("Perl_croak ('croaking');\n");
sub reporter
{
my %rstuff = @_;
print "$rstuff{message} at $rstuff{line}.\n";
}
produces output
Remove the 'Perl_' prefix from Perl_croak at 1.
(This example is included as reporter.pl in the distribution.)
This was added in version 0.07 of the module.
check
$check->check ($xs);
See "SUGGESTIONS" for what this reports.
check_file
$check->check_file ($xs_file);
Convenience method to read in $xs_file then run "check" on it.
This assumes UTF-8 encoding of $xs_file.
set_file
$check->set_file ($file);
Set the file name for error reporting. Use any false value to clear it. For example:
use XS::Check;
my $check = XS::Check->new ();
my $xs = "Perl_croak (\"frog\")\n";
$check->check ($xs);
$check->set_file ('Yabadabado');
$check->check ($xs);
$check->set_file ('');
$check->check ($xs);
produces output
1: Remove the 'Perl_' prefix from Perl_croak.
Yabadabado:1: Remove the 'Perl_' prefix from Perl_croak.
1: Remove the 'Perl_' prefix from Perl_croak.
(This example is included as set-file.pl in the distribution.)
This was added in version 0.08 of the module.
SUGGESTIONS
This section details the possible suggestions made by the module and the motivations behind them.
Use STRLEN in SvPV
Using an int type for the second argument to SvPV may cause errors on 64-bit Perls. Within the macro, the address of the variable is taken and then passed to a Perl function, so if the size of the variable doesn't match the size of Perl's STRLEN, an error may occur.
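The size mismatch described above can be sketched in plain C without the Perl headers. Everything here is a hypothetical stand-in rather than Perl's real API: DEMO_STRLEN assumes that STRLEN is a size_t-sized type, and demo_get_pv only imitates the way SvPV writes the length through the address of its second argument.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for Perl's STRLEN, assumed to be size_t-sized. */
typedef size_t DEMO_STRLEN;

/* Hypothetical stand-in for what SvPV does with its second argument:
   it receives the ADDRESS of the length variable and writes a full
   DEMO_STRLEN through that address. */
const char *
demo_get_pv (const char * buffer, DEMO_STRLEN * len_out)
{
    *len_out = strlen (buffer);
    return buffer;
}

/* Correct usage: the length variable has exactly the type which the
   callee writes through the pointer. */
size_t
demo_length_ok (const char * buffer)
{
    DEMO_STRLEN len;
    const char * pv = demo_get_pv (buffer, & len);
    (void) pv;
    return len;
}

/* Returns 1 if "int" is narrower than DEMO_STRLEN on this platform.
   In that case, passing the address of an int (via a cast) would let
   the callee write past the end of the variable. */
int
demo_int_is_too_narrow (void)
{
    return sizeof (int) < sizeof (DEMO_STRLEN);
}
```

On a typical 64-bit platform demo_int_is_too_narrow returns 1, which is the case the module's message about STRLEN is warning about.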
Use const char * for return value of SvPV
The pointer returned by SvPV is the actual Perl buffer, not a copy, so unless one actually wants to write into it, it's better to use const char * to make sure one does not overwrite it.
malloc/calloc/realloc/free
The C standard library functions malloc, calloc, realloc, and free should usually be replaced with Newx, Newxz, Renew, and Safefree respectively in Perl XS code, because the C standard library functions may cause "free to wrong pool" errors on multithreaded Windows Perls.
Perl_ prefix
Functions of the form Perl_croak should usually not be used; use just croak instead. The functions with the Perl_ prefix are the actual functions, and croak and other such functions are actually macros, but these macros contain hidden arguments. (The hidden arguments are the pTHX_ and similar things seen in the Perl source code.)
This was added in version 0.04 of the module.
Don't use (void) in arguments
XS functions cannot use the ANSI C (void) to indicate that they do not take any arguments. Instead, this results in a variable called "void" being created.
This was added in version 0.06 of the module.
Dereferencing av_fetch or hv_fetch
One should not dereference the return value of av_fetch or hv_fetch without checking for NULL (zero pointer), since it is possible to get NULL, for example if an array is created with only a tenth element.
For an extended discussion, see http://blogs.perl.org/users/ben_bullock/2020/02/av-fetch-can-return-null.html.
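The "only a tenth element" case can be sketched in plain C. The demo_array type and demo_fetch function below are hypothetical stand-ins and not Perl's API; they only imitate the way av_fetch hands back a pointer to a slot, or NULL when nothing is stored there.

```c
#include <stddef.h>

/* Hypothetical stand-in for an AV: slots which were never assigned
   hold NULL. */
typedef struct {
    int ** slots;
    size_t n_slots;
} demo_array;

/* Hypothetical stand-in for av_fetch: returns a pointer to the slot,
   or NULL if the index is out of range or the slot was never filled. */
int **
demo_fetch (const demo_array * array, size_t index)
{
    if (index >= array->n_slots || array->slots[index] == NULL) {
        return NULL;
    }
    return & array->slots[index];
}

/* Checked access: test the returned pointer before dereferencing it.
   Writing "** demo_fetch (array, index)" directly would dereference
   NULL for any empty slot. */
int
demo_value_or_default (const demo_array * array, size_t index, int fallback)
{
    int ** entry = demo_fetch (array, index);
    if (entry == NULL) {
        return fallback;
    }
    return ** entry;
}
```

With ten slots of which only the last is filled, demo_value_or_default returns the stored value for index 9 and the fallback for indices 0 to 8.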
Currently this check does not actually check that the returned value is checked for non-nullness before being dereferenced, only that there is nothing of the form * av_fetch in your code.
This was added in version 0.09 of the module.
Put whitespace before hash comments
The XS manual suggests putting whitespace before # comments to distinguish them from preprocessor statements.
See https://perldoc.perl.org/perlxs#Inserting-POD,-Comments-and-C-Preprocessor-Directives.
Comments can be added to XSUBs by placing a # as the first non-whitespace of a line. Care should be taken to avoid making the comment look like a C preprocessor directive, lest it be interpreted as such. The simplest way to prevent this is to put whitespace in front of the #.
This was added in version 0.09 of the module.
Add one to av_len
The av_len function is something of a booby trap in that it returns the length of an AV * minus one, so if it is used as-is, one element of the array will be missed. The module does a simplistic check of seeing whether you have added one to av_len.
This was added in version 0.10 of the module.
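The off-by-one with av_len can be sketched in plain C. The function demo_top_index is a hypothetical stand-in which only imitates av_len returning the highest index (or -1 for an empty array) rather than the number of elements; none of the names below belong to Perl's API.

```c
#include <stddef.h>

/* Hypothetical stand-in for av_len: returns the highest index of the
   array, which is the number of elements minus one (-1 if empty). */
long
demo_top_index (size_t n_elements)
{
    return (long) n_elements - 1;
}

/* Correct: add one to the top index to get the element count. */
long
demo_sum_all (const long * elements, size_t n_elements)
{
    long count = demo_top_index (n_elements) + 1;
    long sum = 0;
    long i;
    for (i = 0; i < count; i++) {
        sum += elements[i];
    }
    return sum;
}

/* The booby trap: using the top index as if it were the count, so
   the final element is never visited. */
long
demo_sum_missing_last (const long * elements, size_t n_elements)
{
    long count = demo_top_index (n_elements); /* forgot the + 1 */
    long sum = 0;
    long i;
    for (i = 0; i < count; i++) {
        sum += elements[i];
    }
    return sum;
}
```

For the three elements 1, 2, 3, demo_sum_all returns 6, whereas demo_sum_missing_last returns 3 because the last element is skipped.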
LIMITATIONS
As of 0.12, the module has the following limitations.
- Struct members
-
The module is not very good at parsing struct members, so XS code like the following doesn't get dealt with properly:
s.txt = SvPV (sv, s.len);
- UTF-8 only
-
"check_file" uses "read_text" in File::Slurper to read the text, which means it only takes UTF-8.
- Variable declarations rely on a simplistic hack
-
The current method of parsing variable declarations uses a very simplistic hack, and it is likely to produce false results if a variable name is used twice for two different things in the same file.
- Variables declared within function definitions are not parsed
-
The following variable length is not dealt with correctly:
static void
sv_to_text_fuzzy (SV * text, STRLEN length)
{
    const unsigned char * stuff;
    /* Copy the string in "text" into "text_fuzzy". */
    stuff = (unsigned char *) SvPV (text, length);
DEPENDENCIES
- C::Tokenize
-
This supplies the regular expressions used to parse C by the module.
- "read_text" in File::Slurper
-
This is used by "check_file".
- Text::LineNumber
-
This is used to get the line numbers.
- Carp
COMMAND-LINE TOOL
A command line tool called checkxs is installed with the module. It runs the "check_file" method on each file named on the command line.
checkxs Some.xs
There are two options:
SEE ALSO
Other CPAN modules
- ExtUtils::ParseXS
-
ExtUtils::ParseXS is Perl's parser for XS which converts XS code into C.
- Test::XS::Check
-
Test::XS::Check is a test module based on this one.
- XS::Tutorial
-
XS::Tutorial is a tutorial about programming in XS.
More information
- Perl XS modules and CPAN testers
-
A collection of more or less obscure bugs found by CPAN testers, the original inspiration for this module.
AUTHOR
Ben Bullock, <bkb@cpan.org>
COPYRIGHT & LICENCE
This package and associated files are copyright (C) 2017-2021 Ben Bullock.
You can use, copy, modify and redistribute this package and associated files under the Perl Artistic Licence or the GNU General Public Licence.