NAME

docs/debug.pod - Debugging Parrot

ABSTRACT

This document describes how to debug various parts of Parrot.

VERSION

$Revision $

THE PARROT BINARY

Using a debugger

By default the parrot binary is built with debugging symbols. This means that you can run parrot under a debugger such as gdb.

Debugging support can be explicitly enabled with:

shell> perl Configure.pl --debugging
shell> make

For testing, it can be a good idea to also run the test suite without debug support. Debugging can be turned off with:

shell> perl Configure.pl --debugging=0
shell> make

Using a memory checker

You could, and should, also run the tests with a memory checker such as valgrind. You can enable valgrind by running:

shell> make test VALGRIND="valgrind --logfile=/tmp/grind"

Another possibility is to use Electric Fence, or ...

MEMORY MANAGEMENT

Some of the more frequent and exasperating parrot bugs are related to memory management in general, and garbage collection in particular.

Infant mortality

See docs/dev/infant.pod for details of one frequent problem: infant mortality. Infant mortality is when you create a Parrot object, but the garbage collector runs before you put it into a Parrot register or in something else that is itself within a Parrot register.
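The failure mode can be illustrated with a toy mark-and-sweep collector. This is only a minimal sketch: the heap, the "registers", and every toy_* name are hypothetical and are not taken from Parrot's sources. It shows a newborn object being swept because a collection runs before the object is stored anywhere the collector can see.

```c
#include <stdlib.h>

/* Hypothetical toy heap; none of these names come from Parrot. */
#define TOY_HEAP_SIZE 8
#define TOY_NUM_REGS  4

typedef struct ToyObj {
    int in_use;   /* slot currently holds an object */
    int marked;   /* set during the mark phase */
    int value;
} ToyObj;

static ToyObj  toy_heap[TOY_HEAP_SIZE];
static ToyObj *toy_regs[TOY_NUM_REGS];   /* the only GC roots */

/* Mark everything referenced from a register, then sweep the rest. */
static int toy_collect(void)
{
    int i, freed = 0;

    for (i = 0; i < TOY_HEAP_SIZE; i++)
        toy_heap[i].marked = 0;
    for (i = 0; i < TOY_NUM_REGS; i++)
        if (toy_regs[i])
            toy_regs[i]->marked = 1;
    for (i = 0; i < TOY_HEAP_SIZE; i++)
        if (toy_heap[i].in_use && !toy_heap[i].marked) {
            toy_heap[i].in_use = 0;
            freed++;
        }
    return freed;
}

/* Hand out the first free slot, or NULL if the heap is full. */
static ToyObj *toy_alloc(int value)
{
    int i;

    for (i = 0; i < TOY_HEAP_SIZE; i++)
        if (!toy_heap[i].in_use) {
            toy_heap[i].in_use = 1;
            toy_heap[i].marked = 0;
            toy_heap[i].value  = value;
            return &toy_heap[i];
        }
    return NULL;
}

/* Returns 1 if a newborn object survives a collection, 0 if it was swept. */
int toy_infant_survives(int root_before_gc)
{
    ToyObj *infant;
    int i;

    /* Start from an empty heap and empty registers. */
    for (i = 0; i < TOY_HEAP_SIZE; i++)
        toy_heap[i].in_use = 0;
    for (i = 0; i < TOY_NUM_REGS; i++)
        toy_regs[i] = NULL;

    infant = toy_alloc(42);
    if (infant == NULL)
        return 0;
    if (root_before_gc)
        toy_regs[0] = infant;   /* anchored in time */
    toy_collect();              /* the collector runs here */
    if (!root_before_gc)
        toy_regs[0] = infant;   /* too late: the slot was already swept */
    return infant->in_use;
}
```

Calling toy_infant_survives(1) anchors the object first and it survives; toy_infant_survives(0) leaves a register pointing at a slot the collector has already reclaimed, which is the kind of dangling reference the tools below help to find.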

To help in resolving these issues, the parrot binary accepts a --gc-debug flag. This flag makes garbage collection occur as frequently as possible, which maximizes the probability that any newborn objects will run afoul of the garbage collector.

Within the --gc-debug mode, there is another tool to help narrow down the problem. You can edit src/gc/dod.c and #define the GC_VERBOSE flag to 1. After recompiling parrot, the garbage collector will perform additional checks. After the garbage collector has traced all objects to find which ones are still alive, it will scan through all of the dead objects to see if any of them believe they are alive (which will happen for infants, since they come into existence marked live). If it finds any, it will print them out. You can then re-run the program with a breakpoint set on the routine that allocated the object (e.g. get_free_object in src/gc/smallobject.c). You'll probably want to make the breakpoint conditional on the object having the version number that was reported, because the same memory location will probably hold many different objects over the lifetime of the program.
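To see why the address alone is not enough to identify an object, consider this small sketch of a slot allocator that bumps a version counter each time a slot is handed out. The Slot type and the slot_* functions are hypothetical and only illustrate the idea; Parrot's real object headers and allocator are laid out differently.

```c
#include <stddef.h>

#define SLOT_COUNT 2

/* Hypothetical object header: not Parrot's actual layout. */
typedef struct Slot {
    int      in_use;
    unsigned version;   /* bumped every time the slot is handed out */
} Slot;

static Slot slots[SLOT_COUNT];

/* Hand out the first free slot and give it a fresh version number. */
Slot *slot_alloc(void)
{
    size_t i;

    for (i = 0; i < SLOT_COUNT; i++) {
        if (!slots[i].in_use) {
            slots[i].in_use = 1;
            slots[i].version++;
            return &slots[i];
        }
    }
    return NULL;
}

/* Return a slot to the pool; its version is kept for the next owner. */
void slot_free(Slot *s)
{
    s->in_use = 0;
}

/*
 * Returns 1 when two successive objects occupied the same address
 * but carried different version numbers.
 */
int slot_reuse_demo(void)
{
    Slot    *first = slot_alloc();
    Slot    *second;
    unsigned first_version;
    int      same_address_new_version;

    if (first == NULL)
        return 0;
    first_version = first->version;
    slot_free(first);

    second = slot_alloc();   /* reuses the slot that was just freed */
    same_address_new_version =
        second == first && second->version != first_version;
    if (second != NULL)
        slot_free(second);
    return same_address_new_version;
}
```

A breakpoint that only tests the address would fire for both objects in slot_reuse_demo; adding the version to the condition singles out the one that was reported.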

PIR AND PASM CODE

Let's say you have written (or generated) a huge .pasm or .pir file. It's not working. You'd like some help in figuring out why.

pdb

One possible tool is pdb, the Parrot Debugger. See docs/debugger.pod for details on it.

stabs

If you are running on a JIT-capable machine, you can also try using gdb by having the JIT compiler generate stabs metadata and then stepping through the code with gdb as if it were any other language.

To use this, you'll want to use parrot to generate your bytecode (.pbc file). It is not strictly necessary, but you'll get more information into the bytecode this way.

Let's say your file is named test.pasm. (Note: these instructions will also work if you use test.pir everywhere test.pasm occurs.)

Step 1: Generate the .pbc file with extra debugging information.

shell> parrot -d -o test.pbc test.pasm

Step 2: Start up parrot under gdb

shell> gdb parrot

or

shell> emacs &
(in emacs) M-x gdb
(in emacs) type "parrot" so it says "gdb parrot"

Step 3: Set a breakpoint on runops_jit

gdb> b runops_jit

Step 4: Run your program under gdb with JIT and debugging on

gdb> run -j -D4 test.pbc

Step 5: gdb will stop at the beginning of runops_jit. Step through the lines until just before the JITed code is executed (the line will be something like (jit_code)(interpreter,pc)).

gdb> n
gdb> n
.
.
.

Step 6: Load in the debugging information from the symbol file that the JIT just generated.

gdb> add-symbol-file test.o 0

Step 7: Step into the JITed code

gdb> s

At this point, you can step through the instructions, or print out the various Parrot registers. FIXME: gdb will know about I0-I31, N0-N31, S0-S31, and P0-P31.

WARNING: Stepping too far

One thing to watch out for is that gdb gets confused when attempting to step over certain instructions. The only ones that I have noticed having problems are keyed operations. With my version of gdb, if I do 'n' to step over the instruction, gdb will start running and only stop when the entire parrot program has finished. To work around this, do 'si' twice just before executing any keyed op. For some reason, gdb can then figure out when it's supposed to stop next. If you know of a better technique, please let the mailing list know (parrot-porters@perl.org).

PIR CODE GENERATION

The parrot binary has a bunch of debugging flags for spewing out information about various aspects of its processing. See running.pod for a list of flags. Or have a look at the information provided by:

shell> parrot --help

or

shell> parrot --help-debug

BACKTRACING

auto-magical

If Parrot is built on a system with GNU libc, it is capable of automatically generating a backtrace on stderr for debugging purposes. Currently these automatic backtraces are only generated by assertion failures, but in the future they may also be produced by other bad events (for example, SEGV).

Here is an example of what a backtrace might look like:

Backtrace - Obtained 15 stack frames (max trace depth is 32).
  (unknown)
    Parrot_confess
      Parrot_make_COW_reference
        Parrot_String_get_string
          Parrot_set_s_p
            (unknown)
              (unknown)
                (unknown)
                  (unknown)
                    Parrot_runops_fromc_args
                      Parrot_runcode
                        (unknown)
                          imcc_run
                            (unknown)
                              __libc_start_main
                                (unknown)

It must be noted that glibc's backtraces are not without limitations. Its method depends completely on information that is available at run time.

  • Functions marked as static can only be identified by address, as they have no "symbol name" for dynamic linking in the executable's symbol table. Static functions will appear as (unknown).

  • There must be some means available for walking the stack at runtime. On x86 and x86-64 the "stack pointer" must be in the esp or rsp register. For example, the following gcc compilation flag would break backtracing (except for functions that do dynamic allocation on the stack, as this optimization cannot be applied to them):

    perl Configure.pl --ccflags=-fomit-frame-pointer

  • Some platforms may require extra linker flags in order to get all of the required symbols exported in the symbol table. For example:

    perl Configure.pl --ccflags=-rdynamic

  • Any debugging information embedded in the object is not accessible, so file names and line numbers cannot be included as part of the backtrace information.

  • Be warned that signals may cause incorrect backtraces!
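The limitations above can be explored with a few lines of C. This is a minimal sketch, assuming the automatic backtraces are built on glibc's backtrace() family from <execinfo.h> (the text above does not name the functions it uses). The names print_backtrace, hidden_helper, and visible_caller are made up for this example.

```c
#include <execinfo.h>
#include <stdio.h>
#include <unistd.h>

#define MAX_TRACE_DEPTH 32

/* Print the current call stack to stderr; returns the number of frames. */
int print_backtrace(void)
{
    void *frames[MAX_TRACE_DEPTH];
    int   depth = backtrace(frames, MAX_TRACE_DEPTH);

    fprintf(stderr,
            "Backtrace - Obtained %d stack frames (max trace depth is %d).\n",
            depth, MAX_TRACE_DEPTH);
    fflush(stderr);

    /* Writes one line per frame directly to the file descriptor. */
    backtrace_symbols_fd(frames, depth, STDERR_FILENO);
    return depth;
}

/* static: has no entry in the dynamic symbol table, so no name is shown. */
static int hidden_helper(void)
{
    return print_backtrace();
}

/* External linkage: can be named if the symbol is exported. */
int visible_caller(void)
{
    return hidden_helper();
}
```

Calling visible_caller() from a program linked with -rdynamic and compiled without optimization should show visible_caller by name, while the frame for hidden_helper is identified only by address. With optimization enabled the compiler may inline the helper or turn the calls into jumps, so some frames can disappear entirely.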

gdb

On systems not equipped with GNU libc, one will need to use an external debugger to get backtrace information. This method is actually more capable than the auto-magical approach, as most debuggers will use debugging information if it's available in the object code (for example, if parrot was built with -g).

Since the Parrot_confess symbol is always compiled into parrot, it can be used as a breakpoint to obtain a backtrace. Here is an example of doing this with gdb and a version of parrot compiled with gcc and the -g flag.

$ gdb parrot
GNU gdb 6.6
Copyright (C) 2006 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "i686-pc-linux-gnu"...
Using host libthread_db library "/lib/libthread_db.so.1".
(gdb) b main
Breakpoint 1 at 0x80488a0: file src/main.c, line 38.
(gdb) r foo.pir
Starting program: /home/moanui/jhoblitt/parrot/parrot foo.pir
Failed to read a valid object file image from memory.
[Thread debugging using libthread_db enabled]
[New Thread -1213900128 (LWP 23148)]
[Switching to Thread -1213900128 (LWP 23148)]

Breakpoint 1, main (argc=-400292727, argv=0x159a0) at src/main.c:38
38      {
(gdb) b Parrot_confess
Breakpoint 2 at 0xb7c542a0: file src/exceptions.c, line 767.
(gdb) c
Continuing.
[New Thread -1214039152 (LWP 23151)]
[New Thread -1222431856 (LWP 23152)]
1..1

Breakpoint 2, Parrot_confess (cond=0xb7eeda65 "s", 
    file=0xb7eeda58 "src/string.c", line=129) at src/exceptions.c:767
warning: Source file is more recent than executable.
767     {
(gdb) bt full
#0  Parrot_confess (cond=0xb7eeda65 "s", file=0xb7eeda58 "src/string.c", 
    line=129) at src/exceptions.c:767
No locals.
#1  0xb7c433b1 in Parrot_make_COW_reference (interp=0x804e008, s=0x0)
    at src/string.c:129
        d = (STRING *) 0x81c21b8
        __PRETTY_FUNCTION__ = "Parrot_make_COW_reference"
#2  0xb7e40db3 in Parrot_String_get_string (interp=0x804e008, pmc=0x81c8578)
    at src/pmc/string.c:310
No locals.
#3  0xb7cc7d41 in Parrot_set_s_p (cur_opcode=0x825d470, interp=0x804e008)
    at src/ops/set.ops:159
No locals.
#4  0xb7c9da32 in runops_slow_core (interp=0x804e008, pc=0x825d470)
    at src/runops_cores.c:184
No locals.
#5  0xb7c67acf in runops_int (interp=0x804e008, offset=0)
    at src/interpreter.c:816
        pc = (opcode_t * const) 0x8239730
        lo_var_ptr = 134537224
        core = (opcode_t *(*)(Parrot_Interp, 
    opcode_t *)) 0xb7c9d940 <runops_slow_core at src/runops_cores.c:169>
#6  0xb7c6854e in runops (interp=0x804e008, offs=0) at src/inter_run.c:100
        offset = 0
        old_runloop_id = 0
        our_runloop_level = 1
        our_runloop_id = 1
#7  0xb7c687da in runops_args (interp=0x804e008, sub=0x8204d58, obj=0x80912d8, 
    meth_unused=0x0, sig=0xb7eefca6 "vP", 
    ap=0xbfec614c "@M \b�b��P�\222K\230�\004\b@\236\"\b@M \bXM \b\004����t��\b�\004\b\001") at src/inter_run.c:216
        offset = 0
        dest = (opcode_t *) 0x8239730
        ctx = (parrot_context_t *) 0x822a3b0
        new_sig = ""
        sig_p = 0xb7eefca7 "P"
        old_ctx = (parrot_context_t * const) 0x804e298
#8  0xb7c688fb in Parrot_runops_fromc_args (interp=0x804e008, sub=0x8204d58, 
    sig=0xb7eefca6 "vP") at src/inter_run.c:293
        args = 0xbfec614c "@M \b�b��P�\222K\230�\004\b@\236\"\b@M \bXM \b\004����t��\b�\004\b\001"
        ctx = (parrot_context_t *) 0xb7fa1548
#9  0xb7c50c51 in Parrot_runcode (interp=0x804e008, argc=1, argv=0xbfec62e8)
    at src/embed.c:783
        userargv = (PMC *) 0x8204d40
        main_sub = (PMC *) 0x8204d58
#10 0xb7ed74a1 in imcc_run_pbc (interp=0x804e008, obj_file=0, output_file=0x0, 
    argc=1, argv=0xbfec62e8) at compilers/imcc/main.c:614
No locals.
#11 0xb7ed7d90 in imcc_run (interp=0x804e008, sourcefile=0xbfec6e0a "foo.pir", 
    argc=1, argv=0xbfec62e8) at compilers/imcc/main.c:815
        obj_file = 0
        yyscanner = (yyscan_t) 0x822a090
        output_file = 0x0
#12 0x080489b7 in main (argc=136704448, argv=0x825f220) at src/main.c:62
        sourcefile = 0xbfec6e0a "foo.pir"
        interp = (Interp *) 0x804e008
        executable_name = (STRING *) 0x821b8e4
        executable_name_pmc = (PMC *) 0x8204d70
        status = 1267896320
(gdb) 
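The reason a single breakpoint catches every failed assertion is that all of them funnel through one ordinary, non-inline function. The sketch below illustrates that pattern only. TOY_ASSERT, toy_confess, and the other toy_* names are hypothetical, the argument list merely mirrors the (cond, file, line) arguments visible in the transcript above, and unlike a real assertion handler this version records the failure and returns so that it can be exercised in a test.

```c
#include <stdio.h>

static int toy_confess_calls = 0;

/*
 * Single entry point for every failed assertion.  Because it is a real
 * function rather than inline macro code, a debugger can break on it.
 */
void toy_confess(const char *cond, const char *file, unsigned int line)
{
    toy_confess_calls++;
    fprintf(stderr, "%s:%u: failed assertion '%s'\n", file, line, cond);
}

/* Route a failed condition to toy_confess with its text and location. */
#define TOY_ASSERT(x) \
    ((x) ? (void)0 : toy_confess(#x, __FILE__, __LINE__))

/* Number of assertion failures seen so far. */
int toy_confess_count(void)
{
    return toy_confess_calls;
}

/* Example caller: asserts that its string argument is not NULL. */
int toy_check_string(const char *s)
{
    TOY_ASSERT(s);
    return s != NULL;
}
```

With this layout, setting a breakpoint on toy_confess and asking for a backtrace when it is hit shows which caller passed the bad argument, just as the session above does for Parrot_make_COW_reference.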
