NAME

Getting Started with the Parrot Compiler Tools - A Complete Beginner's Guide to the Parrot Compiler Tools.

DESCRIPTION

This document can be considered your Number One entry point for starting to use the Parrot Compiler Tools (PCT). As there's a whole lot of acronyms flying around (all of which can be found in Parrot's glossary at http://www.parrotcode.org/glossary.html), this document will get you up and running within 10 minutes (that excludes building Parrot). Once you got started, it's a matter of getting your hands dirty and get experienced using the tools. Feel free to ask questions on irc.perl.org#parrot.

GETTING STARTED

Getting started using the PCT is easy, and this document will guide you through the steps. An overview:

  • download and build Parrot

  • generate a language stub

  • customize your language

The acronyms you should learn are the following:

  • PASM

    Stands for Parrot Assembly language, and is a textual form of the bytecodes that Parrot is running. PASM's syntax is very primitive, which is a pain to write, which is why Parrot has something called PIR (read on).

  • PIR

    Stands for Parrot Intermediate Representation. This is a fancy layer of syntactic sugar on top of PASM. If you program Parrot natively, you write in PIR. Other documents discuss PIR syntax, for instance http://www.parrotcode.org/docs/pdd/pdd19_pir.html.

  • PGE

    Stands for Parrot Grammar Engine, and is the regular expression engine of Parrot. It is written in PIR. Regular expressions in Perl 6 are more powerful than Perl 5's regexes, as you can write language grammars in it more easily. These regular expressions are written in Perl 6 rules. See Perl 6 synopis 5 (S05) for the syntax of Perl 6 rules. A grammar is processed by PGE to create a language parser. The grammar can contain special tokens that look like "{*}", which invoke a subroutine by the same name as the current rule. These invoked subroutines are commonly called actions.

  • NQP

    Stands for Not Quite Perl, and is a subset of Perl 6. Yeah, that's right, you can already program in Perl 6 today (well, if you're happy with a simpler version of the language). NQP is implemented in PIR. The reason for building NQP was that it makes writing the parse actions (see the entry for PGE) a whole lot easier. Although PIR is a neat language, it's still quite primitive.

  • PAST

    PAST stands for Parrot Abstract Syntax Tree, and is a library of classes that define the nodes for abstract syntax trees. Typical node types are PAST::Val representing literal values (such as 42, "Hello World", etc.) and PAST::Var which represents variables (for instance when writing "my $var;" in Perl 6). The parse actions discussed earlier can construct these PAST nodes, so that at the end of the parse, you have a complete abstract syntax tree representing the program you compiled.

  • POST

    POST stands for Parrot Opcode Syntax Tree. This is another library of classes that define the nodes for so-called opcode syntax trees. For this beginner's guide you can forget about it, but at some point you'll see the term POST. Just forget about it for now.

Now we discussed the most important acronyms, it's time to get up and running.

Download and Build Parrot

Get Parrot from http://www.parrotcode.org/source.html and build it. If you're lucky and you have a fast computer, it should be done within 5 minutes. It's always useful to run the test suite, by typing:

$ make test

Generate a Language Stub

There's a special script for newcomers like yourself reading this. It's located in tools/dev/mk_language_shell.pl. You invoke it like so (assuming you're in parrot's root directory):

$ perl tools/dev/mk_language_shell <language> <location>

For instance, if you want to create a language called Foo and locate it in the directory languages/foo, you'd type:

$ perl tools/dev/mk_language_shell Foo languages/foo

This will create a complete language that compiles out of the box, except for the fact that there's no Makefile yet. In order to resolve that, you should make a small edit to a file located in config/gen/languages.pm. You'll see an array named languages being initialized with a list of languages. Just add the name of the directory of your language, in this example's case foo. Save the file, and run the following command from the parrot root directory:

$ perl Configure.pl

This will regenerate all makefiles, including your language's one. Now, in order to compile your language, type this:

$ cd languages/foo
$ make
$ make test

Yes, that's right, there's even a test file already created for you. This makes setting up the tests for your language very easy!

The generated directories and files have the following structure:

foo/
   /config/makefiles/root.in  # input for the Makefile generator
                              # as long as you don't add source files,
                              # there's no need to update this file.
   /src/
       /parser/
              /actions.pm     # the language's grammar rules
              /grammar.pm     # a file containing the parse actions
                              # these files should be updated to change your
                              # language
       /builtins/
                /say.pir      # a file containing a built-in function
                              # add more "standard library" routines here
   /t/
     /00-sanity.t             # a test file
     /harness                 # file to set up the test framework
                              # more tests can be added here

   /foo.pir                   # file containing the main routine

When you want to run a script through your language's compiler, (assuming you're located in your language's directory, in this example's case languages/foo) type:

$ ../../parrot foo.pbc <file.foo>

You can give an command line option to your compiler which specifies what kind of output you want. This is the target option:

$ ../../parrot foo.pbc --target=pir <file.foo>

this will print the generated PIR instructions to stdout. Other options for the target option are: parse, past, post.

Customize Your Language

You probably have some language syntax in mind that you'd want to implement. Note that the grammar defined in the file languages/foo/src/parser/grammar.pg and the parse actions in the file languages/foo/src/parser/actions.pm are closely related (note the names of the action methods). It's very important to update the methods accordingly if you change the grammar rules.

COMMON ERROR MESSAGES

This section describes some common error messages and how to resolve them. This is a work in progress, so you might not find your issue/solution here. If you have anything new to add, please send a patch (or an email) to parrotbug@parrotcode.org.

  • no result object

    This is the case when you try to retrieve the result object from a subrule, but the subrule's action didn't set a result object using the make command. Check whether there's an action invocation token "{*}" in the subrule and whether that subrule's action has a make command.

WHERE TO GO FROM HERE?

Documents

The following documents might be useful to learn more:

Other Languages

You can also have a look at some existing languages that are being developed using the PCT, all located in the languages subdirectory of Parrot. These are: perl6 (commonly referred to as Rakudo), lua (see the "pct" in lua directory), ecmascript (a standardized JavaScript), punie (Perl 1 on Parrot), pynie (python on Parrot) and cardinal (Ruby on Parrot).

IRC

Everyday, a bunch of Parrot enthusiasts can be found on #parrot on irc.perl.org. You're welcome to ask questions.

SUGGESTIONS

If you have suggestions, improvements, tips or complaints about this document, please send an email to parrot-porters@perl.org.