NAME
Getting Started with the Parrot Compiler Tools - A Complete Beginner's Guide to the Parrot Compiler Tools.
DESCRIPTION
This document can be considered your Number One entry point for starting to use the Parrot Compiler Tools (PCT). As there's a whole lot of acronyms flying around (all of which can be found in Parrot's glossary at http://www.parrotcode.org/glossary.html), this document will get you up and running within 10 minutes (that excludes building Parrot). Once you got started, it's a matter of getting your hands dirty and get experienced using the tools. Feel free to ask questions on irc.perl.org#parrot.
GETTING STARTED
Getting started using the PCT is easy, and this document will guide you through the steps. An overview:
download and build Parrot
generate a language stub
customize your language
The acronyms you should learn are the following:
PASM
Stands for Parrot Assembly language, and is a textual form of the bytecodes that Parrot is running. PASM's syntax is very primitive, which is a pain to write, which is why Parrot has something called PIR (read on).
PIR
Stands for Parrot Intermediate Representation. This is a fancy layer of syntactic sugar on top of PASM. If you program Parrot natively, you write in PIR. Other documents discuss PIR syntax, for instance http://www.parrotcode.org/docs/pdd/pdd19_pir.html.
PGE
Stands for Parrot Grammar Engine, and is the regular expression engine of Parrot. It is written in PIR. Regular expressions in Perl 6 are more powerful than Perl 5's regexes, as you can write language grammars in it more easily. These
regular expressions
are written in Perl 6rules
. See Perl 6 synopis 5 (S05) for the syntax of Perl 6 rules. A grammar is processed by PGE to create a language parser. The grammar can contain special tokens that look like "{*}", which invoke a subroutine by the same name as the current rule. These invoked subroutines are commonly calledactions
.NQP
Stands for Not Quite Perl, and is a subset of Perl 6. Yeah, that's right, you can already program in Perl 6 today (well, if you're happy with a simpler version of the language). NQP is implemented in PIR. The reason for building NQP was that it makes writing the parse actions (see the entry for
PGE
) a whole lot easier. Although PIR is a neat language, it's still quite primitive.PAST
PAST stands for Parrot Abstract Syntax Tree, and is a library of classes that define the nodes for abstract syntax trees. Typical node types are
PAST::Val
representing literal values (such as 42, "Hello World", etc.) andPAST::Var
which represents variables (for instance when writing "my $var;" in Perl 6). The parse actions discussed earlier can construct these PAST nodes, so that at the end of the parse, you have a complete abstract syntax tree representing the program you compiled.POST
POST stands for Parrot Opcode Syntax Tree. This is another library of classes that define the nodes for so-called opcode syntax trees. For this beginner's guide you can forget about it, but at some point you'll see the term
POST
. Just forget about it for now.
Now we discussed the most important acronyms, it's time to get up and running.
Download and Build Parrot
Get Parrot from http://www.parrotcode.org/source.html
and build it. If you're lucky and you have a fast computer, it should be done within 5 minutes. It's always useful to run the test suite, by typing:
$ make test
Generate a Language Stub
There's a special script for newcomers like yourself reading this. It's located in tools/dev/mk_language_shell.pl
. You invoke it like so (assuming you're in parrot's root directory):
$ perl tools/dev/mk_language_shell <language> <location>
For instance, if you want to create a language called Foo
and locate it in the directory languages/foo
, you'd type:
$ perl tools/dev/mk_language_shell Foo languages/foo
This will create a complete language that compiles out of the box, except for the fact that there's no Makefile yet. In order to resolve that, you should make a small edit to a file located in config/gen/languages.pm
. You'll see an array named languages
being initialized with a list of languages. Just add the name of the directory of your language, in this example's case foo
. Save the file, and run the following command from the parrot root directory:
$ perl Configure.pl
This will regenerate all makefiles, including your language's one. Now, in order to compile your language, type this:
$ cd languages/foo
$ make
$ make test
Yes, that's right, there's even a test file already created for you. This makes setting up the tests for your language very easy!
The generated directories and files have the following structure:
foo/
/config/makefiles/root.in # input for the Makefile generator
# as long as you don't add source files,
# there's no need to update this file.
/src/
/parser/
/actions.pm # the language's grammar rules
/grammar.pm # a file containing the parse actions
# these files should be updated to change your
# language
/builtins/
/say.pir # a file containing a built-in function
# add more "standard library" routines here
/t/
/00-sanity.t # a test file
/harness # file to set up the test framework
# more tests can be added here
/foo.pir # file containing the main routine
When you want to run a script through your language's compiler, (assuming you're located in your language's directory, in this example's case languages/foo
) type:
$ ../../parrot foo.pbc <file.foo>
You can give an command line option to your compiler which specifies what kind of output you want. This is the target
option:
$ ../../parrot foo.pbc --target=pir <file.foo>
this will print the generated PIR instructions to stdout. Other options for the target
option are: parse
, past
, post
.
Customize Your Language
You probably have some language syntax in mind that you'd want to implement. Note that the grammar defined in the file languages/foo/src/parser/grammar.pg
and the parse actions in the file languages/foo/src/parser/actions.pm
are closely related (note the names of the action methods). It's very important to update the methods accordingly if you change the grammar rules.
COMMON ERROR MESSAGES
This section describes some common error messages and how to resolve them. This is a work in progress, so you might not find your issue/solution here. If you have anything new to add, please send a patch (or an email) to parrotbug@parrotcode.org
.
no result object
This is the case when you try to retrieve the result object from a subrule, but the subrule's action didn't set a result object using the
make
command. Check whether there's an action invocation token "{*}" in the subrule and whether that subrule's action has amake
command.
WHERE TO GO FROM HERE?
Documents
The following documents might be useful to learn more:
http://dev.perl.org/perl6/doc/design/syn/S05.html
Other Languages
You can also have a look at some existing languages that are being developed using the PCT, all located in the languages
subdirectory of Parrot. These are: perl6 (commonly referred to as Rakudo
), lua (see the "pct" in lua directory), ecmascript (a standardized JavaScript
), punie (Perl 1 on Parrot), pynie (python on Parrot) and cardinal (Ruby on Parrot).
IRC
Everyday, a bunch of Parrot enthusiasts can be found on #parrot on irc.perl.org. You're welcome to ask questions.
SUGGESTIONS
If you have suggestions, improvements, tips or complaints about this document, please send an email to parrot-porters@perl.org
.