NAME
sta2moses - a script that converts an aligned parallel treebank to plain text format (Moses/Giza++ format). Alignment has to be in sta (Stockholm Tree Aligner) format.
SYNOPSIS
sta2moses alignments.xml
DESCRIPTION
This script reads through a parallel treebank using the tree alignment file (alignments.xml) and produces sentence aligned plain text files (to be used with Moses/Giza++). The corpus will be stored in alignments.src and alignments.trg.
SEE ALSO
AUTHOR
Joerg Tiedemann, <jorg.tiedemann@lingfil.uu.se>
COPYRIGHT AND LICENSE
Copyright (C) 2009 by Joerg Tiedemann
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.8 or, at your option, any later version of Perl 5 you may have available.