Changes for version 0.04 - 2012-12-10
- smaller changes to make the distribution cleaner for CPAN
Documentation
a script for converting bitexts
convert a treebank from one format to another
count co-occurrence frequencies for arbitrary features of nodes in a parallel treebank
convert from Stockholm Tree Aligner format to Moses/GIZA++ (plain text)
extract aligned phrases from aligned treebanks
training tree alignment classifiers and aligning syntactic trees
a script for computing precision and recall scores for tree aligmnent
convert treebanks to Moses/GIZA++ format (plain text)
Modules
Perl modules for the alignment of parallel corpora
Class factory for link classification
A wrapper around megam
reading corpus data
Read factored corpora (Moses format)
Class factory for reading parallel corpora
Read sentence aligned bitexts
Read Dublin Subtree Aligner format
Read the Viterbi word alignment produced by GIZA++
Perl extension to read sentence-aligned parallel corpora in Moses format
Read parallel corpora in OPUS format
read parallel corpora with ordered sentence IDs
Read the STockholm Tree Aligner Format
Read data from the WPT word alignment task
Factory class for reading treebanks
Read Alpino XML
Read the output of the Berkeley parser
Read the Penn Treebank format
Read output from the Stanford parser
Read the TigerXML format
Feature extraction for tree alignment
Search algorithms for tree alignment
Alignment as an assignment problem in bipartite graphs
Alignment as an assignment problem with additional constraints
cascaded link search strategies
Simple greedy search for links
Greedy link search with a final step for adding links between unaligned items.
Greedy search with wellformedness constraints
Intersection between source-to-target and target-to-source alignment
Align non-terminal nodes first
Align non-terminal nodes only (greedily)
Link search used in the PaCoMT project
Source-to-target alignment
Source-to-target alignment with constraints
Greedy linking with score thresholds
Greedily align terminal nodes only
Greedy target-to-source alignment
Perl modules implementing a discriminative tree aligner
Module for word alignment
Provides
in lib/Lingua/Align/Classifier/Clues.pm
in lib/Lingua/Align/Classifier/LibSVM.pm
in lib/Lingua/Align/Features/Alignment.pm
in lib/Lingua/Align/Features/Cooccurrence.pm
in lib/Lingua/Align/Features/History.pm
in lib/Lingua/Align/Features/Lexical.pm
in lib/Lingua/Align/Features/Orthography.pm
in lib/Lingua/Align/Features/Tree.pm
in lib/Lingua/Align/LinkSearch/GreedyFinalAnd.pm
in lib/Lingua/Align/LinkSearch/Viterbi.pm
Examples
- examples/README
- examples/europarl/ep-00-12-15.125.en.penn
- examples/europarl/ep-00-12-15.125.en.tiger
- examples/europarl/ep-00-12-15.125.nl.penn
- examples/europarl/ep-00-12-15.125.nl.tiger
- examples/europarl/moses/giza.src-trg/src-trg.A3.final.gz
- examples/europarl/moses/giza.trg-src/trg-src.A3.final.gz
- examples/europarl/moses/model/aligned.grow-diag
- examples/europarl/moses/model/aligned.ids
- examples/europarl/moses/model/aligned.intersect
- examples/europarl/moses/model/lex.0-0.e2f
- examples/europarl/moses/model/lex.0-0.f2e
- examples/europarl/nl-en-weak_125.xml
- examples/europarl/nl-en_125.dublin
- examples/europarl/nl-en_125.xml
- examples/smultron/align.pl
- examples/smultron/moses-eco/corpus/src-trg-int-train.snt
- examples/smultron/moses-eco/corpus/src.vcb
- examples/smultron/moses-eco/corpus/src.vcb.classes
- examples/smultron/moses-eco/corpus/src.vcb.classes.cats
- examples/smultron/moses-eco/corpus/trg-src-int-train.snt
- examples/smultron/moses-eco/corpus/trg.vcb
- examples/smultron/moses-eco/corpus/trg.vcb.classes
- examples/smultron/moses-eco/corpus/trg.vcb.classes.cats
- examples/smultron/moses-eco/giza.src-trg/src-trg.A3.final.gz
- examples/smultron/moses-eco/giza.src-trg/src-trg.gizacfg
- examples/smultron/moses-eco/giza.trg-src/trg-src.A3.final.gz
- examples/smultron/moses-eco/giza.trg-src/trg-src.gizacfg
- examples/smultron/moses-eco/model/aligned.0.src
- examples/smultron/moses-eco/model/aligned.0.trg
- examples/smultron/moses-eco/model/aligned.intersect
- examples/smultron/moses-eco/model/lex.0-0.e2f
- examples/smultron/moses-eco/model/lex.0-0.f2e
- examples/smultron/moses-sophie/corpus/src-trg-int-train.snt
- examples/smultron/moses-sophie/corpus/src.vcb
- examples/smultron/moses-sophie/corpus/src.vcb.classes
- examples/smultron/moses-sophie/corpus/src.vcb.classes.cats
- examples/smultron/moses-sophie/corpus/trg-src-int-train.snt
- examples/smultron/moses-sophie/corpus/trg.vcb
- examples/smultron/moses-sophie/corpus/trg.vcb.classes
- examples/smultron/moses-sophie/corpus/trg.vcb.classes.cats
- examples/smultron/moses-sophie/giza.src-trg/src-trg.A3.final.gz
- examples/smultron/moses-sophie/giza.src-trg/src-trg.cooc
- examples/smultron/moses-sophie/giza.src-trg/src-trg.gizacfg
- examples/smultron/moses-sophie/giza.trg-src/trg-src.A3.final.gz
- examples/smultron/moses-sophie/giza.trg-src/trg-src.cooc
- examples/smultron/moses-sophie/giza.trg-src/trg-src.gizacfg
- examples/smultron/moses-sophie/model/aligned.0.src
- examples/smultron/moses-sophie/model/aligned.0.trg
- examples/smultron/moses-sophie/model/aligned.intersect
- examples/smultron/moses-sophie/model/lex.0-0.e2f
- examples/smultron/moses-sophie/model/lex.0-0.f2e
- examples/smultron/train.pl
- examples/smultron/train_align.pl
- examples/test-scripts/test_alpino.pl
- examples/test-scripts/test_bitext.pl
- examples/test-scripts/test_dublin.pl
- examples/test-scripts/test_giza.pl
- examples/test-scripts/test_moses.pl
- examples/test-scripts/test_opus.pl
- examples/test-scripts/test_sta.pl
- examples/test-scripts/test_stanford.pl
- examples/test-scripts/test_tiger.pl
- examples/test-scripts/test_treealign.pl
- examples/test-scripts/test_wpt.pl
- examples/wpt03/moses/giza.e-f/A3.final.447.gz
- examples/wpt03/moses/giza.f-e/A3.final.447.gz
- examples/wpt03/moses/model/aligned.grow-diag-final.447
- examples/wpt03/moses/model/aligned.intersect.447
- examples/wpt03/test.e
- examples/wpt03/test.f
- examples/wpt03/test.wa.nonullalign
- examples/wpt03/test.wa.nullalign
- examples/wpt03/test.wa.test
- examples/wpt03/test.wa.train