NAME
Treex::Tutorial::Install - Installing the Treex NLP framework
VERSION
version 0.07297
TWO WAYS TO GET TREEX
You can download Treex from CPAN http://www.cpan.org or from the SVN repository. However, so far only the latter is usable, because only the core modules (Treex::Core
) were released on CPAN and to do something interesting with Treex you need also some other modules.
Downloading and Installing Treex from CPAN
You can install Treex and its dependecies from shell using the cpan
tool:
cpan Treex::Core
In future, there will be also distributions for the analysis (and synthesis) of several languages: Treex::EN
for English, Treex::DE
for German, Treex::CS
for Czech etc.
Downloading and Installing Treex from SVN
Treex is being developed in TectoMT subversion repository. For read-only access you can use:
cd ~
svn --username public co https://svn.ms.mff.cuni.cz/svn/tectomt_devel/trunk tectomt
# password is "public"
Install the needed Perl modules dependencies:
cpan Moose MooseX::Getopt MooseX::NonMoose MooseX::Params::Validate\
MooseX::SemiAffordanceAccessor Readonly Treex::PML\
File::Slurp File::HomeDir File::Path File::ShareDir\
LWP::Simple Data::Dumper Module::Reload Locale::Language\
Parse::RecDescent Cwd Scalar::Util autodie String::Util
Add Treex modules to your $PERLLIB
so perl can find them, and the treex
script to your $PATH
so you can execute it from shell. Add the following lines to your ~/.bashrc
:
export TMT_ROOT=${HOME}/tectomt/
export PATH="${TMT_ROOT%/}/treex/bin:$PATH"
export PERL5LIB="${TMT_ROOT%/}/treex/lib:${TMT_ROOT%/}/libs/other:$PERL5LIB"
export PERLLIB=$PERL5LIB
And source ~/.bashrc
or open a new terminal. Check that it works by running:
treex -h
Some Treex applications use huge pre-trained models, dictionaries etc. These resources are stored in an unversioned directory $TMT_ROOT/share
and they should be automatically downloaded when running the respective application. You should create the empty directory or symlink.
mkdir $TMT_ROOT/share
#ln -s /mnt/my_big_disk/tectomt_share $TMT_ROOT/share
Similarly, some applications need a directory for temporary files.
ln -s /tmp $TMT_ROOT/tmp
#mkdir $TMT_ROOT/tmp
Installing other tools
TrEd
For visualization of trees it is beneficial to install TrEd from http://ufal.mff.cuni.cz/~pajas/tred. Install it to the $TMT_ROOT/share/tred
directory (or symlink). Now you can check if you can run the Treex-modified version of TrEd:
ttred
XS-based perl modules
Morce is a high-quality part-of-speech tagger for English. You can install it by:
cd $TMT_ROOT/libs/packaged/Morce-English
less INSTALL
# follow the instruction in INSTALL
In the same way, you can install CzechMorpho
and Morce-Czech
(in this order).
AUTHOR
Martin Popel <popel@ufal.mff.cuni.cz>
COPYRIGHT AND LICENSE
Copyright © 2011 by Institute of Formal and Applied Linguistics, Charles University in Prague
This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
1 POD Error
The following errors were encountered while parsing the POD:
- Around line 103:
Non-ASCII character seen before =encoding in '©'. Assuming UTF-8