NAME
Treex::Block::W2A::EN::FixTags - Fixes tags for TectoMT purposes.
VERSION
version 2.20151102
DESCRIPTION
- sooner
-
"sooner" and "later" are always tagged as
RBR
(comparative adverb) Beware that this goes against Penn Treebank Tagging Guidelines. - "e. g." -> FW (according to PTB Guidelines)
- numbers
-
All numbers (
/^\d+$/
) get tag CD. - plural abbreviations
-
Abbreviations like MPs, CDs or DVDs are tagged as plural proper noun (
NNPS
).
OVERRIDEN METHODS
from Treex::Core::Block
- process_anode
AUTHORS
Zdeněk Žabokrtský <zabokrtsky@ufal.mff.cuni.cz>
Martin Popel <popel@ufal.mff.cuni.cz>
COPYRIGHT AND LICENSE
Copyright © 2008 - 2011 by Institute of Formal and Applied Linguistics, Charles University in Prague
This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.