NAME
Lingua::YaTeA::ChunkingDataSet - Perl extension for the set of chuncking data
SYNOPSIS
use Lingua::YaTeA::ChunkingDataSet;
Lingua::YaTeA::ChunkingDataSet->new($file_set);
DESCRIPTION
The module implements sets of chunking data, i.e. chunking frontiers (field ChunkingFrontiers
), chunking exceptions (field ChunkingExceptions
), cleaning frontiers (CleaningFrontiers
), and cleaning exceptions (CleaningExceptions
). Chunking data are stored in subsets (Lingua::YaTeA::ChunckingSubset
).
METHODS
new()
new($file_set);
The method creates a set of chunking data and the related subset. The data stored in the directory c<$file_set> (an object Lingua::FileSet
) are loaded in the subsets.
loadData()
loadData($file_set);
The method loads the chuncking data stored in the directory c<$file_set> (an object Lingua::FileSet
) in the related subsets: chunking frontiers (field ChunkingFrontiers
), chunking exceptions (field ChunkingExceptions
), cleaning frontiers (CleaningFrontiers
), and cleaning exceptions (CleaningExceptions
).
getSubset()
getSubset($name);
The method returns the subset corresponding to the field $name
(i.e. ChunkingFrontiers, ChunkingExceptions, CleaningFrontiers, CleaningExceptions).
existData()
existData($set,$type,$data);
The methods checks if the data $data
exists in the field $type
in the subset $set
. It returns 1 if it exists, otherwise 0.
SEE ALSO
Sophie Aubin and Thierry Hamon. Improving Term Extraction with Terminological Resources. In Advances in Natural Language Processing (5th International Conference on NLP, FinTAL 2006). pages 380-387. Tapio Salakoski, Filip Ginter, Sampo Pyysalo, Tapio Pahikkala (Eds). August 2006. LNAI 4139.
AUTHOR
Thierry Hamon <thierry.hamon@univ-paris13.fr> and Sophie Aubin <sophie.aubin@lipn.univ-paris13.fr>
COPYRIGHT AND LICENSE
Copyright (C) 2005 by Thierry Hamon and Sophie Aubin
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.6 or, at your option, any later version of Perl 5 you may have available.