NAME
Lingua::YaTeA::Island - Perl extension for island of reliability
SYNOPSIS
use Lingua::YaTeA::Island;
Lingua::YaTeA::Island->new($index,$type,$source);
DESCRIPTION
This module reprensents the island of reliability and provided related methods for manipulating if. An island of reliability is a subsequence (contiguous or not) of a Maximal Noun Phrase (MNP) that corresponds to a shorter term candidate that was parsed during the first step of the parsing process.
An island is defined with a list of parsed phrase (i.e. the sequence of Part-of-Speech tags) corresponding to the current island (field SOURCE
), the index set for the parsed phrase corresponding to the current island (field INDEX_SET
), the origin of the island (field TYPE
; value endogenous
if issued from the parsing of the current text, exogenous
if issued fom an input resource or previous text parsing).
An identifier (recorded in the field ID
is associated to the isalnd. the information that the island is used in the parsing of a wider parsed phrase or island, is recorded in the field INTEGRATED
(the default value is 0).
METHODS
new()
new($index,$type,$source);
The method defined a new island. $source
is the list of parsed phrase (i.e. the concatenation of Part-of-Speech tags or the key of the pharses) corresponding to the island. $index
is the index set for the parsed phrase corresponding to the current island. $type
is the origin of the island (value endogenous
if issued from the parsing of the current text, exogenous
if issued fom an input resource or previous text parsing).
getIndexSet()
getIndexSet();
The method returns the index set for the parsed phrase corresponding to the current island.
getType()
getType();
The method returns the origin of the island (endogenous
or exogenous
).
getParsingMethod()
getParsingMethod();
The method return the parsing methods associated to the parsed phrase corresponding to the island of reliability.
getIF()
getIF();
The method returns the inflected form of the parsed phrase corresponding to the island of reliability.
getSource()
getSource();
The method return the parsed phrase (i.e. the sequence of Part-of-Speech tags) corresponding to the island of reliability.
getID()
getID();
The method returns the identifier of the island.
importNodeSets()
importNodeSets();
This method returns a copu of the node sets corresponding to the island. The methods also updates the index set of the island.
gapSize()
gapSize();
The method returns the number of words into the word sequence delimited by the island island but not appearing in the island.
print()
print($fh);
The pethod prints the island into the stream $fh
.
SEE ALSO
Sophie Aubin and Thierry Hamon. Improving Term Extraction with Terminological Resources. In Advances in Natural Language Processing (5th International Conference on NLP, FinTAL 2006). pages 380-387. Tapio Salakoski, Filip Ginter, Sampo Pyysalo, Tapio Pahikkala (Eds). August 2006. LNAI 4139.
AUTHOR
Thierry Hamon <thierry.hamon@univ-paris13.fr> and Sophie Aubin <sophie.aubin@lipn.univ-paris13.fr>
COPYRIGHT AND LICENSE
Copyright (C) 2005 by Thierry Hamon and Sophie Aubin
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.6 or, at your option, any later version of Perl 5 you may have available.