NAME
Lingua::YaTeA::WordOccurrence - Perl extension for managing word occurrence
SYNOPSIS
use Lingua::YaTeA::WordOccurrence;
Lingua::YaTeA::WordOccurrence->new($form);
DESCRIPTION
The module implements a basic representation of a word occurrence in the input corpus. It is used in the module Lingua::YaTeA::AnnotationMark.
METHODS
new()
new($form);
The method creates a new object for managing a word occurrence with the inflected form $form.
setForm()
setForm($form);
The method sets the inflected form ($form) of the word occurrence.
getForm()
getForm();
The method returns the inflected form of the word occurrence.
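As a minimal sketch of the constructor and the two accessors described above, assuming the Lingua::YaTeA distribution is installed (the word forms 'cells' and 'cell' are purely illustrative values, not taken from the module):

```perl
use strict;
use warnings;
use Lingua::YaTeA::WordOccurrence;

# Create a word occurrence from an inflected form (illustrative value).
my $word = Lingua::YaTeA::WordOccurrence->new('cells');
print $word->getForm(), "\n";

# Replace the inflected form, then read it back.
$word->setForm('cell');
print $word->getForm(), "\n";
```

In practice the object is usually created by other YaTeA modules rather than by hand; the sketch only illustrates how the documented methods fit together.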
isChunkEnd()
isChunkEnd($action,$split_after,$sentence_boundary,$document_boundary,$chunking_data);
The method indicates whether the word occurrence is the end of a chunk (return value 1) or not (return value 0). The decision takes into account the sentence and document boundaries ($sentence_boundary and $document_boundary), the chunking frontiers (according to $chunking_data), and the presence of a forbidden frontier with its related action ($action, whose values are delete and split; $split_after indicates the rank of the word after which the chunk is split).
print()
print($fh);
The method prints to the file handle $fh the information related to the word occurrence (i.e. its inflected form).
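The following sketch shows one way to call print() with a file handle, assuming the Lingua::YaTeA distribution is installed. It writes to an in-memory handle so that the result can be inspected; the form 'cells' and the variable names are illustrative, and the exact layout of the printed text is left to the module.

```perl
use strict;
use warnings;
use Lingua::YaTeA::WordOccurrence;

my $word = Lingua::YaTeA::WordOccurrence->new('cells');

# Open an in-memory file handle so the output can be inspected afterwards.
my $buffer = '';
open(my $fh, '>', \$buffer) or die "Cannot open in-memory handle: $!";
$word->print($fh);
close($fh);

# Show whatever the module wrote, without assuming its exact layout.
print $buffer, "\n";
```

Any writable handle can be passed in the same way, for example STDERR or a handle opened on a log file.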
SEE ALSO
Sophie Aubin and Thierry Hamon. Improving Term Extraction with Terminological Resources. In Advances in Natural Language Processing (5th International Conference on NLP, FinTAL 2006). pages 380-387. Tapio Salakoski, Filip Ginter, Sampo Pyysalo, Tapio Pahikkala (Eds). August 2006. LNAI 4139.
AUTHOR
Thierry Hamon <thierry.hamon@univ-paris13.fr> and Sophie Aubin <sophie.aubin@lipn.univ-paris13.fr>
COPYRIGHT AND LICENSE
Copyright (C) 2005 by Thierry Hamon and Sophie Aubin
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.6 or, at your option, any later version of Perl 5 you may have available.