NAME
PLN::PT - interface for the http://pln.pt web service
VERSION
version 0.008
SYNOPSIS
# using as a lib
my $pln = PLN::PT->new('http://api.pln.pt');
my $data = $pln->tagger($txt); # [['A','o','DA0FS0','0.675415'], ...
# using the pln-pt tool from the command line
$ echo "A Maria tem razão . " | pln-pt tagger
A o DA0FS0 0.675415
Maria maria NCFS000 1
tem ter VMIP3S0 0.999287
razão razão NCFS000 0.65
. . Fp 1
DESCRIPTION
This module implements an interface for the Natural Language Processing (NLP) web service provided by http://pln.pt.
METHODS
new
Create a new object, given the base endpoint of the web service as argument.
Once the object is created, a set of methods described below can be used to access several operations in the API. All the methods return a data structure with the corresponding result, typically a list of tokens with some extra information depending on the operation used.
tokenizer
Tokenize the text given as argument, i.e. split the text into tokens (words by default). For more information on the tokenization operation visit http://pln.pt/api.
$ echo "A Maria tem razão ." | pln-pt tokenizer
A
Maria
tem
razão
.
morph_analyzer
Perform morphological analysis of a single word, given as argument. For more information on the morphological analysis operation visit http://pln.pt/api.
$ echo cavalo | pln-pt morph
cavalo cavalo NCMS000 0.914011 cavalar VMIP1S0 0.0859891
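The output line above can be split into its individual analyses. The following is a minimal sketch, assuming the line consists of the word followed by repeated lemma, tag, probability triples, as the sample suggests; parse_morph_line is a hypothetical helper and not part of PLN::PT.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of PLN::PT): split one line of morph output
# into the word and a list of candidate analyses.
# Assumed layout: word, then (lemma, tag, probability) repeated.
sub parse_morph_line {
    my ($line) = @_;
    my ($word, @rest) = split ' ', $line;
    my @analyses;
    # Consume the remaining fields three at a time until none are left.
    while (my ($lemma, $tag, $prob) = splice(@rest, 0, 3)) {
        push @analyses, { lemma => $lemma, tag => $tag, prob => $prob };
    }
    return ($word, \@analyses);
}

my ($word, $analyses) =
    parse_morph_line('cavalo cavalo NCMS000 0.914011 cavalar VMIP1S0 0.0859891');

print "$word:\n";
printf "  %-8s %-8s %s\n", $_->{lemma}, $_->{tag}, $_->{prob} for @$analyses;
```

Each analysis is kept as a hash reference so that later code can pick, for instance, the candidate with the highest probability.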
tagger
Perform part-of-speech tagging of the tokens in the text given as argument. For more information on the tagging operation visit http://pln.pt/api.
$ echo "A Maria tem razão ." | pln-pt tagger
A o DA0FS0 0.675415
Maria maria NCFS000 1
tem ter VMIP3S0 0.999287
razão razão NCFS000 0.65
. . Fp 1
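The rows returned by tagger can be filtered with ordinary list operations. The sketch below assumes each row is an array reference of the form [form, lemma, tag, probability], as the comment in the SYNOPSIS suggests, and uses a hard-coded sample instead of a live call to the service. lemmas_with_tag_prefix is a hypothetical helper, not part of PLN::PT, and treating tags that start with "N" as nouns is an assumption about the tagset.

```perl
use strict;
use warnings;
use utf8;
binmode(STDOUT, ':encoding(UTF-8)');

# Sample data copied from the tagger output above; in real use this
# would be the value returned by $pln->tagger($txt).
my $data = [
    ['A',     'o',     'DA0FS0',  '0.675415'],
    ['Maria', 'maria', 'NCFS000', '1'],
    ['tem',   'ter',   'VMIP3S0', '0.999287'],
    ['razão', 'razão', 'NCFS000', '0.65'],
    ['.',     '.',     'Fp',      '1'],
];

# Hypothetical helper: return the lemmas (field 1) of all tokens whose
# tag (field 2) starts with the given prefix.
sub lemmas_with_tag_prefix {
    my ($tokens, $prefix) = @_;
    return map { $_->[1] } grep { index($_->[2], $prefix) == 0 } @$tokens;
}

my @nouns = lemmas_with_tag_prefix($data, 'N');
print join(' ', @nouns), "\n";    # prints "maria razão"
```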
dep_parser
Build a dependency tree for the text given as argument. For more information on the dependency tree visit http://pln.pt/api.
$ echo "A Maria tem razão ." | pln-pt dep_parser
1 A _ DET art|<artd>|F|S (...) 2 det _ _
2 Maria _ PROPN prop|F|S (...) 3 nsubj _ _
3 tem _ VERB v-fin|PR|3S|IND (...) 0 ROOT _ _
4 razão _ NOUN n|F|S (...) 3 dobj _ _
5 . _ PUNCT punc (...) 3 punct _ _
tf
Compute term frequencies for a given text.
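To illustrate the idea of term frequency, the sketch below counts raw occurrences over a list of tokens locally. It does not call the web service, and the service may tokenize, case-fold, or normalize differently; term_counts is a hypothetical helper, not part of PLN::PT.

```perl
use strict;
use warnings;
use utf8;
binmode(STDOUT, ':encoding(UTF-8)');

# Hypothetical helper (not part of PLN::PT): raw, case-insensitive
# occurrence counts for a list of tokens.
sub term_counts {
    my (@tokens) = @_;
    my %count;
    $count{ lc $_ }++ for @tokens;
    return \%count;
}

my $tf = term_counts(qw(A Maria tem razão e a Maria sabe));

# Print one "term<TAB>count" line per distinct term, sorted by term.
printf "%s\t%d\n", $_, $tf->{$_} for sort keys %$tf;
```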
stopwords
Retrieve lists of stop words.
ACKNOWLEDGEMENTS
This work is partially supported by the project “SmartEGOV: Harnessing EGOV for Smart Governance (Foundations, methods, Tools) / NORTE-01-0145-FEDER-000037”, supported by Norte Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement, through the European Regional Development Fund (ERDF).
AUTHORS
Nuno Carvalho <smash@cpan.org>
Alberto Simões <ambs@cpan.org>
COPYRIGHT AND LICENSE
This software is copyright (c) 2016 - 2017 by Nuno Carvalho.
This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.