NAME
Lingua::Stem::UniNE::CS - Czech stemmer
VERSION
This document describes Lingua::Stem::UniNE::CS v0.04.
SYNOPSIS
use Lingua::Stem::UniNE::CS qw( stem_cs );
$stem = stem_cs($word);
# alternate syntax
$stem = Lingua::Stem::UniNE::CS::stem($word);
DESCRIPTION
A light stemmer for the Czech language. It removes case endings from nouns and adjectives, removes possessive adjective endings from names, and normalizes the consonant changes caused by palatalization.
This module provides the stem and stem_cs functions, which are synonymous and can optionally be exported. They accept a single word and return a single stem.
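Because each call handles exactly one word, stemming several words means calling the function once per word. The following is a minimal sketch of that pattern, not part of the module itself: the sample words are arbitrary illustrations, and the use of utf8 and an encoded STDOUT assumes the script is saved as UTF-8 and that words are passed as decoded character strings.

```perl
use strict;
use warnings;
use utf8;    # assumption: this source file is saved as UTF-8

use Lingua::Stem::UniNE::CS qw( stem_cs );

# Encode output so Czech diacritics print correctly.
binmode STDOUT, ':encoding(UTF-8)';

# Hypothetical sample input; any list of Czech words would do.
my @words = qw( hradem hradu ženy ženou );

# stem_cs takes one word and returns one stem, so map it over the list.
my @stems = map { stem_cs($_) } @words;

# Print each word next to the stem returned for it.
printf "%s -> %s\n", $words[$_], $stems[$_] for 0 .. $#words;
```

The fully qualified Lingua::Stem::UniNE::CS::stem can be swapped in for stem_cs when importing a name into the current package is not wanted.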
SEE ALSO
Lingua::Stem::UniNE provides a stemming object with access to all of the implemented University of Neuchâtel stemmers including this one. It has additional features like stemming lists of words.
Lingua::Stem::Any provides a unified interface to any stemmer on CPAN, including this one, as well as additional features like normalization, casefolding, and in-place stemming.
This stemming algorithm was defined in "Indexing and stemming approaches for the Czech language" by Ljiljana Dolamic and Jacques Savoy and originally implemented by Ljiljana Dolamic in Java.
A Czech stemmer for Snowball by Jimmy O’Regan is available on the Snowball site, but it is not part of the official distribution and is therefore not included in Lingua::Stem::Snowball.
ACKNOWLEDGEMENTS
Ljiljana Dolamic and Jacques Savoy of the University of Neuchâtel authored the original stemming algorithm that was implemented in this module.
AUTHOR
Nick Patch <patch@cpan.org>
COPYRIGHT AND LICENSE
© 2012–2013 Nick Patch
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.