NAME
Text::TEI::Collate::Lang - base class for collation language-specific extensions
DESCRIPTION
Text::TEI::Collate::Lang is the base package for any language extension (e.g. Text::TEI::Collate::Lang::Latin) to be used by Text::TEI::Collate. The base package provides three subroutines as default; any implementation should re-implement one or more of these functions, and can use the ones defined here otherwise. This would be a base class to subclass if we ever had a reason to instantiate it.
SUBROUTINES
distance
This is a rudimentary, and hopefully pretty quick, word distance function. It counts the occurrence of each letter in a word, and returns the sum of lettercount differences between the two passed words.
canonizer
This is essentially just the lc() builtin function.
comparator
This is a function that replaces all characters with their base character after an NFKD (Normalization Form Compatibility Decomposition) operation.
AUTHOR
Tara L Andrews <aurum@cpan.org>
1 POD Error
The following errors were encountered while parsing the POD:
- Around line 37:
Non-ASCII character seen before =encoding in ''αι̣τια̣ν̣','. Assuming CP1252