NAME

find-compounds.pl - find compound words in a text that are specified in a list.

DESCRIPTION

See perldoc find-compounds.pl

USAGE

find-compounds.pl SourceFile CompoundWordList

INPUT

Required Arguments:

SourceFile

Source file is the original text file.

CompoundWordList

Compound word list contains the compound words. Compound words are seperated by underscore "_". Each compound word is a line.

Examples:

The original text contains "This is the new york city". In the compound word list, it has

new_york
new_york_city

The find-compounds.pl will find the longest match. After replace the compound words, the text is "This is the new_york_city".

Other Options:

--newline

Find compound words within one line boundary with this option. If run find-compounds.pl without this option, find compound words crossing lines.

Displays this message.

--help

Displays this message.

--version

Displays the version information.

AUTHOR

Ying Liu. University of Minnesota at Twin Cities. liux0395@umn.edu

COPYRIGHT

Copyright (c) 2010-2011, Ying Liu

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program; if not, write to

The Free Software Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA 02111-1307, USA.