NAME

similarity.pl - command line interface to WordNet::Similarity

SYNOPSIS

similarity.pl [--type=TYPE [--config=CONFIGFILE] [--allsense] [--offsets] [--trace] [--wnpath=PATH] [--simpath=SIMPATH] {--interact | --file=FILENAME | WORD1 WORD2} | --help | --version]

DESCRIPTION

This program is a command line interface to the WordNet::Similarity package, which is an implementation of semantic relatedness measures between words. This project began in an effort to replicate the measures described in Budanitsky and Hirst (1995) "Semantic distance in WordNet: An Experimental, application-oriented evaluation of five measures", and has since grown to include additional measures. The measures described and implemented are as follows (those included in Budanitksy and Hirst's work are denoted with a *):

(1) Leacock and Chodorow (1998) *
(2) Jiang and Conrath (1997) *
(3) Resnik (1995) *
(4) Lin (1998) *
(5) Hirst St-Onge (1998) *
(6) Wu and Palmer (1994)
(7) Extended Gloss Overlaps (Banerjee & Pedersen, 2003)
(8) Edge Counting 
(9) Gloss Vector (Patwardhan, 2003)
(10) Random

OPTIONS

--type=type the type of similarity measure. Valid values are

WordNet::Similarity::edge - simple edge counting
WordNet::Similarity::hso - Hirst & St-Onge (1998)
WordNet::Similarity::lch - Leacock & Chodorow (1998)
WordNet::Similarity::lesk - Extended Gloss Overlaps (Pedersen & Banerjee 2003)
WordNet::Similarity::lin - Lin (1998)
WordNet::Similarity::jcn - Jiang & Conrath (1997)
WordNet::Similarity::random - returns random numbers
WordNet::Similarity::res - Resnik (1995)
WordNet::Similarity::vector - Gloss Vector (Patwardhan 2003)
WordNet::Similarity::wup - Wu & Palmer (1994)

--config=configfile the path to a module-specific configuration file

--allsenses Show the relatedness between every sense of the two input words

--offsets show all synsets as offsets and a part-of-speech letter

--trace switches on "Trace" mode. Output goes to stdout.

--interace starts the interactive mode (experimental)

--file=filename input words are read from filename. This file must contain a pair of words on each line. Comments are allowed: anything following // on a line is ignored.

--wnpath=path looks for WordNet in path. Usual values are /usr/local/WordNet/2.1/dict and C:\WordNet\2.1\dict.

--simpath=path look the relatedness module in path. This is useful if the module is locally installed.

--help show a detailed help message

--version show version information

AUTHORS

Ted Pedersen, University of Minnesota Duluth
tpederse at d.umn.edu

Siddharth Patwardhan, University of Utah, Salt Lake City
sidd at cs.utah.edu

Satanjeev Banerjee, Carnegie Mellon University, Pittsburgh
banerjee+ at cs.cmu.edu

Jason Michelizzi, University of Minnesota Duluth
mich0212 at d.umn.edu

BUGS

SEE ALSO

perl(1)

WordNet::Similarity(3)

http://wordnet.princeton.edu

http://wn-similarity.sourceforge.net

http://groups.yahoo.com/group/wn-similarity

COPYRIGHT

Copyright (c) 2005, Ted Pedersen, Siddharth Patwardhan, Satanjeev Banerjee and Jason Michelizzi

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA 02111-1307, USA.