NAME

mm-xml2aw-xml.pl - This program converts MetaMap xml (mm-xml) formatted text into the all words xml (aw-xml) format.

SYNOPSIS

This program converts MetaMap xml (mm-xml) formatted text into the all words xml (aw-xml) format.

USAGE

perl mm-xml2aw-xml.pl SOURCE DESTINATION

SOURCE

DESTINATION

Optional Arguments:

--log DIRECTORY

Directory to contain temporary and log files. DEFAULT: log

--help

Displays the quick summary of program options.

--version

Displays the version information.

OUTPUT

All words xml format similar to the SemEval all words disambiguation task. In this format, each term assigned one or more concepts in the metamap xml file are outputed as follows:

<?xml version="1.0"?>
<!DOCTYPE corpus SYSTEM  "all-words.dtd">
<corpus lang="en">
<text id="001">
<head id="d001.s001.t001" candidates="C1280500,C2348382">effect</head>
of
the
<head id="d001.s001.t004" candidates="C0449238">duration</head>
</text>
</corpus>

There exists an addition to the regular SemEval format. The candidate tags contain each possible sense of the term assigned by metamap. These will be used as the possible senses in the umls-allwords-senserelate.pl program when using the --candidate option. Otherwise, the senses come from doing a dictionary lookup in the MRCONSO table of the UMLS.

PROGRAM REQUIREMENTS

  • Perl (version 5.8.5 or better) - http://www.perl.org

AUTHOR

Bridget T. McInnes, University of Minnesota, Twin Cities

COPYRIGHT

Copyright (c) 2007-2008,
Bridget T. McInnes, University of Minnesota, Twin Cities
bthomson at cs.umn.edu

Ted Pedersen, University of Minnesota Duluth
tpederse at d.umn.edu

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program; if not, write to

The Free Software Foundation, Inc.,
59 Temple Place - Suite 330,
Boston, MA  02111-1307, USA.