NAME

bratStatistics.pl - Perl script for printing the statistics of the brat file

SYNOPSIS

bratStatistics.pl [options]

where option can be --help --man --verbose

OPTIONS AND ARGUMENTS

--input-dir <filename>, -d <filename>

Specification of the name of the directory containing the input files. Several directories can be specified.

--input-file <filename>, -i <filename>

Specification of the name of an input file (either the text or annotation file). Several input files can be specified.

--file-list <filename>, -l <filename>

<Specification of the list of files for which the statistics are needed. Each line contains one file name.

--help

print help message for using bratStatistics.pl

--man

print man page of bratStatistics.pl

--verbose

Go into the verbose mode

DESCRIPTION

This script computes the statistics of the brat annotation files (<http://brat.nlplab.org/>). For each file, statistics are: the number of words, the number of terms, the number of relations, the number of terms and relations per type The minimal and the maximal of each statistics, except the types, are also provided.

If no output files are specified, the statistics are printed on the standard output.

EXAMPLES

Print and compute the statistics for all the files in the directory examples.

bratStatistics.pl -d examples

Print and compute the statistics of all the three files.

bratStatistics.pl -i examples/taln-2012-long-001-resume.txt -i examples/taln-2012-long-002-resume.ann -i examples/taln-2012-long-003-resume.txt

Print and compute all the files indicated in examples/list.txt.

bratStatistics.pl -l examples/list.txt

SEE ALSO

http://brat.nlplab.org/

AUTHOR

Thierry Hamon, <hamon@limsi.fr>

COPYRIGHT AND LICENSE

Copyright (C) 2015 Thierry Hamon

This is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.4 or, at your option, any later version of Perl 5 you may have available.