NAME
bratStatistics.pl - Perl script for printing statistics on brat annotation files
SYNOPSIS
bratStatistics.pl [options]
where the options include --help, --man and --verbose
OPTIONS AND ARGUMENTS
- --input-dir <filename>, -d <filename>
-
Specification of the name of the directory containing the input files. Several directories can be specified.
- --input-file <filename>, -i <filename>
-
Specification of the name of an input file (either the text or annotation file). Several input files can be specified.
- --file-list <filename>, -l <filename>
-
Specification of the list of files for which the statistics are needed. Each line contains one file name.
- --help
-
Print the help message for bratStatistics.pl.
- --man
-
Print the man page of bratStatistics.pl.
- --verbose
-
Switch to verbose mode.
DESCRIPTION
This script computes statistics on brat annotation files (<http://brat.nlplab.org/>). For each file, the statistics are: the number of words, the number of terms, the number of relations, and the number of terms and relations per type. The minimum and the maximum of each statistic, except the per-type counts, are also provided.
If no output files are specified, the statistics are printed on the standard output.
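To make the counted quantities concrete, here is a minimal shell sketch of comparable per-file counts. It is not the implementation used by bratStatistics.pl. It assumes the brat standoff convention in which lines whose identifier starts with T are text-bound annotations (taken here to be the "terms") and lines whose identifier starts with R are relations, with the type as the first token of the second tab-separated field. The function name count_ann is hypothetical.

```shell
# count_ann FILE: illustrative counts for one brat .ann file.
# Assumption: T lines are terms, R lines are relations, and the
# type is the first word of the second tab-separated field.
count_ann() {
    awk -F'\t' '
        $1 ~ /^T/ { terms++;     split($2, a, " "); ttype[a[1]]++ }
        $1 ~ /^R/ { relations++; split($2, a, " "); rtype[a[1]]++ }
        END {
            printf "terms: %d\n", terms
            printf "relations: %d\n", relations
            for (t in ttype) printf "term type %s: %d\n", t, ttype[t]
            for (r in rtype) printf "relation type %s: %d\n", r, rtype[r]
        }' "$1"
}
```

The number of words would come from the matching text file rather than from the annotation file, for instance with wc -w.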
EXAMPLES
Compute and print the statistics for all the files in the directory examples.
bratStatistics.pl -d examples
Compute and print the statistics for the three given files.
bratStatistics.pl -i examples/taln-2012-long-001-resume.txt -i examples/taln-2012-long-002-resume.ann -i examples/taln-2012-long-003-resume.txt
Compute and print the statistics for all the files listed in examples/list.txt.
bratStatistics.pl -l examples/list.txt
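The list file given to -l is a plain text file with one file name per line, so it can be generated instead of written by hand. The following is a minimal sketch, assuming that annotation files carry the .ann extension and that listing only the annotation files is sufficient (the -i option accepts either the text or the annotation file). The function name make_file_list and the list name my-list.txt are hypothetical.

```shell
# make_file_list DIR LIST: write the path of every .ann file found
# under DIR to LIST, one file name per line, in sorted order.
make_file_list() {
    find "$1" -type f -name '*.ann' | sort > "$2"
}
```

Calling make_file_list examples my-list.txt and then bratStatistics.pl -l my-list.txt would be one way to process a whole directory tree through a list file.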
SEE ALSO
http://brat.nlplab.org/
AUTHOR
Thierry Hamon, <hamon@limsi.fr>
COPYRIGHT AND LICENSE
Copyright (C) 2015 Thierry Hamon
This is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.4 or, at your option, any later version of Perl 5 you may have available.