NAME
Text::Perfide::BookSync - Synchronize books in plain text format.
VERSION
Version 0.01_03
SYNOPSIS
Text::Perfide::BookSync performs a section-level structural alignment of books in plain text format. The books must first be annotated with Text::Perfide::BookCleaner.
EXPORT
SUBROUTINES/METHODS
htmlmatrix
Generates an HTML file containing a matrix showing the matches between sections of two books.
marksync
Given two files FILEL and FILER, creates new versions of these files (FILEL.sync and FILER.sync) with synchronization tags <sync id="x"> marking the points where the texts synchronize.
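As an illustration of the tagging step, here is a minimal sketch that inserts numbered <sync id="x"> tags into a string. The function name insert_sync_tags and the idea of passing synchronization points as character offsets are assumptions made for this example; they are not taken from the module's actual interface.

```perl
use strict;
use warnings;

# Hypothetical helper, not part of Text::Perfide::BookSync.
# Inserts <sync id="N"> at each given character offset, numbering the
# tags 1..n in ascending offset order.
sub insert_sync_tags {
    my ($text, @offsets) = @_;
    my $id = scalar @offsets;
    # Work from the last offset to the first so that earlier offsets
    # remain valid while the string grows.
    for my $off (sort { $b <=> $a } @offsets) {
        substr($text, $off, 0, qq{<sync id="$id">});
        $id--;
    }
    return $text;
}
```

Applying the same numbering to both files would give each synchronization point the same id on the left and on the right.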
splitchunks
Given two files FILEL and FILER, splits them at their synchronization points and stores each chunk in its own file, so that each FILEL.cXX matches FILER.cXX.
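The splitting step can be sketched as follows, assuming the text already contains <sync id="x"> tags. The helper name split_at_sync and the returned hash layout are hypothetical. The sketch works on an in-memory string and leaves out the writing of the .cXX files.

```perl
use strict;
use warnings;

# Hypothetical helper, not part of Text::Perfide::BookSync.
# Splits a string at <sync id="..."> tags and returns a list of
# hashes { id => ..., text => ... }. Text before the first tag, if
# any, is returned with an undefined id.
sub split_at_sync {
    my ($text) = @_;
    my @chunks;
    # The capture group makes split() return the ids between the pieces.
    my @parts = split /<sync id="([^"]*)">/, $text;
    my $lead = shift @parts;
    push @chunks, { id => undef, text => $lead }
        if defined $lead && length $lead;
    while (@parts) {
        my ($id, $body) = splice @parts, 0, 2;
        push @chunks, { id => $id, text => defined $body ? $body : '' };
    }
    return @chunks;
}
```

Running this on both synchronized files and pairing chunks by id would yield the matching pieces that the description above refers to.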
calchunks
Calculates chunks for a given pair of files. A chunk is a set of consecutive sections, grouped so that it matches the corresponding chunk in the other file.
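To make the notion of a chunk concrete, the sketch below groups consecutive sections between matched section pairs. It assumes the matches are already known and given as [left_index, right_index] pairs in ascending order, with the first pair at [0, 0]. The function name chunks_from_anchors and this input format are assumptions for illustration only and do not describe how calchunks finds its matches.

```perl
use strict;
use warnings;

# Hypothetical helper, not part of Text::Perfide::BookSync.
# $n_left / $n_right: number of sections in each book.
# @anchors: matched section pairs [left_idx, right_idx], ascending.
# Each chunk runs from one anchor up to (not including) the next.
sub chunks_from_anchors {
    my ($n_left, $n_right, @anchors) = @_;
    my @chunks;
    for my $i (0 .. $#anchors) {
        my ($l, $r) = @{ $anchors[$i] };
        # The last chunk extends to the end of each book.
        my ($next_l, $next_r) = $i < $#anchors
            ? @{ $anchors[$i + 1] }
            : ($n_left, $n_right);
        push @chunks, {
            left  => [ $l .. $next_l - 1 ],
            right => [ $r .. $next_r - 1 ],
        };
    }
    return @chunks;
}
```

With six sections on the left, five on the right, and matches at [0,0], [2,3] and [5,4], this produces three chunks, the second of which pairs left sections 2 to 4 with right section 3.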
populate
From a given file in which sections have been delimited with Text::Perfide::BookCleaner, creates and returns a list containing information about the sections of this file: id, start offset and end offset.
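The kind of list described above can be sketched as follows. The marker format is an assumption: the example treats any line beginning with "_sec" as the start of a section, which may differ from the annotation that Text::Perfide::BookCleaner actually writes. The function name section_offsets is likewise hypothetical, and offsets are counted in characters of the in-memory string.

```perl
use strict;
use warnings;

# Hypothetical helper, not part of Text::Perfide::BookSync.
# Assumes a section starts at each line beginning with "_sec".
# Returns a list of hashes { id, start, end }, where end is the offset
# at which the next section starts (or the length of the text).
sub section_offsets {
    my ($text) = @_;
    my @secs;
    while ($text =~ /^_sec[^\n]*\n?/mg) {
        my $start = $-[0];                 # offset of the marker line
        $secs[-1]{end} = $start if @secs;  # close the previous section
        push @secs, {
            id    => scalar(@secs),
            start => $start,
            end   => length($text),
        };
    }
    return @secs;
}
```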
moreinfosecs
Calculates metrics on each pair of sections (length in words, ...)
moreinfochunks
Calculates metrics on each pair of chunks (length in words, ...)
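A length-in-words metric for a pair of texts (sections or chunks) could look like the sketch below. The function names and the extra ratio field are assumptions added for illustration. Only the length in words is named in the descriptions above.

```perl
use strict;
use warnings;

# Hypothetical helpers, not part of Text::Perfide::BookSync.

# Counts whitespace-separated words.
sub word_count {
    my ($text) = @_;
    my @words = split ' ', $text;
    return scalar @words;
}

# Returns word counts for both sides plus the ratio of the shorter to
# the longer side (1 when both sides are empty).
sub pair_metrics {
    my ($left, $right) = @_;
    my ($wl, $wr) = (word_count($left), word_count($right));
    my ($min, $max) = $wl < $wr ? ($wl, $wr) : ($wr, $wl);
    return {
        words_left  => $wl,
        words_right => $wr,
        ratio       => $max ? $min / $max : 1,
    };
}
```

A ratio far below 1 for a pair would suggest that the two sides differ a lot in length, which may be a sign of a doubtful match.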
load_localrc
AUTHOR
Andre Santos, <andrefs at cpan.org>
BUGS
Please report any bugs or feature requests to bug-text-perfide-booksync at rt.cpan.org, or through the web interface at http://rt.cpan.org/NoAuth/ReportBug.html?Queue=Text-Perfide-BookSync. I will be notified, and then you'll automatically be notified of progress on your bug as I make changes.
SUPPORT
You can find documentation for this module with the perldoc command.
    perldoc Text::Perfide::BookSync
You can also look for information at:
RT: CPAN's request tracker (report bugs here)
http://rt.cpan.org/NoAuth/Bugs.html?Dist=Text-Perfide-BookSync
AnnoCPAN: Annotated CPAN documentation
CPAN Ratings
Search CPAN
ACKNOWLEDGEMENTS
LICENSE AND COPYRIGHT
Copyright 2011 Andre Santos.
This program is free software; you can redistribute it and/or modify it under the terms of either: the GNU General Public License as published by the Free Software Foundation; or the Artistic License.
See http://dev.perl.org/licenses/ for more information.