NAME
bookcleaner - prepare books for alignment and other operations
SYNOPSIS
bookcleaner [options] file*
bookcleaner [options] file.dbooks
DESCRIPTION
Prepare a textual book (or a list of books in a file with the extension "dbooks", with one book path per line) for future align operations. The following steps are done:
Step1 -- pages, headers footers
Step1 -- pages, headers footers (-p1=0 to skip this step)
Step2 -- sections
Step2 -- sections (-p2=0 to skip this step)
Step3 -- paragraphs
Step3 -- paragraphs (-p3=0 to skip this step)
Step4 -- footnotes
Step4 -- footnotes (Deactivated by default. -p4=1 to perform this step. )
Step5 -- char level cleaning
Step5 -- char level cleaning (-p5=0 to skip this step)
Commit
Commit
Options
-c Commit at the end (removes several debug marks (_pb, etc) before creating output file
-j=1c Just do step 1 and commit
-j=...p Just ... and send output to STDOUT
-simplify to do several char level simplifications:
translate some CP1252 chars to unicode
translate several dashes, quotes and double quotes to ascii
defaul=1
use -simplify=0 to avoid simplification
-v=34 Create temporary output files of the step 3 (file.ou3) and 4 (file-ou4)
-minhf=3 removes headers or footers if they appear more than 3
times (def:5)
-pipe send output to STDOUT
-latin1
-o=FILE send output to FILE (default is original file with extension bc_out)
-dry Dry run (DEBUG option, makes bookcleaner do nothing and just output
the names of the files received as input
-dir=DIR create all output files under DIR/
AUTHOR
Andre Santos
J.Joao Almeida, jj@di.uminho.pt
SEE ALSO
perl(1).
Text::Perfide::BookCleaner(3pm)
Ontology capitulos.the