NAME

bookcleaner - prepare books for alignment and other operations

SYNOPSIS

bookcleaner [options] file*

bookcleaner [options] file.dbooks

DESCRIPTION

Prepare a textual book (or a list of books in a file with the extension "dbooks", with one book path per line) for future align operations. The following steps are done:

Step1 -- pages, headers footers

Step1 -- pages, headers footers (-p1=0 to skip this step)

Step2 -- sections

Step2 -- sections (-p2=0 to skip this step)

Step3 -- paragraphs

Step3 -- paragraphs (-p3=0 to skip this step)

Step4 -- footnotes

Step4 -- footnotes (Deactivated by default. -p4=1 to perform this step. )

Step5 -- char level cleaning

Step5 -- char level cleaning (-p5=0 to skip this step)

Commit

Commit

Options

-c  Commit at the end (removes several debug marks (_pb, etc) before creating output file

-j=1c    Just do step 1 and commit
-j=...p  Just ... and send output to STDOUT

-simplify     to do several char level simplifications:
    translate some CP1252 chars to unicode
    translate several dashes, quotes and double quotes  to ascii
    defaul=1 
    use -simplify=0 to avoid simplification

-v=34  Create temporary output files of the step 3 (file.ou3) and 4 (file-ou4)

-minhf=3 removes headers or footers if they appear more than 3 
     times (def:5)

-pipe   send output to STDOUT

-latin1

-o=FILE send output to FILE (default is original file with extension bc_out)

-dry 	Dry run (DEBUG option, makes bookcleaner  do  nothing and just output 
		the names of the files received	as input

-dir=DIR create all output files under DIR/

AUTHOR

Andre Santos

J.Joao Almeida, jj@di.uminho.pt

SEE ALSO

perl(1).

Text::Perfide::BookCleaner(3pm)

Ontology capitulos.the