NAME

ompa-pa.pl - Extract seqs from BLAST/HMMER reports interactively or in batch mode

VERSION

version 0.201810

USAGE

ompa-pa.pl <infiles> --database=<file> [optional arguments]

REQUIRED ARGUMENTS

<infiles>

Path to input BLAST/HMMER report files [repeatable argument].

--database=<file>

Path to the sequence database used to generate the reports. For efficiency, this argument must always be the basename of a BLAST database, even when the reports were obtained using hmmsearch on a FASTA file.

To build such a database, use one of the following commands:

$ makeblastdb -in database.fasta -out database -dbtype prot -parse_seqids
$ makeblastdb -in database.fasta -out database -dbtype nucl -parse_seqids
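
Before launching a run, it can be useful to check that the value passed to --database really is a BLAST database basename. The sketch below is not part of ompa-pa.pl; the function name is_blast_db is hypothetical, and the check assumes that makeblastdb wrote either an index file (.pin for proteins, .nin for nucleotides) or, for a multi-volume database, an alias file (.pal or .nal) next to the basename.

```shell
# Return success if "$1" looks like the basename of a BLAST database.
# Assumption: makeblastdb produced an index file (.pin / .nin) or,
# for multi-volume databases, an alias file (.pal / .nal).
is_blast_db() {
    for ext in pin pal nin nal; do
        [ -e "$1.$ext" ] && return 0
    done
    return 1
}

# "database" is the example basename used in the commands above.
if is_blast_db database; then
    echo "database: BLAST database found"
else
    echo "database: no BLAST database with this basename" >&2
fi
```

Note that the basename carries no extension: pass database, not database.fasta or database.pin.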

OPTIONAL ARGUMENTS

--report-type=<str>

Type of the reports used as infiles [default: blastxml]. Currently, the following types are available:

- blastxml (XML BLAST reports generated with -outfmt 5)
- hmmertbl (tabular HMMER reports generated with -domtblout)

--colorize=<scheme>

When specified, sequence points are colored according to their taxon using the specified CLS file. This requires taxonomic annotation to be enabled, and thus a local mirror of the NCBI Taxonomy database (see --taxdir).

--taxdir=<dir>

Path to local mirror of the NCBI Taxonomy database.

To build such a directory, use the following command:

$ setup-taxdir.pl --taxdir=taxdir

--min-cov=<n>

Minimum BLAST query or HMMER model coverage for selected hits [default: 0.7].
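
As an illustration of how such a threshold behaves, the sketch below computes a coverage value for a single made-up hit and compares it to the default. It assumes that coverage is the aligned span of the query (or model) divided by its full length; the exact formula used by ompa-pa.pl is not documented here and may differ, for instance when a hit has several aligned regions. All variable names and numbers are hypothetical.

```shell
# Hypothetical hit: a 250-residue query aligned from position 21 to 220.
qlen=250
qstart=21
qend=220
min_cov=0.7   # default value of --min-cov

# Assumed definition: coverage = aligned query span / query length.
cov=$(awk -v s="$qstart" -v e="$qend" -v l="$qlen" \
    'BEGIN { printf "%.2f", (e - s + 1) / l }')

# Compare as numbers in awk, since POSIX sh has no floating-point tests.
if awk -v c="$cov" -v m="$min_cov" 'BEGIN { exit !(c >= m) }'; then
    verdict=kept
else
    verdict=rejected
fi
echo "coverage=$cov verdict=$verdict"
```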

--max-copy=<n>

Maximum gene copy number per organism for selected hits [default: 3].

--extract-seqs

Sequence extraction switch [default: no]. When specified, selected sequences are stored into a FASTA file using the same basename as other output files.

--extract-tax

Taxonomy extraction switch [default: no]. When specified, NCBI taxa of selected sequences are stored into a file using the same basename as other output files.

--restore-last-params

Batch-mode switch [default: no]. When specified, parameters are restored from the last saved JSON file.

--restore-params-from=<file>

Batch-mode switch [default: no]. When specified, parameters are restored from the user-specified JSON file.

--print-plots

When specified, plots are printed in PDF format [default: no].

--version
--usage
--help
--man

Print the usual program information.

AUTHOR

Denis BAURAIN <denis.baurain@uliege.be>

CONTRIBUTOR

Amandine BERTRAND <amandine.bertrand@doct.uliege.be>

COPYRIGHT AND LICENSE

This software is copyright (c) 2013 by University of Liege / Unit of Eukaryotic Phylogenomics / Denis BAURAIN.

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.