NAME
inst-tax-filter.pl - Apply a taxonomic filter to a (UniProt) FASTA database (optimized)
VERSION
version 0.242020
USAGE
inst-tax-filter.pl <infiles> --filter=<file> --taxdir=<dir>
[optional arguments]
REQUIRED ARGUMENTS
- <infiles>
-
Path to input FASTA files [repeatable argument].
- --filter=<file>
-
Path to an IDL file specifying the taxonomic filter to be applied.
In a tax_filter, wanted taxa are to be prefixed by a '+' symbol, whereas unwanted taxa are to be prefixed by a '-' symbol. Wanted and unwanted taxa are linked by logical ORs.
An example IDL file follows:
-Viridiplantae -Opisthokonta +Ascomycota +Oomycota
- --taxdir=<dir>
-
Path to local mirror of the NCBI Taxonomy database.
OPTIONAL ARGUMENTS
- --id-regex=<str>
-
Regular expression for capturing org from seq id [default: none].
The argument value can be either a predefined regex or a custom regex given on the command line (do not forget to escape the special chars then). The following predefined regexes are available (assuming a leading '>'):
- :UNI (UniProt OS= field)
- --version
- --usage
- --help
- --man
-
Print the usual program information
AUTHOR
Denis BAURAIN <denis.baurain@uliege.be>
CONTRIBUTOR
Valerian LUPO <valerian.lupo@doct.uliege.be>
COPYRIGHT AND LICENSE
This software is copyright (c) 2013 by University of Liege / Unit of Eukaryotic Phylogenomics / Denis BAURAIN.
This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.