NAME

tag-loc-ids.pl - Compute seq id organelle tags based on sequence identity

VERSION

version 0.210170

USAGE

tag-loc-ids.pl <infiles> [optional arguments]

REQUIRED ARGUMENTS

<infiles>

Path to input FASTA files [repeatable argument].

OPTIONAL ARGUMENTS

--in[-strip]=<str>

Substring(s) to strip from infile basenames before attempting to derive other infile (e.g., IDM files) and outfile names [default: none].

--percent-id=<n>

Min percentage identity to consider a hit.

--fasta[-suffix]=<suffix>

Suffix to append to infile basenames for deriving outfile names [default: none]. When not specified, outfile names are taken from infiles but original infiles are preserved by being appended a .bak suffix.

--pruning

If activated produce a list of sequence IDs for pruning of alignment ('.idl' file). By default, output is a '.idm' file for renaming sequences in original fasta/ali file.

--loc=<loc>

for Euclid: loc.type: str

Cellular compartment you want to work with.

--count=<str>

Prot count per plastome file.

for Euclid: str.type: str

--pid_only
--version
--usage
--help
--man

Print the usual program information

AUTHOR

Denis BAURAIN <denis.baurain@uliege.be>

CONTRIBUTOR

Mick VAN VLIERBERGHE <mvanvlierberghe@doct.uliege.be>

COPYRIGHT AND LICENSE

This software is copyright (c) 2021 by University of Liege / Unit of Eukaryotic Phylogenomics / Denis BAURAIN.

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.