NAME
fu-uniq - Dereplicate sequences
VERSION
version 1.4.7
SYNOPSIS
fu-uniq [options] input.fa > uniq.fa
PARAMETERS
--k|keepname
-
Use first sequence name as cluster name(default ON) =item
--p|prefix
[X]Sequence prefix (default 'seq') =item
--s|separator
[X]Prefix and counter separator (default '.') =item
--m|min-size
[N]Print only sequences found at least N times (default '0') =item
--size-as-comment
Add size as comment, not as part of sequence name (default OFF)
General
--help
-
This help =item
--version
Print version and exit =item
--citation
Print citation for seqfu =item
--quiet
No screen output (default OFF) =item
--debug
Debug mode: keep all temporary files (default OFF)
Common seqfu options
--w|line-width
[N]-
FASTA line size (0 for unlimited) (default '80') =item
--strip
Strip comments =item
--fasta
Force FASTA output =item
--fastq
Force FASTQ output =item
--rc
Print reverse complementary =item
--q|qual
[n.n]Default quality for FASTQ files (default '32') =item
--upper
Convert sequence to uppercase
MODERN ALTERNATIVE
This suite of tools has been superseded by SeqFu, a compiled program providing faster and safer tools for sequence analysis. This suite is maintained for the higher portability of Perl scripts under certain circumstances.
SeqFu is available at https://github.com/telatin/seqfu2, and can be installed with BioConda conda install -c bioconda seqfu
CITING
Telatin A, Fariselli P, Birolo G. SeqFu: A Suite of Utilities for the Robust and Reproducible Manipulation of Sequence Files. Bioengineering 2021, 8, 59. https://doi.org/10.3390/bioengineering8050059
AUTHOR
Andrea Telatin <andrea@telatin.com>
COPYRIGHT AND LICENSE
This software is Copyright (c) 2018-2022 by Andrea Telatin.
This is free software, licensed under:
The MIT (X11) License