sample_rule
Rule to find files/samples
maxdepth
Like find -max_depth
sample_glob
Alternately, instead of sample_rule use sample_glob to find files. Glob does not differentiate between files and directories.
find_sample_bydir
#Previous find_by_dir
Use this option when you sample names are by directory The default is to find samples by filename
/SAMPLE1
SAMPLE1_r1.fastq.gz
SAMPLE1_r2.fastq.gz
/SAMPLE2
SAMPLE2_r1.fastq.gz
SAMPLE2_r2.fastq.gz
by_sample_outdir
No change - previously by sample outdir
Preface outdir with sample
Instead of
outdir/
rule1
rule2
outdir/
Sample_01/
rule1
rule2
samples
This is our actual list of samples
resample
Boolean value get new samples based on indir/sample_rule or no
Samples are found at the beginning of the workflow, based on the global indir variable and the file_find.
Chances are you don't want to set resample to true. These files probably won't exist outside of the indirectory until the pipeline is run.
One example of doing so, shown in the gemini.yml in the examples directory, is looking for uncompressed files, .vcf extension, compressing them, and then resampling based on the .vcf.gz extension.
sample_files
Infiles to be processed