There is a directory with paired reads, the names and number of pairs are unknown, the possible file extensions are .fastq or .fastq.gz.
I need advice on how best to get paired reads in a directory for processing in the snakemake pipeline.
There is a directory with paired reads, the names and number of pairs are unknown, the possible file extensions are .fastq or .fastq.gz.
I need advice on how best to get paired reads in a directory for processing in the snakemake pipeline.
Copyright © 2021 Jogjafile Inc.
There are multiple ways to go about this. My preferred method is to use a CSV to specify read paths. An added benefit of this is that you can add metadata to each set of reads that you can then access in your workflow.
Here is an example for mapping reads for three samples:
A, B, CDirectory structure:
sample_reads.csv:Snakefile: