I'm using paired-end reads, here is a breakdown of the first few steps of my workflow, the filtering step happened after the joining step, unless this is not the common practice?
I ran a QC and I know would trim around a length of 250 based on the phred score but otherwise I am not sure what parameters I would adjust in any of the following commands to improve this, as most of my reads are lost after joining. Wouldn't the trimming stage only take away the adapter sequences?
Here is the general workflow of my first few steps before constructing ASVs:
- import with qiime tools import
- trim with command cutadapt trim-paired
- join with command vsearch join-pairs
- filter with command quality-filter q-score
- construct ASVs...
And here are the parameters for the vsearch join-pairs command, but I am unsure if this is what I would have to adjust to receive a higher read count, the trim command does not offer many parameter settings related to read length:
--p-truncqual INTEGER Truncate sequences at the first base with the
Range(0, None) specified quality score value or lower. [optional]
--p-minlen INTEGER Sequences shorter than minlen after truncation are
Range(0, None) discarded. [default: 1]
--p-maxns INTEGER Sequences with more than maxns N characters are
Range(0, None) discarded. [optional]
--p-allowmergestagger / --p-no-allowmergestagger
Allow joining of staggered read pairs.
[default: False]
--p-minovlen INTEGER Minimum overlap length of forward and reverse reads
Range(0, None) for joining. [default: 10]
--p-maxdiffs INTEGER Maximum number of mismatches in the forward/reverse
Range(0, None) read overlap for joining. [default: 10]
--p-minmergelen INTEGER
Range(0, None) Minimum length of the joined read to be retained.
[optional]
--p-maxmergelen INTEGER
Range(0, None) Maximum length of the joined read to be retained.
[optional]
--p-maxee NUMBER Maximum number of expected errors in the joined read
Range(0.0, None) to be retained. [optional]
--p-qmin INTEGER Range(-5, 2, inclusive_end=True)