Hello,
I'm running “Atacama soil microbiome” tutorial and have a question regarding where to trim and truncate using DADA2.
I was trying to figure out why you chose 13 and 150 sequence base here:
qiime dada2 denoise-paired \
--i-demultiplexed-seqs demux.qza \
--p-trim-left-f 13 \
--p-trim-left-r 13 \
--p-trunc-len-f 150 \
--p-trunc-len-r 150 \
--o-table table.qza \
--o-representative-sequences rep-seqs.qza \
--o-denoising-stats denoising-stats.qza
I used this command to get a demux.qzv as I want to see the quality plots. I got the demux.qza from this link. demux.qza
This file is provided right before the above commands I found on the tutorial page.
qiime demux summarize --i-data demux.qza --o-visualization demux.qzv
Next, I looked at the quality plots from demux.qzv file. Here is a screenshot.
If I choose the first sequence base that has a quality score lower than 30, then it would be 142 for forward reads, and 4 for reverse reads.
How would you determine that 150 and 13 is the appropriate sequence base to trim and truncate?
Thanks!