Hi QIIME2 community!
I am working with Illumina's reads. My demux-paired-end graph looks like this:
I am working with amplicons from V3-V4 region, so reading the "Atacama soil microbiome" it says that "we need the reads to be long enough to overlap when joining paired ends (...) no trimming is being applied to the ends of the sequences to avoid reducing the read length by too much". Taking this advice into account i did not trim the end of sequences with the following command:
*qiime dada2 denoise-paired *
- --i-demultiplexed-seqs demux-paired-end.qza *
- --p-trim-left-f 16 *
- --p-trim-left-r 24 *
- --p-trunc-len-f 300 *
- --p-trunc-len-r 300 *
- --p-chimera-method consensus *
- --o-table table.qza *
- --o-representative-sequences rep-seqs.qza *
- --o-denoising-stats denoising-stats.qza &*
After that, when I checked the denoising-stats file I saw:
As you can see I lose a big number of reads and this fact concerns me because I could be losing reads that would help with bacteria identification.
I tried to trim according to the quality, which means when the quality drops, to see if there is a significant difference with the following command:
*qiime dada2 denoise-paired *
- --i-demultiplexed-seqs demux-paired-end.qza *
- --p-trim-left-f 16 *
- --p-trim-left-r 24 *
- --p-trunc-len-f 300 *
- --p-trunc-len-r 230 *
- --o-table table.qza *
- --o-representative-sequences rep-seqs.qza *
- --o-denoising-stats denoising-stats.qza &*
The denoising stats file is:
In this case, the amount of reads that I lose is less than when I didn't do the trimming applied to the end of sequences.
In one command I wrote the p-chimera option but I don't think that this could be a problem because the parameter I wrote is the default. Which approach do I have to follow to trim the sequences and why?
Thank you very much for your help!
Dani