my sequences do not pass the first filter in DADA2 they have an extremely low percentage

Steven_Criollo · November 16, 2023, 1:11am

hello, I do not understand what is happening, it is the first time that my sequences do not pass the first filter, I would like to understand what may be the reasons for having such a low percentage in the first filter of DADA2, please if someone can give me a hint. I share screenshots of what I’m doing, thank you.

dada2_stats_summ.qzv (1.2 MB)
demultiplexed-sequences-summ.qzv (313.1 KB)

buzic · November 16, 2023, 8:48am

Hi @Steven_Criollo

What read length is your sequencing? for example 250bp paired end? When I look at your demultiplexed-sequences-summ.qzv file it indicates your reads are around 271 in the forward and 270 reverse, and you have set trunc_len_f and trunc_len_r to those values. In the documents for this, found here. It says:

--p-trunc-len-f INTEGER
Position at which forward read sequences should be
truncated due to decrease in quality. This truncates
the 3' end of the of the input sequences, which will
be the bases that were sequenced in the last cycles.
Reads that are shorter than this value will be
discarded. After this parameter is applied there must
still be at least a 12 nucleotide overlap between the
forward and reverse reads. If 0 is provided, no
truncation or length filtering will be performed

So, one issue could be you are throwing away a lot of reads, maybe for a starting point try loosening that threshold and see how you get on?

best,

Vic

Steven_Criollo · November 17, 2023, 9:30pm

Thank you very much, I did´t know that reads that were shorter than the value of trunc_len_f and trunc_len_r were discarted. The problem is solved! thank you!

system · December 19, 2023, 3:30am

This topic was automatically closed 31 days after the last reply. New replies are no longer allowed.