How to interpret quality scores?

Joeee · November 6, 2024, 4:33pm

Hi, I'm looking at some 16s microbial data and got the following set of quality scores. Initially I tried trimming these samples at 200 (forward) and 180 (reverse), but my feature table frequency was an average of 25 counts with a peak of 750 (and many, many counts at 1).

P.s. I presume I over-filtered my data, so I want to ask directly how best to interpret these quality scores.

Help Interpreting Interactive Quality Plot - User Support - QIIME 2 Forum I've also seen this post about the binned quality scores, but I'm not sure what specific scores are considered 'good'. I assume its 30 Sequencing Quality Scores but just wanted to double check.

Thanks

colinvwood · November 6, 2024, 5:58pm

Hello @Joeee,

These indeed look like "binned" quality scores. If you used dada2 to generate the feature table, I would recommend looking at the dada2 stats visualization to gauge the impact of your truncation parameters. Try rerunning with other truncation parameters and then compare. If I were using dada2 on these data I wouldn't truncate at all. There is really no definition of what constitutes a "good" quality score. Different people have different rules of thumb. Whatever parameters (within reason) to dada2 give you the highest percent of reads retained is what I would move forward with.

Joeee · November 7, 2024, 1:40pm

Thanks, makes sense. I've only used FASTQC before this so things are quite visually different from what I'm used to!

system · December 8, 2024, 7:40pm

This topic was automatically closed 31 days after the last reply. New replies are no longer allowed.