I am analyzing a batch of single-end third-generation 16S sequencing data, which has been demultiplexed.
The Interactive Quality Bar Plot and the demux.qzv file obtained after import are shown below:
However, both the bar plot and the seven-number summary suggest that the quality scores for these reads are much higher than 40. Is this normal?)
I consistently observe a low percentage of input passed filter when using denoise-ccs for data denoising.
I have experimented with adjusting several parameters, including setting --p-trunc-len 1520 and --p-trim-left 10, but the filter percentage hasn't increased.
The command I used and the denoising statistics are as follows:
This is a little higher than usual. Usually if it was the wrong Phred score you will get a warning on the demux.qzv about it. Did you get that warning?
Check out --p-min-fold-parent-over-abundance. Maybe that will help you! Reminder though, We dont want chimeras in our sequences, so be aware that as you are messing around with parameters you are most likely making it easier for chimeras to slip through!
--p-min-fold-parent-over-abundance NUMBER
The minimum abundance of potential parents of a
sequence being tested as chimeric, expressed as a
fold-change versus the abundance of the sequence
being tested. Values should be greater than or equal
to 1 (i.e. parents should be more abundant than the
sequence being tested). Suggest 3.5. This parameter
has no effect if chimera-method is "none".
[default: 3.5]
I also notice from your screen shot that your filter passing seems low as well. Are the majority of your samples getting 50% or less sequences pass the filtering step? If so, I might try messing around more with --p-trunc-len to see if you can get more sequences past the filtering step.
I hope that helps!