Hi,
I am a newbie in microbiome analysis. Can someone help me with choosing the correct parameters for dada2 step? My dataset is 16s with V4 region, using Forward primer sequencing: 515F and 806R primer sequences. Before I start the qiime2 pipeline I used cutadapt to remove the primers sequences and bbmap "repair.sh" for fixing disordered paired reads.
In the dada2 step, this is the code I used. I have tried different parameter and so far this is my best result. I want to increase the read to about 80% if it is possible but I dont know how to get there without cutting too much reads.
Thank you for posting your read quality scores and dada2 stats. Your read quality looks great!
To get more of your reads to pass the filter, not only can you trim to a shorter length like you have done, you can also increase the --p-max-ee-f and --p-max-ee-r settings to allow reads with more expected errors. The default setting is 2 but increasing this to 3 could get more of your reads to join, hopefully up to 80% like you mentioned.
Given the quality of your reads, you could also increase your --p-trunc-len-* settings to keep more of your read area during merging.
Thank you so much! I actually went all the way back to the cutadapt step and loose my quality parameter a little bit and on top of that I also did
--p-max-ee-f 3
--p-max-ee-r 5
Now I am able to have more reads passed the dada2 step.