I tried to use the minimum overlap parameter, I set --p-min-overlap 4, actually the result doesn't change too much. It is basically the same as the no truncation result. Seems I can't do anything else to improve my results? Do you have any suggestions? Thanks!
Based on your results, you are still loosing reads on merging step after filtering. From one side, you have enough reads to proceed, from another - you results may be biased towards bacteria with shorter amplicons.
So option 1 will be just proceed with disabled truncation by Dada2 with a risk of biased data.
Option 2 is to use only forward reads. This way you will keep most of the reads but taxonomy annotations will be performed with shorter reads.
Option 3 is to try to merge your reads with vsearch plugin, where you can not only decrease overlapping region but also allow some mismatches and then denoise by Deblur to see if it will improve the output.
Then I run it on dada2, also I indicate maxExpectedError to 2 and 3, here is what I got
No reads passed the filter. trunc_len_f (238) or trunc_len_r (249) may be individually longer than read lengths, or trunc_len_f + trunc_len_r may be shorter than the length of the amplicon + 12 nucleotides (the length of the overlap). Alternatively, other arguments (such as max_ee or trunc_q) may be preventing reads from passing the filter.
Then I realize, maybe it's because I forgot to set the primer length in figaro as I'm using the original data without trimming. But it still recommend me the same truncation position.
Figaro seems like a great tool to know the truncation value, but I got confused about the result I got when running dada2.
Hi, I've had a similar problem with some soil DNA. What I found worked best was to use only FWD reads and adjust the p=min to 4. Then I was able to keep more than 70% of reads after DADA2, whereas before adjusting I was only keeping between 30-40%. Interestingly, this is only happening for certain soil samples, other soil samples I collected in a different location are working much better. Just out of curiosity, what kind of soil/DNA extraction method are you using?
Thanks for your suggestion, I used V3V4, 341F-804R region, and the forward/reverse reads are around 245bp. So it's because the primer sets using or it's soil DNA caused only FWD reads are the best case you've got? I used powersoil kit (Qiagen) to extract my DNA.
I have used both the PowerSoil and a phenol:chloroform extraction method - the PowerSoil does seem to give me better quality DNA, especially after I add some powdered skim milk to the bead-beating step. I'm sequencing reads on a MiSeq v3 600, also with the 341F-804R primer pairs. The extractions I performed with the PowerSoil kit don't need the p-min drop to 4 to keep over 70% of reads after DADA2. But I do just use the FWD reads, the REV reads I usually get are just not good quality.
I'm working with similar data (DNA extracted with PowerSoil, same primer pair, plus a organellar blocking primer set, Miseq sequencing),
I also lost a lot of data during dada2 step (see image below) even after trying all variations of metaparameters (FYI, I truncated fwd at 290 and rev 260, trimmed first 10 bp, this last one helped a lot),
I might sound mean, but with the combination of protocols we used, apparently we get low throughput, I even got a sample which only a 37% survived after all dada2 process.
Sorry for the late reply. I decided to use no truncation and minimum overlap value even though it still loses many reads. But most of the samples still remain 20,000 reads. I assume it's still enough. Thanks for all your help!
Hi @timanix
Thank you for the information you provided in your answers, I'm new to qiime and bioinformatics and you have helped a lot. I was wondering if you could help me out with this same issue I am having.
So I used the 515F and 806RB primers and The V4 region is ~252 nts long. Add 39 (515F=19, 806R=20) bases of primers and you are up to 291 nts. I only have 2x150 = 300 nts of total sequencing, so just ~9 nts of overlap.
With the default -p-min-overlap setting I was getting near 0% merging, I then tried -p-min-overlap 6 and I'm getting much more merging. I tried truncating at 129 for forward read and 130 for reverse read, the merging is 60% to 80% but I have a lot of 40% and it is worrying me so I tried to truncate at 0 and I am getting much better merging. the quality after trimming the primers using cutadapt was not bad, do you think I should use trunc =0? or should I lower the min overlap some more and does lowering the minimum overlap affect the result negatively?
Hi @Nayla_Higazy ,
Looks you already managed to get it working! If disabling truncating parameter by providing a 0 gives you better output, you should use it. Your last output looks fine to me as it is already and you can proceed with it.
If you still want to try to adjust the settings, I would:
Set trunc parameter to 0 for forward reads and 140 for reverse with 4 bp as overlap
Run with 0 for both trunc parameters and 4 as overlap
Choose the best parameters from your settings you already run and a new ones.