I’ve ran Qiime2 with DADA2 before and I usually would have around 30% of my sequences being filtered out after running DADA2, but this time I got only around 35% of my sequences back (meaning that 65% of my sequences are being filtered per sample).
Looks like someone in the lab used my old github account to post, but I can at least provide some insight here.
We are using the 341F -806R primer pair that I have used in the past for other data sets without much trouble. The merged amplicon is about 418 bp. From the same MiSeq run I had ran 2 other data sets that turned out fine. The data was generated with the adapter+primer PCR, so the primers may be in the read and that is something we can fix and rerun. There were some concerns about some variable quality in the first 100bp of both the forward and reverse reads (nothing major, but a wider spread than normal).
That’s definitely the first thing to try. If the data loss persists after the primers are off, we can revisit, but there’s a very good chance that will fix the issue, as primers with ambiguous nucleotides hanging around can interfere w/ both merging and chimera removal.