ESV vs OTU deflation (QIIME1 vs 2)

Nicholas_Bokulich · May 9, 2020, 10:30pm

Welcome to the forum @nouse1234!

Your results are fairly consistent with expectations and past observations. OTU clustering does a poor job of weeding out sequence errors (or to be more correct, it does not, it just clusters sequences hopefully collapsing some of the rarer errors into hopefully correct centroids). Denoising methods (such as used in QIIME 2) actually remove and/or correct those erroneous sequences, leading to fewer spurious observations. In QIIME 1 we recommended using stringent filtering settings to weed out spurious OTUs; without that, alpha diversity estimates will be vastly overinflated (see the article for details).

Because denoising is removing those spurious reads that otherwise just get clustered into rare but likely incorrect OTUs. Check out the dada2 and deblur original publications for benchmarked examples.

The lack of perfect correlation does not surprise me, since denoising methods are combining filtering as well as (in the case of dada2) error correction, thereby dereplicating reads into fewer ASVs as well as weeding out noise.

Not an issue, an expectation! If you follow the stringent OTU filtering recommendations for qiime1 linked above, you will find somewhat more similar results between OTUs + ASVs.

Here's a ream of other forum reading that might interest you (both past answers to this same question as well as feedback that relates to your specific workflow, esp. the use of clustering after denoising):

I hope that helps!