Denoising with DADA2 - Single runs vs. Pooled

Mike_Stevenson · July 12, 2024, 4:34pm

Hi @buzic - I wonder if I could ask a quick question regarding denoising with DADA2 from the post linked below.

My question is about running the denoising step on individual runs. If I ran it separately on, for example, 10 individual runs spanning a year-long longitudinal study, would that give better outputs for downstream analysis than denoising all 10 runs pooled together?

What would be the downsides of pooling the runs versus denoising them individually with DADA2?

Many thanks.

SoilRotifer · July 12, 2024, 5:03pm

Hi @Mike_Stevenson,

There has been a lot of discussion about which approach to use, and there really isn't a clear winner between pooling and not pooling. If you do pool, you may have to adjust other parameters, and/or sequencing depth, as discussed here.

I personally like to use --p-pooling-method pseudo and --p-chimera-method pooled, along with --p-min-fold-parent-over-abundance set to 8 or 16, but not higher, as mentioned here.
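
To make those options concrete, here is a minimal sketch in Python that only assembles and prints the corresponding command line, so it can be checked before anything is run. It assumes paired-end reads (hence `qiime dada2 denoise-paired`); the helper name `build_denoise_command`, the file name `demux.qza`, the output directory name, and the truncation lengths of 0 (no truncation) are placeholders for illustration, not values from this thread.

```python
import shlex


def build_denoise_command(demux_qza, output_dir, min_fold=8,
                          trunc_len_f=0, trunc_len_r=0):
    """Assemble a `qiime dada2 denoise-paired` call as a list of arguments.

    Hypothetical helper: file names and truncation lengths are placeholders
    and should be chosen from your own quality plots.
    """
    return [
        "qiime", "dada2", "denoise-paired",
        "--i-demultiplexed-seqs", demux_qza,
        # 0 means no truncation; placeholder values only.
        "--p-trunc-len-f", str(trunc_len_f),
        "--p-trunc-len-r", str(trunc_len_r),
        # Options suggested in the reply above.
        "--p-pooling-method", "pseudo",
        "--p-chimera-method", "pooled",
        "--p-min-fold-parent-over-abundance", str(min_fold),
        "--output-dir", output_dir,
    ]


# Print the command as a single shell-quoted line for inspection.
command = build_denoise_command("demux.qza", "dada2-pseudo-pooled", min_fold=8)
print(shlex.join(command))
```

The printed line can be pasted into a shell where QIIME 2 is active; changing `min_fold` to 16 gives the other setting mentioned above, which makes it easy to compare the two outputs side by side.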

Does this help?

Mike_Stevenson · July 12, 2024, 7:48pm

Thanks for the post links, they were very informative. I'll check out these parameters and compare their effects against my current outputs (generated without them). Thanks for your help!

system · August 13, 2024, 1:49am

This topic was automatically closed 31 days after the last reply. New replies are no longer allowed.