When to rarefy in picrust2

william_isidorio · January 31, 2019, 12:48am

Hello there,

I was following the picrust2 tutorial for qiime2 (q2 picrust2 Tutorial · picrust/picrust2 Wiki · GitHub) and observed that the rarefaction step is made after we get the predictions. Wouldn't it be better to rarefy the samples before getting the predictions to avoid getting a inflated bias? What are your opinions?

Thanks.

jwdebelius · January 31, 2019, 2:06pm

Hi @william_isidorio,

Okay, the first caveat here is that rarefaction is a complicated discussion in general. That said, the concensus in 2019 is that rarefaction is necessary from most diversity analyses (but not all, check out breakaway plugin). It's also not recommended for feature-based analysis anymore.

If you rarify before you do your PICRUSt prediction, you're going to lose data from the rarefaction. But, you're also going to come out with an uneven depth, because the prediction counts vary based on the gene prediction. So, you might rarify and still discover that your samples don't add up to the same depth.

In terms of bias, rarefaction is still a somewhat random process, so there is the chance that your observed rarified PICRUSt table doesn't perfectly correspond to the organisms in your observed rarefied ASV/OTU table. Beta rarefaction can somewhat alleviate this problem (with the caveat that you're probably going to be happier running this on a smaller dataset), but you are still dealing with some noise in your model due to those differences.

Best,
Justine

william_isidorio · January 31, 2019, 3:08pm

Thank you very much for clarifying this @jwdebelius .

Do you know any references which I could read to learn more about this?

jwdebelius · February 1, 2019, 12:18pm

There's a fair bit of discussion in this thread that might be worth a read.

Best,
Justine