why PICRUSt2 ko_metagenome prediction generates non-integer result?

Lennon_Lee · January 8, 2020, 2:03pm

Dear all,
Thanks for your attention!!

Referring to my previous question: (Alternative way to compare the KOs from PICRUSt2 result) and suggestions from @jwdebelius,

I compare my KOs generated from PICRUSt2 by DEseq2. However, I realize that KO counts from PICRUSt2 are not integers, which generates error while running DEseq2.

I would like to know why PICRUSt2 generates non-integers KO counts? Could I just round up the counts and go for the downstream analysis?

gmdouglas · January 8, 2020, 6:41pm

Hi @Lennon_Lee,

The output predicted abundances are the result of multiplying the normalized # of 16S reads for each ASV by the number of KO copies per predicted genome for that ASV.

The output is non-integers because the the normalized 16S read counts are generated by dividing the raw 16S abundances by the predicted number of 16S copies in that ASV's genome.

I think it's reasonable to round the counts if you want to use DEseq2. I would be more worried about which model to use with DEseq2 because the default settings aren't appropriate for microbiome data in general (you are at risk of generating many false positives in my experience with either taxonomic or functional microbiome data with DEseq2).

Hopefully that helps!

Best,

Gavin

Lennon_Lee · January 9, 2020, 12:36am

GOT IT! I should have checked the DEseq2 paper first! Thank you very much @gmdouglas

system · February 9, 2020, 6:45am

This topic was automatically closed 31 days after the last reply. New replies are no longer allowed.