Using qiime taxa collapse Before Logistic Regression: Is It Appropriate?

Hi @jau,

I think the answer is It Depends :tm: on several things.

The answer to this is yes. You can absloutely collapse your data to genus level before doing... whatever. You're making a set of assumptions which are discussed in the linked posts below (these are are a starting place.)

There's a second issue in your question, and that's about the appropriate use of logistic regression in microbiome data. I assume you 're using the taxa to predict an outcome? This might also be something to consider. I appreciate that the standard consideration in models that if your exposure (e.g. microbiome) should predict your outcome (e.g. disease) but this can be more complicated.

If helpful, there's also a discussion about that on here:

https://forum.qiime2.org/t/modeling-bacterial-features-as-indepedent-variables/25766/8

Again, happy to discuss further in general terms and as always happy to have others involved!

Best,
Justine

3 Likes