mmvec: strange biplot and heat map results

LenaFloerl · January 12, 2023, 6:17am

Dear :qiime2: community,

I am trying to estimate microbe-metabolite interactions with mmvec (in qiime-2020.6 since tensorflow won't install in newer qiime2 versions) and my biplot and heat map look really strange!

The metabolite data is normalised for an internal octanol standard and each sample measured twice so I used the mean thereof and filtered with frequency >1.
My microbiome data is just a filtered FeatureTable of Fungi.

In the Emperor plot, all taxa align on a single error and Axis 1 has >93%!

In the Heatmap there seems to be a block of negative correlation - but I don't think I should remove any outliers or normalise the input data.

Stranger even, the model diagnostics look pretty good and I get a Pseudo Q-squared of 0.63.

I would appreciate any ideas or input on why my outputs look this way and what I can do to get "better" results.

Thanks for putting together this great plugin!
Lena

###########################################################

I am attaching the files and code I am using:

qiime mmvec paired-omics
--i-microbes zero_filtered_table.qza
--i-metabolites metabolites_filtered.qza
--p-summary-interval 1
--output-dir model_summary

qiime emperor biplot
--i-biplot model_summary/conditional_biplot.qza
--m-sample-metadata-file Volatiles_categories.txt
--m-feature-metadata-file taxonomy.tsv
--p-ignore-missing-samples
--o-visualization emperor.qzv

qiime mmvec heatmap
--i-ranks model_summary/conditionals.qza
--m-microbe-metadata-file taxonomy.tsv
--m-microbe-metadata-column Taxon
--m-metabolite-metadata-file Volatiles_categories.txt
--m-metabolite-metadata-column Category
--p-level 2
--o-visualization mmvec-heatmap.qzv

metabolites_filtered.qza (456.6 KB)
zero_filtered_table.qza (191.9 KB)
Volatiles_categories.txt (7.6 KB)
Uploading: taxonomy.tsv...

mortonjt · January 19, 2023, 9:23pm

It might not be a problem -- I would be curious to see what your alpha / beta diversity for both your microbiome / metabolome looks like. If you have an acute stressor (i.e. antibiotics, or a pathogenic take over), you could see a dramatic shift in occurring in your community (which is what is hinted by your very skewed MMvec PC axes).

A Pseudo-Q2 = 0.63 is very good, I haven't seen many studies that have achieved that level of cross-validation accuracy. So you may have something biologically interesting

LenaFloerl · February 7, 2023, 5:57pm

Hi Jamie,

thanks a lot for your reply!

I double checked with my collaborator who generated the metabolomics data and we now think that some samples were burnt in the GC-MS because he found some Maillard reaction products in some of them.

In the beta diversity PCoA plots of the metabolites one can also see that a few samples are far aways from the cluster of most samples.

I think the way to go now is to remove these outlier samples but I really don't want to just delete individual samples simply because they appear a bit off in the PCoA plots.
Do you have any recommendations on how to approach this?

Thanks again!

I am attaching the PCoA plots with various diversity metrics for the microbiome as well as the metabolite data:

METABOLITES

Bray Curtis

metabolites_PCoA_BC1920×1282 30.8 KB

Jaccard

metabolites_PCoA_jaccard1920×1282 34.3 KB

Aitchison

metabolites_PCoA_aitchison1920×1282 25.4 KB

FUNGI

Jaccard

fungi_PCoA_jaccard1920×1893 76 KB

Bray Curtis

fungi_PCoA_BC1920×1893 56.4 KB

LenaFloerl · February 15, 2023, 6:03pm

Quick update here!

We used Isolation Forests to remove Outliers and our Emperor Plots look now more like what we expected - the arrows stratify better - while retaining high predictive accuracy!

Thanks again for this fantastic plugin!

system · March 19, 2023, 12:03am

This topic was automatically closed 31 days after the last reply. New replies are no longer allowed.