First of all, thank you for your support in the past: I finally have mmvec installed properly and working with my own data.
I'm just a bit concerned because my convergence summaries look very different from the example data, as well as from other people's data. I've adjusted various parameters, and I'm just not sure what's going on! I have almost 200 samples, so I don't think that's the issue. My Q2 value is also consistently negative. I realize this might just mean that there isn't a significant relationship in my data, but given my preliminary statistical analysis, I don't think that's the case.
The code I used to run the model that gave me the "best" results:
Hi, it is hard for me to say. The one thing that stands out is that latent-dim=15 is really high. I typically don't much beyond latent-dim=10, particularly for 200 samples.
I'm also wondering if the priors are hurting you here. If you have 200 samples (and your dimensionality isn't crazy high), then you can be more lenient with those priors, and set them higher (i.e. 1). The higher the prior, the less constrained the model is.
Of course, this is highly dependent on how many features you have in your microbe / metabolite tables. If you have more than 10K features, it can be tricky to get a good fit with 200 samples
Thank you so much, this was actually really clarifying. Lowering the latent-dim and increasing the priors helped, but there was still a lot of overfitting.
I actually ended up dividing my data up by treatment group, and have been having better luck that way. However, some of the paired results have shown and...odd representation of the null model, and I'm not sure what to make of it.