Songbird: Problems with graph for null model

Boris_Martin_Tangoa · May 19, 2023, 1:55pm

Hello group

I was running Songbird to identify differently abundant features on my data. When checking how Songbird model fits my dataset, i think my graphs have a nice shape (although Loss graph could me improved).

!qiime songbird multinomial
--i-table Run1/tableRun1.qza
--m-metadata-file metadata.tsv
--p-formula "Treatment"
--p-epochs 10000
--p-differential-prior 0.5
--p-summary-interval 1
--o-differentials Run1/Songbird/differentials.qza
--o-regression-stats Run1/Songbird/regression-stats.qza
--o-regression-biplot Run1/Songbird/regression-biplot.qza

!qiime songbird summarize-single
--i-regression-stats Run1/Songbird/regression-stats.qza
--o-visualization Run1/Songbird/regression-summary.qzv

!qiime tools view Run1/Songbird/regression-summary.qzv

When running the null model i obtain a pseudo Q-squared of 0.421411, but the orange line on the graphs doesn't look right to me.

Generate a null model

!qiime songbird multinomial
--i-table Run1/tableRun1.qza
--m-metadata-file metadata.tsv
--p-formula "1"
--p-epochs 10000
--p-differential-prior 0.5
--p-training-column Testing
--o-differentials Run1/Songbird/null-diff.qza
--o-regression-stats Run1/Songbird/null-stats.qza
--o-regression-biplot Run1/Songbird/null-biplot.qza

Visualize the first model's regression stats and the null model's

regression stats

!qiime songbird summarize-paired
--i-regression-stats Run1/Songbird/regression-stats.qza
--i-baseline-stats Run1/Songbird/null-stats.qza
--o-visualization Run1/Songbird/paired-summary.qzv

!qiime tools view Run1/Songbird/paired-summary.qzv

Any ideas why this could be happening? I know the Q-squared should be positive, but is it okay with a graph like that?
Best regards
Boris

mortonjt · May 19, 2023, 2:45pm

Did you include the wrong plots? I don't see the orange trend.

Boris_Martin_Tangoa · May 22, 2023, 9:42am

Sorry, attached the wrong graph.

mortonjt · May 22, 2023, 10:43am

hmm. I have not seen a plot like that before -- looks like something is off with the summary-interval (i.e. the timing of recording the metrics is off ...)

But given the the cross-validation score has flat lined for both models, I'd say the run is solid, so you should be able to run with it.

Boris_Martin_Tangoa · May 22, 2023, 11:36am

Hi i realized i made a mistake on my script for the null model (--p-training-column Testing), as i was not using the Testing column for the first set of graphs.

Now i am wondering which aspect should i prioritize, graph shape or Q-squared value
With the following script i obtain a Q-squared of 0.053696 but the loss option has peaks and valleys.

!qiime songbird multinomial
--i-table Run1/tableRun1.qza
--m-metadata-file metadata.tsv
--p-formula "Treatment"
--p-epochs 20000
--p-differential-prior 0.05
--p-summary-interval 1
--o-differentials Run1/Songbird/differentials.qza
--o-regression-stats Run1/Songbird/regression-stats.qza
--o-regression-biplot Run1/Songbird/regression-biplot.qza

!qiime songbird summarize-single
--i-regression-stats Run1/Songbird/regression-stats.qza
--o-visualization Run1/Songbird/regression-summary.qzv

Generate a null model

!qiime songbird multinomial
--i-table Run1/tableRun1.qza
--m-metadata-file metadata.tsv
--p-formula "1"
--p-epochs 20000
--p-differential-prior 0.05
--p-summary-interval 1
--o-differentials Run1/Songbird/null-diff.qza
--o-regression-stats Run1/Songbird/null-stats.qza
--o-regression-biplot Run1/Songbird/null-biplot.qza

Visualize the first model's regression stats and the null model's

regression stats

!qiime songbird summarize-paired
--i-regression-stats Run1/Songbird/regression-stats.qza
--i-baseline-stats Run1/Songbird/null-stats.qza \

Boris_Martin_Tangoa · May 22, 2023, 11:42am

When setting --p-learning-rate to 0.0001, loss function looks a bit better but Q-squared reduces to 0.023946.

mortonjt · May 23, 2023, 1:50pm

I'd prioritize Q2. So I think your earlier model is better.