Problems with q2-quality-control using Mockrobiota

Nicholas_Bokulich · June 22, 2019, 5:08pm

Hi @xchromosome,
I think you took a wrong turn pretty early on. It sounds like you were taking the fastq sequence data from mockrobiota, processing it, and then attempting to use that as the "expected composition"? That's not correct — those data are essentially the equivalent of your own, being "observed" compositions of that mock community following some sequencing protocol.

Instead, you need to find the "expected taxonomy" file for that mock community and import that as a relative frequency table.

See this tutorial; it is using a fungal mock community but gives you a good idea of how the "expected taxonomy" from mockrobiota should be used:

yeah, q2-quality-control will show you actual accuracy scores (excel will not), but it will not make a bar chart of the compositions (excel will). You could do this with QIIME 2 taxa barplot, but it would take several steps to get your data in an appropriate format:

filter your feature table to only contain the mock community(s)
import the mockrobiota expected taxonomy as a feature table
merge the filtered and expected tables
make a modified taxonomy file for the expected taxonomy table. You would take the same expected taxonomy you used but copy the columns so you get something like this:

feature-id	Taxon
Bacteria;blah;blah;blah	Bacteria;blah;blah;blah
Bacteria;blah;blah;blah	Bacteria;blah;blah;blah
Bacteria;blah;blah;blah	Bacteria;blah;blah;blah

import that modified file as FeatureData[Taxonomy]
merge the taxonomy file for your observations with that modified taxonomy file
run qiime taxa barplot

Sorry! Not a direct route. mockrobiota was made in a pre-QIIME 2 era and has not been updated yet to conveniently mesh with QIIME 2; and QIIME 2 was certainly not built to cater to the needs of QIIME 2. One day this integration will occur...