Hi,
I would like to discuss with you the genesis analysis I want to adopt. I have 16s hiseq sequences data analysed in qiime 2, I got table.qza, rep-seq.qza and taxonomy.qza for both old samples and new samples. I wanted to see if time changes the abundance of my bacteria taxa in my old and new samples, also whether this is significant.
when I did the analysis with this command without collapsing my taxa I got misleading outcomes as same taxa were showing in nominator and denominator see attached:
qiime gneiss balance-taxonomy
--i-table composition.qza
--i-tree hierarchy.qza
--i-taxonomy taxonomy-.qza
--p-taxa-level 6
--p-balance-name 'y0'
--m-metadata-file meta.txt
--m-metadata-column group
--o-visualization y0_taxa_summary.qzv
That is a confusing output, but it makes absolute sense and is not misleading when you consider what gneiss is comparing here.
You are running gneiss on a feature table of sequence variants that have had taxonomic lineages assigned to them. There is no reason to expect that each unique sequence variant would have its own unique taxonomic lineage, and in fact many are going to be assigned the same exact lineage, some even the same species. It looks like you are examining genus-level results in gneiss, so it is absolutely sensible to expect that there will be multiple sequence variants with the same genus assignment, and there is no reason to expect that all of those will display the same behavior.
Nothing misleading here. You have multiple sequence variants with the same taxonomic assignments that behave differently under your experimental conditions.
Did you see this topic? Look around — it is possible but only with some level of effort. What you are doing now is the "correct" way to do things. Collapsing by taxonomy will reduce potentially important information.
thanks a lot for your quick reply Nicholas_Bokulich,
Yes I saw the post before posting this, but unfortunately it did not answer clearly to my concern.
what I want is to get OTU differences between my samples at genus level 6, if it is not significant then I can move down to species level, I don’t know if it is equivalent to level 7 or 8…
I am not interested to check ASVs at this stage. Do you know how to do that in gneiss ?
I got table.collapsed.qza which is equivalent to OTU table. But I do not know how to collapse.taxa or how to collapse rep.seq.qza.
thanks a lot for your support.
This type of operation is not supported in q2-gneiss at the moment, so you would need to do some custom analysis to rig up your data. There is no way to collapse rep.seqs — it just does not make sense. You can "collapse" taxonomy.qza by exporting that file, trimming to level 6, finding all unique lineages, and exporting a new taxonomy file where the level 6 lineage is both the feature ID and the taxonomy. Import that and it should correspond. This is outside of QIIME2 and really a custom operation, so we can't provide specific support on how to do it.
I collapsed the taxonomy outside qiime 2 then imported to match the table collapsed at level 6, but I am still receiving the same error:
attached the files used: composition-sg1-level6-only.qza (48.6 KB) Taxonomy-l6-sg1.qza (245.1 KB)
the command and the error:
qiime gneiss balance-taxonomy
--i-table composition-sg1-level6-only.qza
--i-tree hierarchy-sg1-level6-only.qza
--i-taxonomy Taxonomy-l6-sg1.qza
--p-taxa-level 6
--p-balance-name 'y0'
--m-metadata-file /mnt/beegfs/scratch/eh309/analysis-lane1and2/ssa1sg1/ssa1sg1-meta.txt
--m-metadata-column group
--o-visualization y0_taxa_summary.qzv
Plugin error from gneiss:
"None of {..} are in the [index]"
The index is not matching up — the feature IDs are likely different between your files, since you did some custom processing. You will need to make sure that the feature IDs in your feature table and taxonomy match exactly.
Thanks a lot, it worked fine.
I have another concern regarding the output. I have 20 taxa as input table.qza but in the output I see only 15 taxa and 5 taxa are missing. Is there a restriction on the maximum limit of taxa in the output ?
I am expecting to see all the proportions of my taxa in the final view: y0_taxa_summary.qzv.
Thanks for your help.
In the tree I have 21 taxa, which is good as nothing was missing. But in the final view 5 of my taxa were filtered and one of them is the second abundant feature in my samples. I can’t understand why were filtered.
@Hajar, I’m looking at your summaries, and I still am having trouble understanding the problem. Are you referring to the taxonomy or proportion plots? If that is the case, it is important to note that this package is not a visualization library, so you may have to extract out the data to plot in something like Python or R.