Hello,
Currently I have an abundance table which displays the abundance of the feature ids. I am wondering if there is a way to replace those feature ids with the specific taxonomic identifiers they map to?
Thank you
Hello,
Currently I have an abundance table which displays the abundance of the feature ids. I am wondering if there is a way to replace those feature ids with the specific taxonomic identifiers they map to?
Thank you
Hi @camth! You can use qiime taxa collapse
to create a feature table with taxonomic annotations as its feature IDs. The features within each taxon will have their abundances summed in the resulting table (this is what we mean by "collapsing" at a particular taxonomic level).
Side note: qiime taxa collapse
is very similar to summarize_taxa.py
in QIIME 1.
Hi Thank you for the reply. However upon trying the qiime taxa collapse
it output a table without feature ids or taxonomic annotations only abundances. Upon comparing the tutorial files to mine I saw my files lacked the k_,p_,c_,etc before the identifier. I went back in the original files to add these and repeat the classifier training step only now it has been running for six hours instead of the much shorter (less than an hour) original time. I am wondering is this time period normal and would the addition of those k_,p_ rectify the empty table issue?
Thank You
Hi @camth!
Hmm, something doesn't sound quite right here. Can you try the following:
$ qiime taxa collapse \
--i-table feature-table.qza \
--i-taxonomy taxonomy.qza \
--p-level 2 \
--o-collapse-table collapsed-table.qza
$ qiime feature-table summarize \
--i-table collapse-table.qza \
--o-visualization collapsed-table-summary.qzv
The third tab in the resulting visualization ("Feature Detail") has a table of the features, with the Feature ID replaced with the collapsed taxonomic identifier:
Can you provide a little more detail here about this? What reference database are you using? Not all databases use the k_;p_;c_
notation, so it shouldn't be a problem that you don't have that. The only requirement is that the identifiers be ;
(semi-colon) delimited.
Thanks!
Hello Good Afternoon,
When trying to run the qiime collapse on the original files I received the following error:
Plugin error from taxa:
Requested level of 2 is larger than the maximum level available in taxonomy data (1).
The reference file is a custom built one for the project so in order to provide identifiers I inserted the k;p;c; notation and attempted to re-train the classifier by I receive the following error:
Plugin error from feature-classifier:
not enough values to unpack (expected 2, got 0)
I am not quite sure how to rectify these errors.
Hi @camth, can you please provide an example of your custom reference file? Just a few lines should be enough. As well, can you provide us with the command you are using to train the classifier using this reference DB? Thanks!
Hello thank you
The reference fasta files look like this
>PROS91
GAGCAGATAGTTTACTTTAACTGTTATGTGGTTCTGGATGCGGGTGATCACAAAGACCTGAAG....
>A1524
TAGCAGATAGTCTATTTTAACTGCTACGTGGTGCTGGATCCCGGCGACCACAAAGAGCTGAA....
I also have the main sequence fasta file
then the taxonomy file looks like this
SYN20 SYN1
PROS-9-1 SYN1
ROS8604 SYN1
MVIR-18-1 SYN1
WH8020 SYN1
then i am using
qiime feature-classifier fit-classifier-naive-bayes \
--i-reference-reads ref-seqs.qza \
--i-reference-taxonomy ref-taxonomy.qza \
--o-classifier classifier.qza
Hi @camth! It looks like your taxonomy annotations only go one level deep (e.g. SYN1
, vs SYN1;foo;bar
). You should be able to collapse on level 1:
$ qiime taxa collapse \
--i-table feature-table.qza \
--i-taxonomy taxonomy.qza \
--p-level 1 \
--o-collapse-table collapsed-table.qza
which will effectively group based on those labels. Let us know how it goes!
This topic was automatically closed 31 days after the last reply. New replies are no longer allowed.