Wondering what I've done wrong in feeding a taxonomy.qza file into qiime rescript evaluate-taxonomy
to generate the output shown in the table below. The number of unique labels and entropy calculations seem fine, but I don't understand where I've failed with regards to the terminal labels and unclassified labels categories.
Level | Unique_Labels | Taxonomic_Entropy | Terminal_Labels | Proportion_Terminal_Labels | Classified_Labels | Proportion_Classified_Labels | Unclassified_Labels | Unclassified_Labels |
---|---|---|---|---|---|---|---|---|
1 | 1 | 0 | 0 | 0 | 24561 | 1 | 0 | 0 |
2 | 1 | 0 | 0 | 0 | 24561 | 1 | 0 | 0 |
3 | 14 | 1.492 | 0 | 0 | 24561 | 1 | 0 | 0 |
4 | 173 | 3.87 | 0 | 0 | 24561 | 1 | 0 | 0 |
5 | 1089 | 5.83 | 0 | 0 | 24561 | 1 | 0 | 0 |
6 | 6416 | 7.91 | 0 | 0 | 24561 | 1 | 0 | 0 |
7 | 17672 | 9.36 | 24561 | 1 | 24561 | 1 | 0 | 0 |
I would have expected unclassified labels at Species (4231), Genus (2515), Family(2172), Order (259), and Class (2). Yet these are not being reported.
My initial thought was that my taxonomy file I imported when creating this .qza object used incorrect rank handles. The initial file was structured like this:
Feature ID Taxon
10013526 tax=k__Animalia;p__Chordata;c__Actinopterygii;o__Perciformes;f__Serranidae;g__Caesioperca;s__Caesioperca rasor
10013530 tax=k__Animalia;p__Chordata;c__Actinopterygii;o__Tetraodontiformes;f__Tetraodontidae;g__Contusus;s__Contusus brevicaudus
10013534 tax=k__Animalia;p__Chordata;c__Actinopterygii;o__Perciformes;f__Cheilodactylidae;g__Cheilodactylus;s__Cheilodactylus variegatus
so I then thought that maybe the tax=
prefix to the Taxon field was causing the error. I removed that portion so that the next data set looked like this:
Feature ID Taxon
10013526 k__Animalia;p__Chordata;c__Actinopterygii;o__Perciformes;f__Serranidae;g__Caesioperca;s__Caesioperca rasor
10013530 k__Animalia;p__Chordata;c__Actinopterygii;o__Tetraodontiformes;f__Tetraodontidae;g__Contusus;s__Contusus brevicaudus
10013534 k__Animalia;p__Chordata;c__Actinopterygii;o__Perciformes;f__Cheilodactylidae;g__Cheilodactylus;s__Cheilodactylus variegatus
and then reran the evaluate-taxonomy
function.
Then I got the exact same result ...
Thanks to @SoilRotifer @Nicholas_Bokulich and others for any help you can offer in trying to understand why the evaluate-taxonomy
function isn't calculating the empty labels as I'd expect!