Hi @jjmmii,
No. The distinction is that the first row (ending in __;__
) cannot be confidently classified beyond family level (probably because a close match does not exist in the reference database). So sequences receiving that classification can be any taxon in f__Geodermatophilaceae
. The second row (ending in g__;s__
) DOES have a close match in the reference database and hence is confidently classified at species level — unfortunately, that close match does not have genus or species-level annotations. This does not in any way imply that these two different taxonomic affiliations are related beyond the family level, so it would probably be inappropriate (or at least presumptuous) to collapse these at species level.
I would recommend considering these to be unique taxa unless if you have other evidence that they should be collapsed. (e.g., reclassify these features with another classifier like classify-consensus-blast
and/or with SILVA database to see if these give a better idea of what these features may represent — with the caveat that a more satisfying answer may not necessarily be the "correct" answer)
An easier way to do this for your use case (counting the number of unique taxa) would probably be to filter on taxonomy in QIIME2. Passing something like ";__" to the exclude
parameter might accomplish what you are describing, though I have not tested this so cannot be sure.
Good luck! I hope that provides some more clarity.