greengenes database

correct, use one of the pre-trained classifiers

might I note: training your own classifier could be highly beneficial. I have used the HOMD database in the past for saliva bacterial classification with q2-feature-classifier; it is an oral microbiome-specific database and using it together with q2-feature-classifier will increase likelihood of species-level classification. Worth comparing vs. greengenes or silva.

If you're up for a challenge (that will increase species classification accuracy even more) you could also try training a classifier with oral microbiome-specific taxonomic weights, as described in this tutorial:

You could follow that tutorial with either the HOMD or greengenes or SILVA sequences. See the "more exotic weights" section and swap out this line:

redbiom search metadata "cheese where cheese_type=='stilton'" > sample_ids

for this:

redbiom search metadata "where host_taxid==9606 and sample_type in ('Oral', 'oral', 'Mouth', 'mouth', 'Saliva', 'saliva')" > sample_ids
2 Likes