I'm trying to assign taxonomy to my sequencing data for the ITS region using RESCRIPt. When I manually use ncbiBLAST to assign my ASVs, I often pick up a genus that does not appear when I assign taxonomy using RESCRIPt using the NCBI or UNITE databases. Is there a way to search for a specific taxonomic assignment within a database?
The problem with BLAST is that it will show all hits that are similar to your query. Often, the closest hits will not agree on the same specific taxa (e.g. hit different species. Often if the hits are equivalent, the results are arbitrarily sorted. So, the top items in the list are not necessarily the best hits. The classifiers will collapse the taxonomy to the lowest common ancestor if your query hits multiple reference sequences equally well. Especially, if the sequence is identical across different taxa.
Thank you for this response; it helps to understand how the function works. However, I'm unsure if the genus is even within the databases. It doesn't appear at all within my samples when they're highly likely to contain that genus. Is there a way to look at the taxa the database contains?
Then you can search for your genera of interest within the taxonomy. That should let you know if they are contained within that database for classification.
Ah, thank you! This makes sense, I don't know why I didn't think about searching the visual file.. I'm looking for Ascosphaera, It doesn't appear in either of the databases I've made.
I've been using my F/R primer sequences to set my classifiers. I wonder if I'm filtering out this genus by doing so?
Well, the taxonomy file should be based on the full reference database. When making an amplicon-region specific classifier, you basically use the full database taxonomy file along with the subsetted extracted amplicon sequences. So, if you are unable to find the taxon in question it might not be present in the reference database... or be under a different taxon label.
For example.. a quick google search suggests that "Ascosphaera" used to be called "Pericystis". Does it exist under that or another label?
Ohh okay, I initially checked my classifier file and that is where I can't find the genus. I checked the original database and I can see 13 entries of the Ascosphaera genus. I must be cutting out too much when I'm making my classifier. I'll look into other ways to trim the database!