Hi @smayne11, thanks for your question.
I think you should give the ANML classifier a shot. I'd be curious to see how many of your sequence variants are classified (or unclassified), and among those with some taxonomic labels, what fraction are being assigned Family, Genus, or Species-level information.
If the classifier isn't working for you and you want to generate your primer-specific classifier starting from the broader BOLD sequence and taxonomy files now hosted by QIIME2 here:
- https://qiime2-data.s3-us-west-2.amazonaws.com/community-contributions-data-resources/2021.04-bold-resources/bold_anml_classifier.qza
- https://qiime2-data.s3-us-west-2.amazonaws.com/community-contributions-data-resources/2021.04-bold-resources/bold_anml_seqs.qza
- https://qiime2-data.s3-us-west-2.amazonaws.com/community-contributions-data-resources/2021.04-bold-resources/bold_anml_taxa.qza
- https://qiime2-data.s3-us-west-2.amazonaws.com/community-contributions-data-resources/2021.04-bold-resources/bold_rawSeqs.qza
- https://qiime2-data.s3-us-west-2.amazonaws.com/community-contributions-data-resources/2021.04-bold-resources/bold_rawTaxa.qza
Even if you lack the local computing power, you might dive into renting a machine through something like Google Cloud or AWS - that's the route I ended up taking for some of these compute-heavy tasks.
Good luck, and do please let me know how you make out with the ANML classifier.