In case anyone is interested I have a pretrained classifier on the V3-V4 region using the following primers:
341F: CCTACGGGNGGCWGCAG
805R: GACTACHVGGGTATCTAATCC
The primer set is taken from Illumina's 16S library preparation protocol.
It was trained using the Greengenes 99_otu_taxonomy.txt using naive bayes method on qiime2017.12, its about 44MB.
If this is something you'd like to use let me know!
-Bod
I'm interested in the V3-V4 classifier you used. I have exactly the same primers pair in my data. I would also like to take a look on the commands you used, so I can try to reproduce it in my computer from the gg-13-8 full-length sequences database.
Here is a dropbox link to the classifier to download, let me know if you run into any problems. You can follow the exact commands within the provenance in the file to see what was exactly done.
Hope this helps!
Hi @sakther1, Sorry for the slow reply. We're discussing how we can provide this and hope to have an update shortly. Note though that the full-length classifier should work almost as well, so that's a totally fine one to use too.
Hi @sakther1, My recommendation right now is to use the full-length classifier that is linked from https://resources.qiime2.org. There is not a very big performance difference between the region-specific classifiers and the full-length classifiers. An alternative though, if you do want to use a region-specific classifier, would be to train one using q2-feature-classifier. The files that @Mehrbod_Estaki used for training this one are available for download here.