Hello!
I am looking for a naive-bayes classifier based on Silva reference database for V3-V4 region, 341F, 805R, generated with scikit-learn version 0.21.2.
One potential solution is to install the QIIME 2 version that was used to generate that classifier — looks like 2019.4 uses scikit-learn=0.20.2. You can install separate environments using conda, so that you can switch to QIIME 2-2019.4 for taxonomy classification, then switch back to the latest release for downstream analyses.
it sounds like this is a new problem — memory errors with classify-sklearn instead of with fit-classifier-naive-bayes — correct?
if so, check out the forum archive (click on the symbol in the top-right corner of the forum website to enter search terms). Search for MemoryError --p-reads-per-batch to find many posts with troubleshooting advice. There are some ways to mitigate memory constraints with classify-sklearn, which may help you get this command running now.
You should do that too! Because then you (and others) will have the capacity to run higher-memory jobs in the future.
I have a Mac 8 GB 1867 MHz DDR3, 2.7 GHz Intel Core i5, with two processors and it took me 5 h to run.
I hope this helps others.
Thank you very much to you Nicholas and to my friend Checo for your recommendations.