Hello, I downloaded the database for CO1 from MIDORI 2 (Download).
I unzipped the files, imported into QIIME2-2023.2. I run below command in my mac, and it did not work.
/miniconda3/envs/qiime2-2023.2/lib/python3.8/site-packages/q2_feature_classifier/classifier.py:102: UserWarning: The TaxonomicClassifier artifact that results from this method was trained using scikit-learn version 0.24.1. It cannot be used with other versions of scikit-learn. (While the classifier may complete successfully, the results will be unreliable.)
I also run the same command in a server and got below error message: Plugin error from feature-classifier:
Unable to allocate 120. GiB for an array with shape (20000, 807796) and data type int64
****Debug info has been saved to /tmp/qiime2-q2cli-err-djk90mzh.log
I was wondering what the warning message means and how I can train the classifier properly.
Hi again @eDNA,
Another moderator chimed in with some helpful advice!
Have you dereplicated your reference database? Here is the rescript tutorial that may be helpful with that.
You could also skip the classifier and run feature-classifier classify-consensus-blast or feature-classifier classify-consensus-vsearch. For this you may want to consider setting either --p-maxaccepts 32 --p-maxrejects 128 or --p-maxaccepts 64 --p-maxrejects 256 as opposed to performing an exhaustive search.