I'm glad to use brand-new database Greengenes2!
I have a few questions while I'm using it.
I failed to execute the greengenes2 plugin, although I’ve done 'pip install q2-greengenes2' (error: QIIME 2 has no plugin/command named 'greengenes2'). I suspect it might be a version update problem with Ubuntu (my version is 20.04.6 LTS).
Therefore, instead of using greengenes2 plugin, I downloaded '2022.10.backbone.full-length.fna.qza' and '2022.10.backbone.tax.qza' from the Index of greengenes_release/2022.10.
Is downloading the two files ('2022.10.backbone.full-length.fna.qza' and '2022.10.backbone.tax.qza') and proceeding with the rest of the process the same as creating the two files through 'If there is data other than V4'?
When I download two files from online ('2022.10.backbone.full-length.fna.qza' and '2022.10.backbone.tax.qza' ), are there md5Sum values available?
For performing 'feature-classifier' using naïve-bayes, it only took 1-2 hours. Can it be this fast? (Of course, the speed depends on the CPU and GPU I use.)
I'm training a classifier on V3-V4 regions using Greengenes2 using the following commands below.
I don't recall off hand what to expect for the performance of application of feature-classifier. Is that performance okay for your use?
These seem reasonable. For reference, the commands used to construct the release classifiers can be found here.
There are a lot of public data readily accessible in Qiita, and which can be obtained via the command line using redbiom. A tutorial on that can be found here.
When I did 'pip install q2-greengenes2', there were a bunch of 'Requirement already satisfied... sentences, looked like the installation was successful. Also I installed Qiime2 version 2023.9.