Hi,
I downloaded the sequence file of Greengenes2 (2024.09.backbone.full-length.nb.qza, 2024.09.backbone.tax.qza) and used the following command to create the classifier in the V34 area. I would like to know if it is necessary to cluster the Greengenes2 sequences with a 99% similarity and then create a classifier?
The command is as follows:
qiime feature-classifier extract-reads \
--i-sequences 2024.09.backbone.full-length.fna.qza \
--p-f-primer GTGCCAGCMGCCGCGGTAA \
--p-r-primer GGACTACHVGGGTWTCTAAT \
--p-min-length 400 \
--p-max-length 500 \
--o-reads v34.ref-seqs.qza \
--p-n-jobs 8
qiime feature-classifier fit-classifier-naive-bayes \
--i-reference-reads v34.ref-seqs.qza \
--i-reference-taxonomy 2024.09.backbone.tax.qza \
--o-classifier 2024.09.v34-classifier.qza
Thanks!