Hello,
I am processing 16S rRNA V3-V4 data using QIIME2 2024.10 (miniconda qiime2-amplicon-2024.10). For the classification step, I am using GreenGene2 v2024.9 (2024.09.backbone.v4.nb.qza) with the qiime feature-classifier classify-sklearn command. However, the resulting taxonomy table/plot shows that most of the data is classified as "Unassigned." In some samples, up to 83% of the data was unassigned. For this I am using ASVs after Deblur.
However, when I tested the same reads with 2024.09.backbone.full-length.nb.qza, the data was successfully classified.
This same pattern occurred with other datasets denoised with DADA2. Additionally, these two datasets are not related.
The command I used:
-
With V4 classifier:
qiime feature-classifier classify-sklearn
--i-classifier 2024.09.backbone.v4.nb.sklearn-1.4.2.qza
--i-reads rep-seqs-deblur.qza
--p-n-jobs 6
--o-classification taxonomy_gg24_v4.qza -
With full lenght classifier:
qiime feature-classifier classify-sklearn
--i-classifier 2024.09.backbone.full-length.nb.sklearn-1.4.2.qza
--i-reads rep-seqs-deblur.qza
--p-n-jobs 6
--o-classification taxonomy_gg24_full.qza
Can someone help me understand what might be causing this? Can I use the data classified with the full-length classifier?
print of taxonomy.qzv after V4 classifier
print of taxonomy.qzv after full lenght classifier

