When I ran the classifier, all my ASVs are unassigned. However, when I export 2022.10.backbone.targeted.fna.qza into a fasta file , index it as a Blast db and execute blast of the representative sequences vs the 2022.10.backbone.targeted.fna db, I get hits covering the entire query ASV. I also get classifications using the silva database trained for the same region. I looked at the taxonomy of the top blast hits and observed that they share a family or order level, however, I haven't checked all of them. Running the classification using the full length 16S classifier, produced results. is there a way to identify the driving taxa that lead to the classification?
Thanks, @Alexandra_Bastkowska! Sorry, for the delay, I thought I had clicked reply on this message a few days ago
If I read that right, those BLAST results look to start at the first bp give or take of the backbone sequences. The forward primer specified in the seed of this thread is I believe 341F. Is it possible the wrong primer was used in the extract-reads step?
sorry,I just noticed your reply.
The blast db was based on the output of the extracted region:
qiime feature-classifier extract-reads
--i-sequences 2022.10.backbone.full-length.fna.qza
--p-f-primer CCTAYGGGRBGCASCAG --p-r-primer GGACTACNNGGGTATCTAAT
--p-read-orientation both
--o-reads 2022.10.backbone.targeted.fna.qza
--p-n-jobs 1
qiime tools export --input-path 2022.10.backbone.targeted.fna.qza --output-path sequences_targeted
makeblastdb -in sequences_targeted/dna-sequences.fasta -dbtype nucl
blastn -query rep-seqs/dna-sequences.fasta -db sequences_targeted/dna-sequences.fasta -outfmt 6
I did not blast the rep seq against the full-length backbone, but just the subsequence (starting from 341f) that I also used as an input for training the classifier. This way I wanted to check if I match anything at all in the trimmed sequences. Hence, the input for the blast search as well for the classifier training is the same sequence fragment. Hope this clarifies my approach. I am happy to provide my representative sequence file, if that helps