Hi,
hoping someone can help me work out what's happening. New to this and completely stumped. I ran the RESCRIPt process to get a V1-V3 classifier (SILVA 138.1) and noticed that I lost everything except bacteria. I am looking at cattle samples and they most definitely have Archaea in their gut.
I reran the RESCRIPt workflow and checked the .qzv as I went along to check where I lost the taxa.
After the 'qiime rescript dereplicate' step, I can see Archaea in the qzv. Once I run the 'feature-classifier fit-classifier-naive-bayes' and 'qiime feature-classifier classify-sklearn' steps the next qzv has nothing but bacteria.
Not exactly sure what I'm doing wrong - it may be as simple as a missed flag or a misunderstanding of the workflow. Hoping someone can help. Don't know how to debug things other than checking the qzv.
Below are the script steps and excerpts of what is seen in the qzv
Dereplicate steps:
qiime rescript dereplicate
--i-sequences silva-138.1-ssu-nr99-seqs-filt.qza
--i-taxa silva-138.1-ssu-nr99-tax.qza
--p-mode 'uniq'
--p-threads 12
--o-dereplicated-sequences silva-138.1-ssu-nr99-seqs-derep-uniq.qza
--o-dereplicated-taxa silva-138.1-ssu-nr99-tax-derep-uniq.qza
--verbose
qiime metadata tabulate
--m-input-file silva-138.1-ssu-nr99-tax-derep-uniq.qza
--o-visualization silva-138.1-ssu-nr99-tax-derep-uniq.qzv
Archaea present at the end of this step
Then I run the next two steps and lose all Archaea in the classifier before I even make the amplicon region-specific classifier.
qiime feature-classifier fit-classifier-naive-bayes
--i-reference-reads silva-138.1-ssu-nr99-seqs-derep-uniq.qza
--i-reference-taxonomy silva-138.1-ssu-nr99-tax-derep-uniq.qza
--o-classifier silva-138.1-ssu-nr99-classifier-full.qza
--verbose
qiime feature-classifier classify-sklearn
--i-classifier silva-138.1-ssu-nr99-classifier-full.qza
--i-reads 04dada2_output/rep_seqs.qza
--o-classification testtaxonomy.qza
--verbose
qiime metadata tabulate
--m-input-file testtaxonomy.qza
--o-visualization testtaxonomy.qzv
I've tried to work it out myself but I don't know enough about how the data is being processed to identify exactly what is going wrong (or what I did wrong).
Thanks for your help.
Rachele
Not sure what other info is needed - below is my system and qiime info:
Running on Windows 11
Processor Intel(R) Core(TM) i7-10875H CPU @ 2.30GHz 2.30 GHz
Installed RAM 32.0 GB (31.8 GB usable)
WSL2
Distributor ID: Ubuntu
Description: Ubuntu 20.04.6 LTS
Conda version: conda 23.7.2
(qiime2-2023.5):~$ qiime info
System versions
Python version: 3.8.16
QIIME 2 release: 2023.5
QIIME 2 version: 2023.5.1
q2cli version: 2023.5.1
Installed plugins
alignment: 2023.5.0
composition: 2023.5.0
cutadapt: 2023.5.1
dada2: 2023.5.0
deblur: 2023.5.0
demux: 2023.5.0
diversity: 2023.5.1
diversity-lib: 2023.5.0
emperor: 2023.5.0
feature-classifier: 2023.5.0
feature-table: 2023.5.0
fragment-insertion: 2023.5.0
gneiss: 2023.5.0
longitudinal: 2023.5.0
metadata: 2023.5.0
phylogeny: 2023.5.0
quality-control: 2023.5.0
quality-filter: 2023.5.0
sample-classifier: 2023.5.0
taxa: 2023.5.0
types: 2023.5.0
vsearch: 2023.5.0