Thank you for explaining the changes between both versions of RESCRIPt. It's clear now.
You are right, I was not very clear in my comments about the effects of the options for --p-read-orientation
with classify-sklearn
. What I meant to say is that using Ā“--p-read-orientation reverse-complementĀ“ with q2.2024.5, did not improve the results (taxonmy is incorrect with all optoins). Of course, it impacted the q2.2024.2
taxonomy (with reverse-complement, wrong; auto and same,OK).
This sample is a mock community of 8 strains, that was mixed with a soil sample (that was for a paticular test; but with q2.2024.2 barplots qzv you will see the 8 main bacteria).
I am sending you all the files that you asked for along with the code I used.
q2.2024.2 qzv(s):
tabulate-seq_same-2024.2.qzv (346.0 KB)
tabulate-seq_auto-2024.2.qzv (346.5 KB)
tabulate-seq_reverse-complement-2024.2.qzv (341.4 KB)
taxa_barplot_same-2024.2.qzv (401.8 KB)
taxa_barplot_auto-2024.2.qzv (401.8 KB)
taxa_barplot_reverse-complement-2024.2.qzv (388.5 KB)
taxonomy_same-2024.2.qzv (1.3 MB)
taxonomy_auto-2024.2.qzv (1.3 MB)
taxonomy_reverse-complement-2024.2.qzv (1.3 MB)
q2.2024.5 qzv(s):
tabulate-seq_same-2024.5.qzv (348.1 KB)
tabulate-seq_auto-2024.5.qzv (344.0 KB)
tabulate-seq_reverse-complement-2024.5.qzv (344.1 KB)
taxa_barplot_same-2024.5.qzv (402.8 KB)
taxa_barplot_auto-2024.5.qzv (392.1 KB)
taxa_barplot_reverse-complement-2024.5.qzv (392.1 KB)
taxonomy_same-2024.5.qzv (1.3 MB)
taxonomy_auto-2024.5.qzv (1.3 MB)
taxonomy_reverse-complement-2024.5.qzv (1.3 MB)
Many thanks in advance for checking this!
############# q2.2024.2 #############
cd ~/qiime2/analysis/Starling2021/debug_classifiers/q2.2024.2
conda activate qiime2-amplicon-2024.2
Import data
qiime tools import
--type 'SampleData[PairedEndSequencesWithQuality]'
--input-path ./manifest_debug.tsv
--output-path demux-paired-end.qza
--input-format PairedEndFastqManifestPhred33V2
qiime demux summarize
--i-data demux-paired-end.qza
--o-visualization demux-paired-end.qzv
Cutadapt
qiime cutadapt trim-paired
--i-demultiplexed-sequences demux-paired-end.qza
--p-cores 30
--p-front-f ACGCGHNRAACCTTACC
--p-adapter-f TTGYACWCACYGCCCGT
--p-front-r ACGGGCRGTGWGTRCAA
--p-adapter-r GGTAAGGTTYNDCGCGT
--p-error-rate 0.1
--p-indels
--p-overlap 3
--p-match-read-wildcards True
--p-match-adapter-wildcards True
--o-trimmed-sequences trimmed.qza
--p-times 1
--p-discard-untrimmed
--verbose > cutadapt_log.txt
qiime demux summarize
--i-data trimmed.qza
--o-visualization trimmed.qzv
DADA2
qiime dada2 denoise-paired
--i-demultiplexed-seqs trimmed.qza
--p-trim-left-f 0
--p-trim-left-r 0
--p-trunc-len-f 205
--p-trunc-len-r 215
--output-dir dada2
--p-n-threads 30
--p-pooling-method pseudo
--verbose
qiime tools export --input-path dada2/denoising_stats.qza --output-path dada2/
results DADA2
qiime metadata tabulate
--m-input-file dada2/denoising_stats.qza
--o-visualization dada2/stats-dada2.qzv
ASVs stats:
qiime feature-table summarize
--i-table dada2/table.qza
--o-visualization dada2/table.qzv
TAXONOMY
mkdir taxo
check three --p-read-orientation options:
qiime feature-classifier classify-sklearn
--i-classifier ~/qiime2/analysis/databases/16S/qiime_2024.2/silva-138-99-nb-classifier.qza
--i-reads dada2/representative_sequences.qza
--o-classification taxo/taxonomy_auto.qza
--p-n-jobs 15
--p-read-orientation auto
--verbose
qiime feature-classifier classify-sklearn
--i-classifier ~/qiime2/analysis/databases/16S/qiime_2024.2/silva-138-99-nb-classifier.qza
--i-reads dada2/representative_sequences.qza
--o-classification taxo/taxonomy_same.qza
--p-n-jobs 15
--p-read-orientation same
--verbose
qiime feature-classifier classify-sklearn
--i-classifier ~/qiime2/analysis/databases/16S/qiime_2024.2/silva-138-99-nb-classifier.qza
--i-reads dada2/representative_sequences.qza
--o-classification taxo/taxonomy_reverse-complement.qza
--p-n-jobs 15
--p-read-orientation reverse-complement
--verbose
barplots qzv(s):
qiime taxa barplot --i-table dada2/table.qza
--i-taxonomy taxo/taxonomy_auto.qza
--m-metadata-file metadata_debug.tsv
--o-visualization taxo/taxa_barplot_auto.qzv
qiime taxa barplot --i-table dada2/table.qza
--i-taxonomy taxo/taxonomy_same.qza
--m-metadata-file metadata_debug.tsv
--o-visualization taxo/taxa_barplot_same.qzv
qiime taxa barplot --i-table dada2/table.qza
--i-taxonomy taxo/taxonomy_reverse-complement.qza
--m-metadata-file metadata_debug.tsv
--o-visualization taxo/taxa_barplot_reverse-complement.qzv
metadata tabulate taxo
qiime metadata tabulate
--m-input-file taxo/taxonomy_auto.qza
--o-visualization taxo/taxonomy_auto.qzv
qiime metadata tabulate
--m-input-file taxo/taxonomy_same.qza
--o-visualization taxo/taxonomy_same.qzv
qiime metadata tabulate
--m-input-file taxo/taxonomy_reverse-complement.qza
--o-visualization taxo/taxonomy_reverse-complement.qzv
rep seqs qzv(s):
qiime feature-table tabulate-seqs
--i-data dada2/representative_sequences.qza
--i-taxonomy taxo/taxonomy_auto.qza
--o-visualization taxo/tabulate-seq_auto.qzv
qiime feature-table tabulate-seqs
--i-data dada2/representative_sequences.qza
--i-taxonomy taxo/taxonomy_same.qza
--o-visualization taxo/tabulate-seq_same.qzv
qiime feature-table tabulate-seqs
--i-data dada2/representative_sequences.qza
--i-taxonomy taxo/taxonomy_reverse-complement.qza
--o-visualization taxo/tabulate-seq_reverse-complement.qzv
############# q2.2024.5 #############
cd ~/qiime2/analysis/Starling2021/debug_classifiers/q2.2024.5
conda activate qiime2-amplicon-2024.5
Import data
qiime tools import
--type 'SampleData[PairedEndSequencesWithQuality]'
--input-path ./manifest_debug.tsv
--output-path demux-paired-end.qza
--input-format PairedEndFastqManifestPhred33V2
qiime demux summarize
--i-data demux-paired-end.qza
--o-visualization demux-paired-end-2024.5.qzv
Cutadapt
qiime cutadapt trim-paired
--i-demultiplexed-sequences demux-paired-end.qza
--p-cores 30
--p-front-f ACGCGHNRAACCTTACC
--p-adapter-f TTGYACWCACYGCCCGT
--p-front-r ACGGGCRGTGWGTRCAA
--p-adapter-r GGTAAGGTTYNDCGCGT
--p-error-rate 0.1
--p-indels
--p-overlap 3
--p-match-read-wildcards True
--p-match-adapter-wildcards True
--o-trimmed-sequences trimmed.qza
--p-times 1
--p-discard-untrimmed
--verbose > cutadapt_log.txt
qiime demux summarize
--i-data trimmed.qza
--o-visualization trimmed-2024.5.qzv
DADA2
qiime dada2 denoise-paired
--i-demultiplexed-seqs trimmed.qza
--p-trim-left-f 0
--p-trim-left-r 0
--p-trunc-len-f 205
--p-trunc-len-r 215
--output-dir dada2
--p-n-threads 30
--p-pooling-method pseudo
--verbose
qiime tools export --input-path dada2/denoising_stats.qza --output-path dada2/
results DADA2
qiime metadata tabulate
--m-input-file dada2/denoising_stats.qza
--o-visualization dada2/stats-dada2-2024.5.qzv
ASVs stats:
qiime feature-table summarize
--i-table dada2/table.qza
--o-visualization dada2/table-2024.5.qzv
TAXONOMY
mkdir taxo
check three --p-read-orientation options:
qiime feature-classifier classify-sklearn
--i-classifier ~/qiime2/analysis/databases/16S/qiime_2024.5/silva-138-99-nb-classifier.qza
--i-reads dada2/representative_sequences.qza
--o-classification taxo/taxonomy_auto.qza
--p-n-jobs 15
--p-read-orientation auto
--verbose
qiime feature-classifier classify-sklearn
--i-classifier ~/qiime2/analysis/databases/16S/qiime_2024.5/silva-138-99-nb-classifier.qza
--i-reads dada2/representative_sequences.qza
--o-classification taxo/taxonomy_same.qza
--p-n-jobs 15
--p-read-orientation same
--verbose
qiime feature-classifier classify-sklearn
--i-classifier ~/qiime2/analysis/databases/16S/qiime_2024.5/silva-138-99-nb-classifier.qza
--i-reads dada2/representative_sequences.qza
--o-classification taxo/taxonomy_reverse-complement.qza
--p-n-jobs 15
--p-read-orientation reverse-complement
--verbose
barplots qzv(s):
qiime taxa barplot --i-table dada2/table.qza
--i-taxonomy taxo/taxonomy_auto.qza
--m-metadata-file metadata_debug.tsv
--o-visualization taxo/taxa_barplot_auto-2024.5.qzv
qiime taxa barplot --i-table dada2/table.qza
--i-taxonomy taxo/taxonomy_same.qza
--m-metadata-file metadata_debug.tsv
--o-visualization taxo/taxa_barplot_same-2024.5.qzv
qiime taxa barplot --i-table dada2/table.qza
--i-taxonomy taxo/taxonomy_reverse-complement.qza
--m-metadata-file metadata_debug.tsv
--o-visualization taxo/taxa_barplot_reverse-complement-2024.5.qzv
metadata tabulate taxo
qiime metadata tabulate
--m-input-file taxo/taxonomy_auto.qza
--o-visualization taxo/taxonomy_auto-2024.5.qzv
qiime metadata tabulate
--m-input-file taxo/taxonomy_same.qza
--o-visualization taxo/taxonomy_same-2024.5.qzv
qiime metadata tabulate
--m-input-file taxo/taxonomy_reverse-complement.qza
--o-visualization taxo/taxonomy_reverse-complement-2024.5.qzv
rep seqs qzv(s):
qiime feature-table tabulate-seqs
--i-data dada2/representative_sequences.qza
--i-taxonomy taxo/taxonomy_auto.qza
--o-visualization taxo/tabulate-seq_auto-2024.5.qzv
qiime feature-table tabulate-seqs
--i-data dada2/representative_sequences.qza
--i-taxonomy taxo/taxonomy_same.qza
--o-visualization taxo/tabulate-seq_same-2024.5.qzv
qiime feature-table tabulate-seqs
--i-data dada2/representative_sequences.qza
--i-taxonomy taxo/taxonomy_reverse-complement.qza
--o-visualization taxo/tabulate-seq_reverse-complement-2024.5.qzv