Hi,
Could I please have some advice to improve these reusults? This is an example using a subset of my samples.
I am working on 150bp ZBJ COI paired sequences. My pipeline is as follows
qiime cutadapt trim-paired
--p-cores 5
--i-demultiplexed-sequences ZBJ_dataimported.qza
--p-front-f CTGTCTCTTATACACATCTCCGAGCCCACGAGAC
--p-front-r CTGTCTCTTATACACATCTGACGCTGCCGACGA
--p-no-discard-untrimmed
--o-trimmed-sequences cutadapters.qza
qiime cutadapt trim-paired
--p-cores 10
--i-demultiplexed-sequences cutadapters.qza
--p-front-f AGATATTGGAACWTTATATTTTATTTTTGG
--p-front-r WACTAATCAATTWCCAAATCCTCC
--p-match-read-wildcards
--p-match-adapter-wildcards
--p-discard-untrimmed
--o-trimmed-sequences miseq.cut.noadapters.qza
cut results miseq.cut.noadapters.qzv (323.4 KB)
qiime dada2 denoise-paired
--i-demultiplexed-seqs miseq.cut.noadapters.qza
--p-trunc-len-f 105
--p-trunc-len-r 125
--p-n-threads 20
--o-table ZBJ_featuretable-miseq.noadapters.qza
--o-representative-sequences ZBJ_rep_seqs-miseq.noadapters.qza
--o-denoising-stats ZBJ_stats-miseq.qza
ZBJ_stats-miseq.qzv (1.2 MB)
qiime feature-classifier classify-consensus-vsearch
--i-query ZBJ_rep_seqs-miseq.noadapters.qza
--i-reference-reads arthropoda-ref-seqs-derep.qza
--i-reference-taxonomy arthropoda-ref-tax-derep.qza
--p-perc-identity 0.97
--p-threads 5
--o-classification taxonomy.qza
--o-search-results results.qza
My vsearch results only moslty only goes to genus. The reference database was downloaded from bold and dereplicated.
I read on a forum that the mimimum overlap needs to be aleast 20bp although the default setting is 12bp. The overlap in my sequences is 105+125 = 225 - 211 (ZBJ amplicon length) = 14. Therefore I have also tried Deblur as an alternative approach.
qiime vsearch merge-pairs
--i-demultiplexed-seqs miseq.cut.qza
--p-threads 7
--o-merged-sequences miseq.cut.joined.qza
miseq.cut.joined.qzv (301.8 KB)
qiime deblur denoise-other
--i-demultiplexed-seqs miseq.cut.joined.qza
--i-reference-seqs arthropoda-ref-seqs-derep.qza
--p-trim-length 156
--p-sample-stats
--p-jobs-to-start 4
--o-representative-sequences miseq.joined.rep-seqs.qza
--o-table deblur.joined.table.qza
--o-stats deblur-joined.stats.qza
deblur.stats.qzv (229.7 KB)
I carried out vsearch again with the same parameters and recieved the same results.
taxonomy.dada2.tsv (22.4 KB)
Any advice would be greatly appreciated, thank you.