Hello
I'm running qiime2-2021.8 (Conda install) on some ITS data and following along with this tutorial: tutorial.
I did a "grep" of the primer sequences (and reverse-complement) my raw sequences and can see that I have read-through. I then ran cutadapt to remove these and am a little confused by the results.
I imported the data and can see that the reads are 300bps as they should be (pre-trimming):
qiime tools import \
--type 'SampleData[PairedEndSequencesWithQuality]' \
--input-path raw_data \
--input-format CasavaOneEightSingleLanePerSampleDirFmt \
--output-path analysis/seqs/combined_seqs.qza
qiime demux summarize \
--i-data analysis/seqs/combined_seqs.qza \
--o-visualization analysis/visualisations/combined_seq.qzv
combined_seq.qzv (318.8 KB)
I then ran cutadapt.
#Forward primer: CTTGGTCATTTAGAGGAAGTAA
#Reverse complement of forward primer: TTACTTCCTCTAAATGACCAAG
#reverse primer: GCTGCGTTCTTCATCGATGC
#reverse complement of the reverse primer: GCATCGATGAAGAACGCAGC
qiime cutadapt trim-paired \
--i-demultiplexed-sequences analysis/seqs/combined_seqs.qza \
--p-adapter-f GCATCGATGAAGAACGCAGC \
--p-front-f CTTGGTCATTTAGAGGAAGTAA \
--p-adapter-r TTACTTCCTCTAAATGACCAAG \
--p-front-r GCTGCGTTCTTCATCGATGC \
--output-dir analysis/seqs_trimmed
qiime demux summarize \
--i-data analysis/seqs_trimmed/trimmed_sequences.qza \
--o-visualization analysis/visualisations/trimmed_sequences.qzv
trimmed_sequences.qzv (325.5 KB)
After running cutadapt above, I was surprised that more sequence in my forward reads had not been trimmed as shown in Demultiplexed sequence length summary (based on what I'd seen in my grep command).
I then ran cutadapt with the parameters on their own to see what the results would be and they seem to trim the data accordingly (again looking at the Demultiplexed sequence length summary):
qiime cutadapt trim-paired --i-demultiplexed-sequences analysis/seqs/combined_seqs.qza --p-adapter-f GCATCGATGAAGAACGCAGC --output-dir analysis/seqs_trimmed_GCATCGATGAAGAACGCAGC
qiime demux summarize --i-data analysis/seqs_trimmed_GCATCGATGAAGAACGCAGC/trimmed_sequences.qza --o-visualization analysis/visualisations/trimmed_sequences_GCATCGATGAAGAACGCAGC.qza
trimmed_sequences_GCATCGATGAAGAACGCAGC.qza.qzv (324.6 KB)
qiime cutadapt trim-paired --i-demultiplexed-sequences analysis/seqs/combined_seqs.qza --p-front-f CTTGGTCATTTAGAGGAAGTAA --output-dir analysis/seqs_trimmed_CTTGGTCATTTAGAGGAAGTAA
qiime demux summarize --i-data analysis/seqs_trimmed_CTTGGTCATTTAGAGGAAGTAA/trimmed_sequences.qza --o-visualization analysis/visualisations/trimmed_sequences_CTTGGTCATTTAGAGGAAGTAA.qza
trimmed_sequences_CTTGGTCATTTAGAGGAAGTAA.qza.qzv (324.2 KB)
However, when I run the two parameters at the same time, it seems not to trim the reverse complement ( Demultiplexed sequence length summary).
qiime cutadapt trim-paired --i-demultiplexed-sequences analysis/seqs/combined_seqs.qza --p-adapter-f GCATCGATGAAGAACGCAGC --p-front-f CTTGGTCATTTAGAGGAAGTAA --output-dir analysis/seqs_trimmed_both-f
qiime demux summarize --i-data analysis/seqs_trimmed_both-f/trimmed_sequences.qza --o-visualization analysis/visualisations/trimmed_sequences_both-f.qza
trimmed_sequences_both-f.qza.qzv (324.7 KB)
Am I missing something? I thought that it should trim both the forward primer and the reverse complement of the reverse primer?
Thanks
Gayle