Dear all,
i know this is not the first post about using qiime2 with DADA2 to analyze sequences from different facilities and primers. However, after trying numerous possibilities and options (all found in this forum) I decided to write this post, as I don't know any other option anymore.
The situation is:
I have 2 sets of sequences (both 2x300bp; V3-V4) from activated sludege samples from the same waste water treatment plant. Sequencing was perfromed on MiSeq with the primer pair 341f and 802r for the first batch and 341f and 806r for the second batch.
After reading all posts about how to deal with problems like this I trimmed my reads with cutadapt:
qiime cutadapt trim-paired \
--i-demultiplexed-sequences /cluster/scratch/niederro/WEnzel/Uster_1/Uster_1.qza \
--p-cores 4 \
--p-front-f CCTACGGGNGGCWGCAG \
--p-front-r TACNVGGGTATCTAATCC \
--p-error-rate 0 \
--o-trimmed-sequences /cluster/scratch/niederro/WEnzel/Filtered_QZas/Uster_1_filt.qza
qiime cutadapt trim-paired \
--i-demultiplexed-sequences /cluster/scratch/niederro/WEnzel/Uster_2/Uster_2.qza \
--p-cores 4 \
--p-front-f CCTACGGGNGGCWGCAG \
--p-front-r GACTACHVGGGTATCTAATCC \
--p-error-rate 0 \
--o-trimmed-sequences /cluster/scratch/niederro/WEnzel/Filtered_QZas/Uster_2_filt.qza
For the 341/802 batch (#1) it worked fine and it removed ll primers. In the 2nd batch, however, no primers were attached anymore. Which left me with sequences from the 1st batch with ~285bp and sequences from the 2nd batch with 301 bp.
Uster_1_filt.qzv (298.9 KB) Uster_2_filt.qzv (296.6 KB)
I tried multiple different ways to trim them in the same way to uncover ASVs that are present in both batches. However, in most of the cases it worked for one batch but not the other and merging of the DADA2 outputs never led to a single common ASV.
Just one example:
qiime dada2 denoise-paired \
--i-demultiplexed-seqs /cluster/scratch/niederro/WEnzel/Uster_1/Uster_1_filt.qza \
--p-max-ee-f 2 \
--p-max-ee-r 3 \
--p-trunc-len-f 270 \
--p-trunc-len-r 205 \
--p-n-threads 4 \
--o-table /cluster/scratch/niederro/WEnzel/Uster_1/DADA2/Dada2_Uster_1.qza \
--o-representative-sequences /cluster/scratch/niederro/WEnzel/Uster_1/DADA2/DADA2_Uster1_rep-seqs.qza \
--o-denoising-stats /cluster/scratch/niederro/WEnzel/Uster_1/DADA2/Dada2_denoising-stats_Uster1.qza
qiime dada2 denoise-paired \
--i-demultiplexed-seqs /cluster/scratch/niederro/WEnzel/Uster_2/Uster_2.qza \
--p-trim-left-f 15 \
--p-trim-left-r 15 \
--p-max-ee-f 2 \
--p-max-ee-r 3 \
--p-trunc-len-f 285 \
--p-trunc-len-r 220 \
--p-n-threads 4 \
--o-table /cluster/scratch/niederro/WEnzel/Consensus_final_choices/Final_Final/Dada2_Uster_2.qza \
--o-representative-sequences /cluster/scratch/niederro/WEnzel/Consensus_final_choices/Final_Final/DADA2_Uster2_rep-seqs.qza \
--o-denoising-stats /cluster/scratch/niederro/WEnzel/Consensus_final_choices/Final_Final/Dada2_denoising-stats_Uster2.qza
Next I tried to take the raw sequences with following DADA2 parameters:
qiime dada2 denoise-paired \
--i-demultiplexed-seqs /cluster/scratch/niederro/WEnzel/Uster_1/Uster_1.qza \
--p-trim-left-f 17 \
--p-trim-left-r 18 \
--p-max-ee-f 2 \
--p-max-ee-r 3 \
--p-trunc-len-f 289 \
--p-trunc-len-r 220 \
--p-n-threads 4 \
--o-table /cluster/scratch/niederro/WEnzel/Uster_raw/Rawer/Dada2_Uster_1_new.qza \
--o-representative-sequences /cluster/scratch/niederro/WEnzel/Uster_raw/Rawer/DADA2_Uster1_rep-seqs_new.qza \
--o-denoising-stats /cluster/scratch/niederro/WEnzel/Uster_raw/Rawer/Dada2_denoising-stats_Uster1.new.qza
qiime dada2 denoise-paired \
--i-demultiplexed-seqs /cluster/scratch/niederro/WEnzel/Uster_2/Uster_2.qza \
--p-trim-left-f 17 \
--p-trim-left-r 18 \
--p-max-ee-f 2 \
--p-max-ee-r 3 \
--p-trunc-len-f 289 \
--p-trunc-len-r 220 \
--p-n-threads 4 \
--o-table /cluster/scratch/niederro/WEnzel/Uster_raw/Rawer/Dada2_Uster_2_raw_new.qza \
--o-representative-sequences /cluster/scratch/niederro/WEnzel/Uster_raw/Rawer/DADA2_Uster2_rep-seqs_new.qza \
--o-denoising-stats /cluster/scratch/niederro/WEnzel/Uster_raw/Rawer/Dada2_denoising-stats_Uster2_new.qza
Here I treid to make sure that (i) the primers get removed from batch 1 and (ii) sequences from batch 2 start at the same position as batch 1. Again, I was not able to find a single common ASV.
Dada2_Uster_1_new.qzv (1.1 MB) Dada2_Uster_2_new.qzv (597.3 KB)
Do you know what could be problem here or even better what could be the solution for this problem?
Thank you very much for any advice.
Robert