After qiime2 denoise-single my samples, some are gone

1115 · July 30, 2018, 1:18pm

Hi, all!!

while I'm struggling to do beta diversity, I found that there is a problem on my table after denoise with DADA2. I double checked my manifest file and metadata file, I reassure that there are 9samples in total in both of them with exact information and file path. But after denoise-single, 3 of them are gone. Only 6 samples remain in table after dada2 denoise and 3 are gone. Below is my code for denoise.

qiime dada2 denoise-single --i-demultiplexed-seqs demux.qza --p-trim-left 0 --p-trunc-len 550 --o-representative-sequences rep-seqs-dada2.qza --o-table table-dada2.qza --o-denoising-stats stats-dada2.qza --verbose

Plus:

I also tried denoise with deblur, it shows only 2 samples, 7 samples are gone. I totally have no idea what is wrong

Below is deblur code I input.

qiime quality-filter q-score --i-demux demux.qza --o-filtered-sequences demux-filtered.qza --o-filter-stats demux-filter-stats.qza qiime deblur denoise-16S --i-demux.qza --p-trim-length 550 --o-representative-sequences rep-seqs-deblur.qza --o-table table-deblur.qza --p-sample-stats --o-stats deblur-stats.qza

Please HELP

thermokarst · July 30, 2018, 11:37pm

Can you please provide your sequence quality plot from demux summarize? This will help us understand the nature of the read length and quality, prior to quality control. Thanks!

1115 · July 31, 2018, 12:54pm

Hi, @thermokarst! Thank you for your reply.

Here is the demux summairze.

Thank you!

thermokarst · August 2, 2018, 1:29pm

Thanks for sharing, @1115!

I would try truncating at around 400 nt. Once done, run feature-table summarize on the resulting table and share here, we can take a look at the results together.

PS - these reads are super long, what sequencing platform are they from?

1115 · August 3, 2018, 1:14am

Thank you for reply, @thermokarst!!

Your reply helps me a lot! May I ask you one more question related to quality?

I checked the quality at first and I agonized between leaving reads with low quality but long and reads with high quality but short. As you said, if I cut reads at 400nt, the overall quality is improved but half of them are gone. Doesn't it have problem? If it's up to the case, how can I determine it??

Thank you!

P.S the platform noticed on metadata is LS454. How long are mostly reads ?

thermokarst · August 6, 2018, 10:45pm

You could try experimenting with several sets of DADA2 parameters and then compare some downstream results computed with those different outputs, such as diversity and taxonomic classification to see how these parameters are impacting your particular study.

1115 · August 7, 2018, 2:47am

Appreciate with your reply, @thermokarst!

Thank you again, your wonderful reply!! I have one last question of denoising with dada2.

I tried dada2 denoise-single procedure and the results have all samples but most of them have 0 sequences per samples. Is it because standard quality I set for trimming is too high??

Thank you

Mehrbod_Estaki · August 7, 2018, 3:09am

Hi @1115,

Are you using dada2 denoise-single? You have 454 data so you want to be using the dada2 denoise-pyro instead since that is designed with 454 reads in mind.

system · September 7, 2018, 9:17am

This topic was automatically closed 31 days after the last reply. New replies are no longer allowed.