Bad quality - plot quality

I everyone!

I imported my all data and I reached the quality scores plot. Here, it seems to me that my sequences are terrible... However, I am not sure about how problematic this can be...

  1. What can cause this poor quality?
  2. Can anyone explain to me what should I do in this situation?
  3. Can this data still be used for analysis? I thought about joining read and reverse and try to denoising after that...

Thanks a lot!!
Jara

1 Like

Hello Jara,

Welcome to the forums! :qiime2:

It's always a shame when a run does not sequence well :crying_cat_face:

Overloading the Illumina flow cell. Underloading the flow cell. Poor amplification during PCR, and everything that can cause that.

Hopefully it's a sequencing issue and not a library amplification issue. I would ask if the sequencing core would be willing to resequence these samples, hopefully for free or maybe a discounted price. :money_with_wings:

Sure! I'm worried about the quality drop at base 20 in the forward read, but it's always worth a try...

Thanks a lot for the answers :person_gesturing_ok::+1:

2 Likes

In addition to the possible reasons already mentioned, the beginning of the forward reads makes me think that the sequencing library might not have had enough nucleotide diversity in some positions. When all bases are the same in a given sequencing cycle, the instrument software has trouble confidently calling bases (more info on this topic here). These could be primers at the start of your amplicon or conserved regions of the target gene. To avoid this, we usually add at least 5% PhiX or some library with a random distribution of nucleotides (a genomic library of a bacterial isolate, for example) in our MiSeq runs of metabarcode sequencing. If you arrange resequencing of these samples, it might be a good idea to check whether the nucleotide diversity issue is properly addressed in the sequencing setup.

2 Likes

Thanks for your help and the explanation! I imagined something like that...
Unfortunately, we will not be able to redo the analyses.
I tried using FLASH and Prinseq to do some cutting and joining of the forward and reverse , but I'm getting a very low value of input numeric (dada_stat). I will try not joined the forward and reverse and do a 'no aggressive' denoising.

1 Like

This topic was automatically closed 31 days after the last reply. New replies are no longer allowed.