Time for DADA2 process

Vicky_Rodriguez · May 12, 2023, 12:10am

Hello, I'm Vicky again!

A few days ago I received my .fastq.gz files for the V3-V4 region; the sequencing was done on a NextSeq. The demultiplexing step took about 2 hours, and we decided to truncate the reads at 240 bp for DADA2. It has been running for more than 48 hours and has not finished yet.

I have 56 samples, and the DADA2 input file is 11.1 GB. Do you think the file size, the read quality, and the number of samples explain why it is taking so long? I'm working on a cluster or server, which has more memory than a local computer.

timanix · May 12, 2023, 1:36pm

Hello!
I am not surprised at all that a dataset 11 GB in size is taking more than 2 days to complete. A file that large means there are a lot of sequences to process.
What surprises me is that you have only 56 samples. My NextSeq datasets with a similar number of samples are usually much smaller (1-2 GB or less).
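
If it helps to check whether the size is plausible, one option is to count the reads in each demultiplexed file. Here is a minimal Python sketch, assuming standard four-line FASTQ records; the function names and the directory path in the comment are hypothetical and not part of QIIME 2 or DADA2.

```python
import gzip
from pathlib import Path


def count_reads(fastq_gz):
    """Count reads in a gzipped FASTQ file, assuming 4 lines per record."""
    with gzip.open(fastq_gz, "rb") as handle:
        n_lines = sum(1 for _ in handle)
    return n_lines // 4


def summarize(directory, pattern="*.fastq.gz"):
    """Return {file name: (size in GB, read count)} for matching files."""
    summary = {}
    for path in sorted(Path(directory).glob(pattern)):
        size_gb = path.stat().st_size / 1e9  # compressed size on disk
        summary[path.name] = (size_gb, count_reads(path))
    return summary


# Hypothetical usage (the directory name is an assumption):
# for name, (size_gb, n_reads) in summarize("demultiplexed_fastq").items():
#     print(f"{name}\t{size_gb:.3f} GB\t{n_reads} reads")
```

A per-sample read count is easier to compare with other runs than the total file size, because compressed size also depends on read length and quality scores.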

Best,

Vicky_Rodriguez · May 13, 2023, 11:25pm

Do you think there's a mistake in the FASTQ files that were loaded into my BaseSpace?

Thanks!

timanix · May 14, 2023, 5:03am

No, I was just surprised by the size of the library, but it is not necessarily an error.