DADA2 denoise-paired takes a long time to run

Hello everyone,

I used the following command to run DADA2 on the server, but it has been running for more than 24 hours (75 samples, 2.9 GB) and has not finished yet. I can't tell how far along it is. I look forward to your reply, thanks very much.

First, I activated QIIME 2 remotely from my laptop and submitted the job with the command "sbatch denoised-dada2-trimmed.sh".
The denoised-dada2-trimmed.sh script is as follows.
#!/bin/bash
#SBATCH -J denoise-dada2-trimmed
#SBATCH -o denoise-dada2-trimmed.log
#SBATCH -e denoise-dada2-trimmed.err
#SBATCH --partition=high
#SBATCH --mem=60000M
#SBATCH -c 24
#SBATCH -t 60:00:00

qiime dada2 denoise-paired \
  --i-demultiplexed-seqs trimmed-seqs.qza \
  --p-trunc-len-f 240 \
  --p-trunc-len-r 185 \
  --p-n-threads 10 \
  --p-n-reads-learn 10000 \
  --o-table table-dada2.qza \
  --o-representative-sequences rep-seqs-dada2.qza \
  --o-denoising-stats denoising-stats-data2.qza \
  --verbose
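
A side note on the script above: the job reserves 24 CPUs (`-c 24`) while DADA2 is told to use 10 threads. The following is a minimal sketch, assuming the cluster sets `SLURM_CPUS_PER_TASK` for jobs submitted with `-c` (usual SLURM behaviour, but not verified for this server), that passes the allocated count to `--p-n-threads`. The names `THREADS` and `cmd` are illustrative only, and whether more threads would shorten this particular run is not established here.

```shell
#!/bin/bash
# Use the CPU count SLURM allocated; fall back to 1 when run outside a job.
# (Assumes SLURM_CPUS_PER_TASK is set by the scheduler.)
THREADS="${SLURM_CPUS_PER_TASK:-1}"

# Same command as in the script, changing only the thread count.
cmd=(
  qiime dada2 denoise-paired
  --i-demultiplexed-seqs trimmed-seqs.qza
  --p-trunc-len-f 240
  --p-trunc-len-r 185
  --p-n-threads "$THREADS"
  --p-n-reads-learn 10000
  --o-table table-dada2.qza
  --o-representative-sequences rep-seqs-dada2.qza
  --o-denoising-stats denoising-stats-data2.qza
  --verbose
)

# Print the command for inspection; replace these two lines with "${cmd[@]}" to run it.
printf '%s ' "${cmd[@]}"
echo
```

Printing the command first makes it easy to check the thread count before committing to a long run.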

After one day, I opened my laptop and found the job was still running. The .log file is as follows.
R version 3.5.1 (2018-07-02)
DADA2: 1.10.0 / Rcpp: 1.0.2 / RcppParallel: 4.4.4

  1. Filtering ...........................................................................
  2. Learning Error Rates
    3268080 total bases in 13617 reads from 3 samples will be used for learning the error rates.
    2519145 total bases in 13617 reads from 3 samples will be used for learning the error rates.
  3. Denoise remaining samples ...............
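
If each dot in the log stands for one processed sample (an assumption about how the QIIME 2 DADA2 wrapper reports progress, not something confirmed in this thread), then comparing the dots on the "Filtering" line with those on the "Denoise remaining samples" line gives a rough idea of how far the job has got. A minimal sketch, where `count_dots` is a hypothetical helper and the default log name is the one set with `#SBATCH -o`:

```shell
#!/bin/bash
# Hypothetical helper: count the progress dots that follow a step name in the log.
# Assumes one dot per processed sample (not confirmed by the source).
count_dots() {
  local log="$1" step="$2"
  # Take the first line mentioning the step, drop everything up to and
  # including the step name (which also drops the "1." numbering dot),
  # then keep only the dots and count them.
  grep -F -- "$step" "$log" | head -n 1 | sed "s/.*$step//" | tr -cd '.' | wc -c | tr -d ' '
}

LOG="${LOG:-denoise-dada2-trimmed.log}"
if [ -f "$LOG" ]; then
  total=$(count_dots "$LOG" "Filtering")
  done_n=$(count_dots "$LOG" "Denoise remaining samples")
  echo "Denoised so far: $done_n of $total samples"
fi
```

Dividing the elapsed time by the number of denoised samples then gives a crude per-sample rate for estimating the remaining runtime.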

The .err file is as follows.
Loading required package: Rcpp
Duplicate sequences in merged output.
Duplicate sequences in merged output.
Duplicate sequences in merged output.

Hi @terren, please see the following note in the DADA2 docs on estimating runtime:

https://benjjneb.github.io/dada2/bigdata.html#how-long-does-it-take

