Hi guys,
I’m trying DADA2 on an artifact made up of 32 samples (~3.15 Gig) which starts to run fine but end up getting stuck on selfConsist step 8 for now over 24hrs. I’m wondering if its just stuck on loop somehow or maybe this expected length of time?
There is 12G of Ram and 7 cores dedicated to my Virtual Machine (running Docker and qiime 2017.7) for this and about 40 Gigs of free space on the SSD partition that is running on. Everything being run from the containing folder.
qiime dada2 denoise-paired \
--i-demultiplexed-seqs paired-end-demux.qza \
--p-trunc-len-f 270 \
--p-trunc-len-r 230 \
--p-n-threads 7 \
--o-table table-dada2.qza \
--o-representative-sequences rep-seqs.qza \
--output-dir unspec-results \
--verbose
Everything up till Step 8 was being processed in pretty good speed (2-3 hours) but then it gets stuck and has been stuck for close to 24 hours now. Just wondering if this is expected behavior or somehow stuck in a loop.
The system is working pretty hard with the fan going full throttle and close to %100 of the CPU being used.
Running external command line application(s). This may print messages to stdout and/or stderr. The command(s) being run are below. These commands cannot be manually re-run as they will depend on temporary files that no longer exist. Command: run_dada_paired.R /tmp/tmp4wxo5i3p/forward /tmp/tmp4wxo5i3p/reverse /tmp/tmp4wxo5i3p/output.tsv.biom /tmp/tmp4wxo5i3p/filt_f /tmp/tmp4wxo5i3p/filt_r 270 230 0 0 2.0 2 consensus 1.0 7 1000000 R version 3.3.1 (2016-06-21) Loading required package: Rcpp Warning messages: 1: multiple methods tables found for ‘arbind’ 2: multiple methods tables found for ‘acbind’ 3: replacing previous import ‘IRanges::arbind’ by ‘SummarizedExperiment::arbind’ when loading ‘GenomicAlignments’ 4: replacing previous import ‘IRanges::acbind’ by ‘SummarizedExperiment::acbind’ when loading ‘GenomicAlignments’ 5: multiple methods tables found for ‘left’ 6: multiple methods tables found for ‘right’ DADA2 R package version: 1.4.0 1) Filtering ............................... 2) Learning Error Rates 2a) Forward Reads Initializing error rates to maximum possible estimate. Sample 1 - 52120 reads in 25033 unique sequences. Sample 2 - 55364 reads in 26544 unique sequences. Sample 3 - 47665 reads in 22116 unique sequences. Sample 4 - 78223 reads in 35775 unique sequences. Sample 5 - 232027 reads in 92241 unique sequences. Sample 6 - 115288 reads in 48823 unique sequences. Sample 7 - 14755 reads in 8138 unique sequences. Sample 8 - 40034 reads in 20349 unique sequences. Sample 9 - 21558 reads in 10705 unique sequences. Sample 10 - 16360 reads in 8655 unique sequences. Sample 11 - 34443 reads in 15624 unique sequences. Sample 12 - 6287 reads in 4123 unique sequences. Sample 13 - 52638 reads in 20303 unique sequences. Sample 14 - 8961 reads in 4801 unique sequences. Sample 15 - 33394 reads in 15948 unique sequences. Sample 16 - 10576 reads in 6072 unique sequences. Sample 17 - 30462 reads in 15299 unique sequences. Sample 18 - 67234 reads in 28569 unique sequences. Sample 19 - 44296 reads in 21446 unique sequences. Sample 20 - 57836 reads in 26209 unique sequences. selfConsist step 2 selfConsist step 3 selfConsist step 4 selfConsist step 5 selfConsist step 6 selfConsist step 7 selfConsist step 8
When I tried to terminate the process ctrl + C
, it carries on with the following
selfConsist step 9 selfConsist step 10 Warning message: In dada(drpsF, err = NULL, selfConsist = TRUE, multithread = multithread) : Self-consistency loop terminated before convergence. 2b) Reverse Reads Initializing error rates to maximum possible estimate. Sample 1 - 52120 reads in 28794 unique sequences. Sample 2 - 55364 reads in 29168 unique sequences. Sample 3 - 47665 reads in 26478 unique sequences. Sample 4 - 78223 reads in 43292 unique sequences. Sample 5 - 232027 reads in 111034 unique sequences. Sample 6 - 115288 reads in 60240 unique sequences. Sample 7 - 14755 reads in 8901 unique sequences. Sample 8 - 40034 reads in 23578 unique sequences. Sample 9 - 21558 reads in 12732 unique sequences. Sample 10 - 16360 reads in 9052 unique sequences. Sample 11 - 34443 reads in 19197 unique sequences. Sample 12 - 6287 reads in 4175 unique sequences. Sample 13 - 52638 reads in 27308 unique sequences. Sample 14 - 8961 reads in 5817 unique sequences. Sample 15 - 33394 reads in 18903 unique sequences. Sample 16 - 10576 reads in 7271 unique sequences. Sample 17 - 30462 reads in 18194 unique sequences. Sample 18 - 67234 reads in 44234 unique sequences. Sample 19 - 44296 reads in 26943 unique sequences. Sample 20 - 57836 reads in 33664 unique sequences. selfConsist step 2 selfConsist step 3 selfConsist step 4 selfConsist step 5 selfConsist step 6 selfConsist step 7 selfConsist step 8 Convergence after 8 rounds. 3) Denoise remaining samples ........... The sequences being tabled vary in length. 4) Remove chimeras (method = consensus)
and now is continuing to run.
I’m worried I just forced it to abandon some steps and skip through them.
If it helps at all the unspec-results folder it created is currently still empty as well.
Any thoughts?