Hey there Qiime2 Gurus:
I was running the following command when I got the resulting error:
qiime alignment mafft --i-sequences rep-seqs.qza --o-alignment mafft_aligned-rep-seqs.qza --p-n-threads 25 --verbose
Error:
inputfile = orig
1109268 x 151 - 90 d
nthread = 25
The number of sequences must be < 1000000
Please try the --parttree option for such large data.
Traceback (most recent call last):
File "/scif/apps/miniconda3/envs/qiime2-2018.6/lib/python3.5/site-packages/q2cli/commands.py", line 274, in call
results = action(**arguments)
File "", line 2, in mafft
File "/scif/apps/miniconda3/envs/qiime2-2018.6/lib/python3.5/site-packages/qiime2/sdk/action.py", line 232, in bound_callable
output_types, provenance)
File "/scif/apps/miniconda3/envs/qiime2-2018.6/lib/python3.5/site-packages/qiime2/sdk/action.py", line 367, in callable_executor
output_views = self._callable(**view_args)
File "/scif/apps/miniconda3/envs/qiime2-2018.6/lib/python3.5/site-packages/q2_alignment/_mafft.py", line 61, in mafft
run_command(cmd, aligned_fp)
File "/scif/apps/miniconda3/envs/qiime2-2018.6/lib/python3.5/site-packages/q2_alignment/_mafft.py", line 27, in run_command
subprocess.run(cmd, stdout=output_f, check=True)
File "/scif/apps/miniconda3/envs/qiime2-2018.6/lib/python3.5/subprocess.py", line 398, in run
output=stdout, stderr=stderr)
subprocess.CalledProcessError: Command '['mafft', '--preservecase', '--inputorder', '--thread', '25', '/scratch/local/qiime2-archive-ai5yqr95/2de18955-5810-434d-b796-cec9fda0a13b/data/dna-sequences.fasta']' returned non-zero exit status 1
Plugin error from alignment:
Command '['mafft', '--preservecase', '--inputorder', '--thread', '25', '/scratch/local/qiime2-archive-ai5yqr95/2de18955-5810-434d-b796-cec9fda0a13b/data/dna-sequences.fasta']' returned non-zero exit status 1
*According to this error, It's suggesting for me to use the --parttree option for my large data-set; however under the qiime alignment mafft --help command, I don't see a --parttree option.
I have also read this post: Alignment error message
which implies that 1) it could be a memory issue, but im allowing 50gb of memory, which the job used < 2gb before it threw up this error; and 2) this error is an issue w/ the mafft developers, but I didn't see the --parttree output in the error with those posts.
Any suggestions?