Hi everyone,
I’ve been encountering an issue with QIIME 2 while running the classify-sklearn command for taxonomy classification, and I’m hoping someone can help me figure this out. Thanks for your kindness for helping me doing the work.
1.My Setup:
CPU Intel i9-13900H (14 cores, 20 threads)
Memory 32 GB RAM
Disk: Over 100 GB free on Windows disk (D:)
Environment: WSL2 running Ubuntu 20.04 (with Python and QIIME 2 installed)
I ran the following command for taxonomy classification:
However, the process gets abruptly terminated with the message: Killed(four times)
3 Observations:
My D: disk (used for WSL data) lost ~30 GB of space after the failure, but I can’t figure out what files were created or where the disk space went.
I don't know why it turned out killed,
I’d sincerely appreciate any insights you can provide into why this issue might be occurring and how to resolve it. If there are logs or troubleshooting steps I can follow to identify the cause, I’d be happy to provide any additional information needed.
I apologize for interrupting you once again, but I continue to experience difficulties with my classifier workflow, and I would be extremely grateful for any further guidance you can offer.
I am running the following command on a rented Ubuntu cloud server (32 GB RAM, 16 cores):
Despite ample system resources, the process is abruptly terminated with the message “Killed.”
Any insights into why this might be happening—or suggestions for additional troubleshooting steps—would be greatly appreciated.Thank you very much for your patience and continued assistance.
Kind regards,
Thank you so much for your suggestion. Unfortunately, I don't have access to an HPC system or >64 GB of RAM at the moment—I'm limited to using a cloud server (maybe trying a larger one is necessary).
However, when I tried the version without singletons (unite_ver10_dynamic_all_19.02.2025-Q2-2024.10.qza), it didn't work with my 32 GB server; perhaps more RAM is necessary.
Additionally, how can I perform taxonomy classification on rare sequences without consuming too much RAM? Retaining these rare sequences is important for my analysis, so I'm in a bit of a bind.
Do you have any additional recommendations?
By the way I explored the use of the BLAST+ method for taxonomy classification of rare sequences. This approach fit my memory usage, making it suitable for my available resources.
That's what I was going to suggest next! I find the search based classifiers like blastn and vsearch work okay for my needs.
Unite with singletons and without singletons changes what's in the database, and I suppose how well it works on your input data. But all sequences from your input features will be retained with any Qiime2 classifier.
Thank you, Sir. Additionally, I have borrowed a server with sufficient capacity and will inform you if the classifier works. Anyway, thanks for your kind help again!