qiime feature-classifier fit-classifier-naive-bayes \ ... KILLED

devonorourke · September 1, 2020, 1:16pm

I can get a similar warning when running QIIME 2020.6, but it doesn't crash the program. My guess is that you're running out of memory - I work with an entirely different marker gene, but generally require about 70-100 GB RAM to complete the fit-classifier jobs. This can vary depending on the number and length of reference sequences, of course. There must be folks on the forum that have a good estimate for how much memory you'll need to do this ( calling on @SoilRotifer @Nicholas_Bokulich...)

You might also not need to bother with this - have you checked out the RESCRIPt tutorial on how to get SILVA data? Those folks have really simplified the process, and it might save you some headaches in trying to format the dataset yourself.

In summary:

If you really want to build your own SILVA classifier, start by throwing way more memory at the process if you have it. Like 50 GB RAM, and see if the job still fails after 1 minute, and whether you get an "out of memory" error. If you don't have that much memory, you can always do it in a cloud compute environment like AWS (or Azure, Google, etc.).
You might be able to avoid all of the previous point by gathering pre-formatted data from RESCRIPt.

Good luck!