Hey!
I have seen that this problem has come up several times in your forum and I tried to read through the different topics in order to solve the problem. Although, I unfortunately think I might need some different advice.
I am using QIIME2 2020.8 version. I am using this version on a centralized computing system so they have installed QIIME2. I am not sure how they have installed it unfortuantaly but I know that they use github easybuild to instal programs. All of the jobs are run in a queue system. I have successfully run this batch job:
#!/bin/bash -l
################### QIIME2 Assign Taxonomy ###################
# Section for defining queue-system variables:
#-------------------------------------
# SLURM-section
#SBATCH --account=nn9800k
#SBATCH --job-name=Assign_Taxonomy
#SBATCH --ntasks=2 --cpus-per-task=4
#SBATCH --mem-per-cpu=32G
#SBATCH --partition=bigmem
#SBATCH --time=10-0:0:0
#SBATCH --mail-type=ALL
##SBATCH -M hdschweitzer12@gmail.com
######################################
# Section for defining job variables and settings:
#-------------------------------------
# Command must be ran in same directory where 5_denoise_dada2_seq_file.qza or 12_denoise_dada2_minfreq60_ASVs.qza file is located
# Must ensure path to classifier is correct for the directory you are in
module load QIIME2/2020.8
export job_output_dir=/cluster/work/users/$USER/$SLURM_JOB_ID
# Creating aliases and moving files to scratch:
#submit=$SLURM_SUBMIT_DIR
#temp=$job_output_dir
#mkdir -p $temp
#cd $submit
#cp $input $temp
#cd $temp
# Running program, pay attention to command name:
# Assign_Taxonomy
time qiime feature-classifier classify-sklearn --i-reads 5_denoise_dada2_seq_file.qza --i-classifier ../../../Classifier_files/Classifiers_Silva138.1/18S/silva-138.1-ssu-nr99-139f_EukBr-classifier.qza --o-classification 14_Unfiltered_ASV_taxonomy.qza
qiime feature-classifier classify-sklearn --i-reads 12_denoise_dada2_minfreq60_ASVs.qza --i-classifier ../../../Classifier_files/Classifiers_Silva138.1/18S/silva-138.1-ssu-nr99-139f_EukBr-classifier.qza --o-classification 15_60Filtered_ASV_taxonomy.qza
qiime metadata tabulate --m-input-file 14_Unfiltered_ASV_taxonomy.qza --o-visualization 16_Unfiltered_ASV_taxonomy_visfile.qzv
qiime metadata tabulate --m-input-file 15_60Filtered_ASV_taxonomy.qza --o-visualization 17_60Filtered_ASV_taxonomy_visfile.qzv
# Cleaning up and moving files back to home if desirable:
#cp output.out $submitdir
#cp $input.changed $submitdir
exit 0
It ran successfully when I used a sequence file that was 19KB and 32KB against a trained classifier that was 12MB. But when I try to run a 76KB or a 205KB sequence file against a 160MB classifier I get this error:
Plugin error from feature-classifier:
** [Errno 28] No space left on device**
Debug info has been saved to /tmp/qiime2-q2cli-err-q0_5kqtx.log
If I create a temporary file I am not sure where I can send it. I am suppose to only use my project space. Seems that there is a 15GiB limit on /tmp on nodes. My question is what is the expected sizes of ouputs that is potentially placed in /tmp during calculations? How does the code behaves in relation to storage and file system usage? I believe this problem will have to be addressed with the managers of the centralized computing system but I am not completely sure what size files I am looking at. I was surprised to find that 15GB was not big enough for the tempoary file. This won't even be the largest file that I will run so I need to know what kind of sizes to expect in order to request access to more. Also, if this is not the issue please let me know how else I might be able to address it. Thanks!
Hannah