Question about Docker build and installation location

cjfields · January 17, 2024, 5:06pm

I wanted to ask a general question about the Docker image setup currently used for QIIME2. We've seen issues with it on our local HPC when running it under Singularity, and this may affect the portability of workflows if we want to use the container within a standard workflow system (Nextflow, Cromwell/WDL, etc.) on our cluster or elsewhere.

In short, it appears QIIME2 is installed in the container under /home/qiime2, but this conflicts with the file system layout used on our cluster, which is pretty common for HPC. More specifically, user- and group-specific directories are all under a /home root; for example, my user space is under /home/a-m/cjfields and our lab space is under /home/groups/hpcbio.

Singularity on many systems will automatically bind the host file system, so when it tries to access the pre-generated cache under the container's /home/qiime2 space, it is actually trying to access /home/qiime2 on the host file system (and gets a permission denied error). We can work around this in the short term, to some extent, by preventing the initial host file system binding and then remapping the file system to another location in the container, described briefly here:

This fixes our specific case: the cache can be accessed and downstream tools (feature-classifier, for example) can run. But it obviously causes issues in the longer term, as it also requires remapping the local file paths used as input (which we use a wrapper script for), and it doesn't solve the problem for others whose HPC resources have a similar file system structure and don't allow Docker but do provide singularity/apptainer.
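A minimal sketch of that kind of workaround, for illustration only. It assumes that `--contain` is enough to stop the host's /home from being mounted on a given site (some sites may need `--no-home` or a different option, depending on how Singularity is configured), and the image name, host directory, and `/data` mount point are hypothetical placeholders. The function only prints the command so it can be inspected before anything is run:

```shell
#!/usr/bin/env bash
# Sketch only: print (not run) a Singularity command that skips the default
# host mounts and binds one host directory to a path outside /home.
# The image name, host directory, and /data mount point are placeholders.

remapped_cmd() {
    local image="$1" host_data="$2"
    shift 2
    # --contain: do not share the host's $HOME, /tmp, etc. with the container
    # --bind SRC:DEST: expose the host data directory at /data in the container
    printf 'singularity exec --contain --bind %s:/data %s' "$host_data" "$image"
    # Append the command to run inside the container, one word at a time
    printf ' %s' "$@"
    printf '\n'
}

remapped_cmd qiime2.sif /home/groups/hpcbio/project qiime info
```

Input paths passed to QIIME2 then have to be written relative to `/data` rather than the host path, which is the remapping burden described above.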

One proposed solution our IT group suggested would be to have QIIME2 and the cache installed under another base-level directory that isn't commonly used for user-specific locations, for example /opt. But I am open to any suggestions.

Oddant1 · January 22, 2024, 6:09pm

Hello @cjfields,

QIIME 2 recently overloaded the term cache. There are two caches in QIIME 2 now: the more recently introduced artifact cache, and the cli cache that has been around for as long as QIIME 2. The cache causing issues here is the cli cache, which confuses me because you said this was working on older versions of QIIME 2, but the relevant code hasn't been touched in 8 years!

Unfortunately, there is no easy way to directly specify where the cli cache is created. It probably managed not to cause issues for the last 8 years, so it was never a priority. If you are using a conda environment, the cache gets put in the var directory in the conda environment. Otherwise, we use click.get_app_dir to get the directory the cache will be put in (click is the command line library we use for q2cli), and it is this click method that is giving you the home directory.

So, a couple of things. First, can you please tell me the last version of QIIME 2 this process worked for? I'd like to try to determine what changed because, as far as I can tell, neither our code nor the behavior of click.get_app_dir has changed for some time. Additionally, the code that controls where the cache ends up is here.

We can look into adding a way to directly configure where the cli cache is created, but I have no timeline for that. It never came up previously. Right now the best (albeit hacky) solution to control where this cache is put is probably going to be manually setting the CONDA_PREFIX envvar. You can try setting that to get the cache created in the /opt directory or something similar.
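As a rough illustration of the lookup described above (not the actual q2cli code), the decision can be sketched in shell. The function name `cli_cache_parent` is hypothetical, and the fallback branch assumes click's usual Linux behaviour of using `$XDG_CONFIG_HOME` if it is set and `~/.config` otherwise. The sketch stops at the parent directory, since the subdirectory q2cli creates beneath it is not stated here, and q2cli may apply further checks (for example on write permissions) that are ignored:

```shell
#!/usr/bin/env bash
# Sketch only: mimic the choice of parent directory for the cli cache.
# cli_cache_parent is a hypothetical helper, not part of q2cli.

cli_cache_parent() {
    if [ -n "${CONDA_PREFIX:-}" ]; then
        # Conda environment active: the cache goes under its var directory
        printf '%s/var\n' "$CONDA_PREFIX"
    else
        # Assumed click.get_app_dir behaviour on Linux: a per-user config
        # directory under the home directory unless XDG_CONFIG_HOME is set
        printf '%s\n' "${XDG_CONFIG_HOME-$HOME/.config}"
    fi
}

# With CONDA_PREFIX pointed at a hypothetical /opt location
( CONDA_PREFIX=/opt/qiime2; cli_cache_parent )

# Without CONDA_PREFIX, the home directory decides the location
( unset CONDA_PREFIX XDG_CONFIG_HOME; HOME=/home/qiime2; cli_cache_parent )
```

Under these assumptions the first call prints a path under /opt and the second a path under /home/qiime2, which is why overriding CONDA_PREFIX moves the cache away from /home. Inside a container the variable would have to be set in the container's environment, not just on the host.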

cjfields · February 19, 2024, 4:55pm

@Oddant1 Sorry, missed your reply.

I should clarify this based on checking back at the prior analyses. When I say it "worked" in older Docker versions: it did, but only for simple import/export of data as mentioned before in the forum link above. I went back and retested the same steps using older Docker images converted to Singularity but trying to use q2-feature-classifier, and ran into the same permissions issues as before.

My point with the above is that we would likely run into problems when incorporating QIIME2 into a standard workflow (e.g., WDL, Snakemake, Nextflow, etc.) on HPC systems that use the base /home directory for users, a fairly common practice.

cjfields · April 27, 2025, 3:33am

Hi @Oddant1 I wanted to check in on this, since we're trying to add a QIIME2-reliant step in our workflow. We're effectively stuck testing this on our local HPC due to the conflicts mentioned above. In the meantime we're fine with trying a forked version of Docker files to generate and test alternative containers, but I'm not sure where to look for documentation or the latest Dockerfiles that are used for generating these. Any pointers on that front would be greatly appreciated!

SoilRotifer · April 27, 2025, 3:31pm

Hi @cjfields,

I am not sure if this will help, but here is how I run the QIIME 2 docker containers on our HPC:

Load apptainer (seems to work better than singularity for me):

module load apptainer

Set the cache, then go to that directory:

export APPTAINER_CACHEDIR=/path_to_containers
cd "$APPTAINER_CACHEDIR"

Get the container:

apptainer pull docker://quay.io/qiime2/metagenome:2024.10

Make the container writable:

mkdir sandboxes
apptainer build --sandbox ./sandboxes/metagenome-2024.10 ./metagenome_2024.10.sif

From here you can either run an interactive session:

apptainer shell --writable ./sandboxes/metagenome-2024.10
source tab-qiime
qiime info

or run a job:

apptainer exec \
    --writable /your-path/sandboxes/metagenome-2024.10 \
        qiime moshpit classify-kraken2 \
            --i-seqs ./path-to-cache:megahit_contigs \
            --i-kraken2-db ./path-to-cache:kraken_pfp \
            --p-threads 56 \
            --p-confidence 0.6 \
            --p-minimum-base-quality 20 \
            --p-num-partitions 4 \
            --o-reports ./path-to-cache:kraken_reports_contigs \
            --o-hits ./path-to-cache:kraken_hits_contigs \
            --verbose

So far, I've not had any trouble running either qiime2-amplicon-2024.10 or qiime2-moshpit-2024.10 this way... I am not an expert on containers, but I just wanted to share what currently works for me.

-Mike

cjfields · May 15, 2025, 1:28pm

Hi @SoilRotifer, thanks for the suggestion! This would fix the problem locally, but the main issue has been the ability to include this in a workflow that is portable across our various local HPC resources and containerization options. We work with collaborators who use our workflows elsewhere, so having portable options for workflow deployment is important.

That said, it looks like this was addressed recently (note the /home paths are now gone). The change made it into the recent 2025.4 release, which I tested, and it seems to have fixed the issue! Thx @Oddant1 for that!

Oddant1 · May 15, 2025, 5:18pm

Sorry I forgot to specifically tell you about that here!