Apptainer
Apptainer Overview
Apptainer, formerly known as Singularity, is a container system for shared environments that allows you to bundle your entire environment into a single file that can be run on almost any system (including all CSaW cluster systems). One of its great features is that it allows you run many Docker images that have become popular among many computing researchers who are used to working on their personal systems and want to move their environment to a different system.
This document will walk you through the basics of pulling or building a container, and running it in the CSaW cluster environment. For full Apptainer documentation, please see the Apptainer User Guide.
Note
All Apptainer commands will need to be run from an Execute Point (EP), also known as a compute node. You can get an interactive shell on an EP using condor_submit’s interactive option:
condor_submit -interactive request_cpus=4 request_memory=4096 getenv=True
Using Existing Images
The easiest way to get started with Apptainer is to leverage the work someone else has done. Many times you will be able to find an image that someone has built that will contain the tool(s) you need to run your research.
Apptainer has access to both a vast library of images built specifically for it, and also to the even bigger library at Docker Hub (a popular repository with hundreds of thousands of Docker images).
Images can be pulled down to the cluster environment from various
sources using the apptainer pull
command. Alternatively, you can skip the pull
and just specify the
URL to the build
, exec
, run
, or shell
apptainer
commands to create an ephemeral container that will go away after the
command is run.
Caution
Be careful where you run the apptainer command. If it
needs to download an image, the image will be downloaded in your
current directory. Container images can be larger than your quota
allows, so it is suggested to store them somewhere in your shared
research directory (/cluster/research-groups/${PI}
) unless
you’re sure you have enough space.
From Docker Hub
Docker Hub has over 100,000 pre-built images. Unfortunately, not all Docker images can be run in Apptainer, but most HPC/HTC and scientific research ones run just fine. See the notes on Docker compatibility section for additional details. From the Docker Hub web site you can search through thousands of pre-built images. Be mindful that not all containers can be trusted, and be sure to favor containers that are marked as “Trusted Content” either via the “Docker Image Official”, “Verified Publisher”, or “Sponsored OSS” badges.
Let’s pull a pre-built image from David Godlove (who is conveniently an Apptainer/Singularity developer!), “lolcow” which will display a fortune message in the terminal.
[USER@c-X-X ~]$ apptainer pull docker://godlovedc/lolcow
INFO: Converting OCI blobs to SIF format
INFO: Starting build...
Copying blob 8e860504ff1e done |
Copying blob d010c8cf75d7 done |
Copying blob 3b61febd4aef done |
Copying blob 7fac07fb303e done |
Copying blob 9fb6c798fa41 done |
Copying blob 9d99b9777eb0 done |
Copying config 38dc06177c done |
Writing manifest to image destination
2024/08/05 11:44:19 info unpack layer: sha256:9fb6c798fa41e509b58bccc5c29654c3ff4648b608f5daa67c1aab6a7d02c118
2024/08/05 11:44:19 warn rootless{dev/agpgart} creating empty file in place of device 10:175
2024/08/05 11:44:19 warn rootless{dev/audio} creating empty file in place of device 14:4
... CUT FOR SPACE ...
024/08/05 11:44:19 warn rootless{dev/urandom} creating empty file in place of device 1:9
2024/08/05 11:44:19 warn rootless{dev/zero} creating empty file in place of device 1:5
2024/08/05 11:44:20 info unpack layer: sha256:3b61febd4aefe982e0cb9c696d415137384d1a01052b50a85aae46439e15e49a
2024/08/05 11:44:20 info unpack layer: sha256:9d99b9777eb02b8943c0e72d7a7baec5c782f8fd976825c9d3fb48b3101aacc2
2024/08/05 11:44:20 info unpack layer: sha256:d010c8cf75d7eb5d2504d5ffa0d19696e8d745a457dd8d28ec6dd41d3763617e
2024/08/05 11:44:20 info unpack layer: sha256:7fac07fb303e0589b9c23e6f49d5dc1ff9d6f3c8c88cabe768b430bdb47f03a9
2024/08/05 11:44:20 info unpack layer: sha256:8e860504ff1ee5dc7953672d128ce1e4aa4d8e3716eb39fe710b849c64b20945
INFO: Creating SIF file...
Here the docker:// prefix without a specific host following it is used to specify it should come from the default Docker Hub container registry. You could alternatively supply a hostname to tell it to use a different OCI compatible container registry, such as Quay.io or Github Container Registry if the project you’re using needs that instead.
Once you pull the image, it will create a new file in your current directory named lolcow_latest.sif. The image name is suffixed with _latest to denote which tag was used when creating it. In this case we did not specify a tag (tags can be used to denote specific versions), so it simply grabbed the latest.
Note
If reproducibility is important to your work, you can specify a tag
to pull by ending the URL with a :tag
such as
python:3.9.21-alpine3.21
, so that others can use the exact same
image you used. Otherwise six-months from now latest could be
updated and provide different results.
You might have also noticed all the warnings when it was converting the image to the Apptainer format. This is because Docker containers run with elevated privileges unlike in Apptainer.
Now that you have an image downloaded, let’s look at options to run it before we deploy it via HTCondor.
Running Containers
There are three ways you can use an Apptainer container: run
,
exec
, and shell
.
The commands run
and exec
are similar, with one exception:
run
will run whichever command is defined to be run by the
Apptainer image, while exec
will let you specify which command
to run.
The last command mentioned above, shell
, will simply give you a
shell into the container and let you interact with it. This is a good
way for testing before submitting non-interactive jobs to the
scheduler.
Run
Using the example image pulled above, we can view the difference
between all three options for running a container. First the run
command:
USER@c-X-X:~$ apptainer run lolcow_latest.sif
________________________
< Don't get to bragging. >
------------------------
\ ^__^
\ (oo)\_______
(__)\ )\/\
||----w |
|| ||
One of the neat features about Apptainer is that its images are also executable! So you can just run the image like it was a normal executable for your system:
USER@c-X-X:~$ ./lolcow_latest.sif
_________________________________________
< If you can read this, you're too close. >
-----------------------------------------
\ ^__^
\ (oo)\_______
(__)\ )\/\
||----w |
|| ||
Note
The run
command will also let you pass arguments to whatever
program is going to run, but the lolcow container does not take
any arguments. Here’s a small example of using it for future
reference:
apptainer run ./my_container.sif -arg1 -arg2
Exec
The run
command above ran the default program (or programs) when
the container started, but sometimes you need to run something
slightly differently in the same container. The previous run
example was invoking the shell pipeline: fortune | cowsay |
lolcat
. This is not directly possible using exec
, because we can
only run one program, and not a pipeline. However, instead of using
all of the pipeline we can run things individually, such as just
fortune or cowsay with our own pre-determined messages:
USER@c-X-X:~$ apptainer exec lolcow_latest.sif cowsay "Hello, world!"
_______________
< Hello, world! >
---------------
\ ^__^
\ (oo)\_______
(__)\ )\/\
||----w |
|| ||
Here we’ve run cowsay with our message “Hello, world!” (The "
’s
are important to pass the message as one argument to the program). We
could also run fortune on its own:
USER@c-X-X:~$ apptainer exec lolcow_latest.sif fortune
You will triumph over your enemy.
exec
let us run something from inside the container other than the
pre-defined program, which can be very useful if you have additional
testing tools or software in there. And for one final example, I’ll
show how you can run a pipeline of commands using exec
even though
I previously said you couldn’t do it:
USER@c-X-X:~$ apptainer exec lolcow_latest.sif sh -c 'fortune | cowsay | lolcat'
______________________________________
< Stay away from flying saucers today. >
--------------------------------------
\ ^__^
\ (oo)\_______
(__)\ )\/\
||----w |
|| ||
So the command we ran was sh
, which is a POSIX shell. Then we
passed it two options: -c
and 'fortune | cowsay | lolcat'
. The
-c
says the next argument should be run in the shell, and the next
argument was a pipeline of commands grouped together with quotes, so
it’s only one argument.
Now that we’ve seen the difference between run
and exec
let’s
look at using the container in an interactive manner, using the
shell
command.
Shell
Sometimes testing things interactively is the easiest way. Apptainer support this concept by allowing you to get a shell prompt inside the container. This lets you run multiple commands, look at available files, and see the environment as it exists when it’s run in order to troubleshoot any problems that occur.
The shell
command will simply start the container, and give you a
shell prompt into it. You’ll know you’re in your container when the
prompt changes to Apptainer>
instead of your
USER@cluster-node$:
prompt.
USER@c-X-X:~$ apptainer shell ./lolcow_latest.sif
Apptainer>
From here you can inspect, test, and interact the environment.
USER@c-X-X:~$ apptainer shell ./lolcow_latest.sif
Apptainer> whoami
USER
Apptainer> id
uid=######(USER) gid=######(USER) groups=######(USER),65534(nogroup) context=unconfined_u:unconfined_r:unconfined_t:s0-s0:c0.c1023
Apptainer> pwd
/cluster/home/USER
Apptainer> date
Fri Oct 1 10:24:07 PDT 2021
Apptainer> fortune | cowsay | lolcat
________________________________________
/ If you learn one useless thing every \
| day, in a single year you'll learn 365 |
\ useless things. /
----------------------------------------
\ ^__^
\ (oo)\_______
(__)\ )\/\
||----w |
|| ||
Now that we can run things interactively, we see that we are our normal user, have our normal user privileges, and our home directory is mounted in the container. Apptainer has preserved (almost) everything for us so we can access our data, but we’re running in our special container environment.
Other Options
Mounting Other Directories
In the Apptainer Shell example I
mentioned that Apptainer preserved almost everything from your
normal environment. What did it miss? Well, it got your UID/GID,
groups, some directories, and environment variables. But it didn’t get
the entire filesystem. This becomes a problem if you need access to
additional directory, since it’s not mounted into your container
environment. Fortunately there’s an easy fix for this: the --bind
argument.
Note
Here’s the complete list that comprises the “some directories” mentioned above according to the Apptainer documentation:
In the default configuration, the system default bind points are
$HOME
,/sys:/sys
,/proc:/proc
,/tmp:/tmp
,/var/tmp:/var/tmp
,/etc/resolv.conf:/etc/resolv.conf
,/etc/passwd:/etc/passwd
, and$PWD
.
In addition to the defaults above, all nodes in the CSaW cluster
are configured to also automatically mount the following:
/cluster
, /scratch
, /scratch_memory_backed
.
Because of the added mounts above, you will most likely not need to add additional bind mounts.
Using the --bind
argument can be done two ways, first where you
simply specify the path and it appears as specified, second where you
specify the path on the host, and a different path inside the
container. You may bind multiple directories by separating them with a
,
. Here’s an example of both options:
apptainer run --bind "/projects" lolcow_latest.sif
This will create a /projects
inside your container, and its
contents will be that of /projects
from the host that is running
the container.
Apptainer run --bind "/projects:/research_data"
lolcow_latest.sif
This second example takes the /projects
from the host that runs
the container, but puts it in as /research_data
instead. This can
be useful if your research application expects data to be available at
a fixed location, and you can’t adjust it.
Using a GPU
The Apptainer commands run
, exec
, and shell
can all accept
the --nv
argument that will automatically configure the container
to be able to use the GPU(s). HTCondor will automatically set the
appropriate environment variable (CUDA_VISIBLE_DEVICES) when you set
request_gpus >= 1 in your submission file so that your program will
use the correct device(s) on a multi-GPU system.
Submitting to HTCondor
Submitting a job using Apptainer is done in much the same way you submit any other job. You can use Apptainer in HTCondor’s Vanilla, Parallel, and Container universes. Using the Parallel universe will not be covered here, but is possible. Please see the Apptainer MPI documentation for an idea of the required work to begin running MPI inside a container, and please reach out to support for additional help.
Using the Vanilla Universe
Submitting to the Vanilla universe is easy, and only requires a handful of lines to run the container:
1universe = vanilla
2
3executable = apptainer
4arguments = run lolcow_latest.sif
5
6output = out.log
7error = err.log
8log = condor.log
9
10queue
This basic example is enough to submit and run the lolcow image, but
it could be even smaller because if you’re using the run
command
you can leverage the fact that the image itself is executable:
1universe = vanilla
2
3executable = lolcow_latest.sif
4
5output = out.log
6error = err.log
7log = condor.log
8
9queue
This is perhaps the smallest submission file possible that still has enough to get all of the things you need. Of course if you need to specify arguments to your container you can add back in the Arguments variable.
Because Apptainer is started by HTCondor it will have whatever resource limits you requested. The above examples will get a single core, and some small amount of memory space. It’s better to add the two lines in that request a reasonable amount of resources:
1universe = vanilla
2
3executable = lolcow_latest.sif
4
5output = out.log
6error = err.log
7log = condor.log
8
9request_cpus = 1
10request_memory = 1gb
11
12queue
This will allow your container to use multiple cores, and request a reasonable amount of memory to use. You should of course adjust your requests based on your needs, but please remember that we’re in a shared environment. If HTCondor has allocated the resources to you and don’t use them, it’s potentially taking away resources from someone else who could use them.
Using a GPU in the Vanilla Universe
Building on the previous example above, we can pretend that our
lolcow container needs a GPU to do its job. As mentioned above in
the using a gpu we need to pass the
--nv
flag to tell Apptainer to allow the container access to a
GPU, as well as request a GPU via HTCondor.
1universe = vanilla
2
3executable = apptainer
4arguments = run --nv lolcow_latest.sif
5
6output = out.log
7error = err.log
8log = condor.log
9
10request_cpus = 8
11request_memory = 32gb
12request_gpus = 1
13
14queue
Using the Container Universe
HTCondor has a universe dedicated to running containers. This can make using them even easier than the vanilla universe, because HTCondor will do the heavy lift of ensuring that the container is present and Apptainer is available on the host before starting it.
Leveraging a Container Registry
The easiest way to leverage the container universe is to specify the container image as a URL to the OCI repository, or use the default, Docker Hub.
1universe = container
2container_image = docker://godlovedc/lolcow
3
4executable = /bin/sh
5arguments = "-c 'fortune | cowsay | lolcat'"
6
7output = out.log
8error = err.log
9log = condor.log
10
11request_cpus = 1
12request_memory = 1gb
13
14queue
Note the executable is a path inside the container. HTCondor will start the container, then start a process inside the container, much like when the “exec” functionality is shown above. We specify the full path to the program inside the container, and then the arguments to run. The odd double quotes and single quotes is because we need to tell HTCondor that the arguments are a string, but there’s special meaning to the way it handles spaces for our command. Depending on your program, you will probably not need any weird quoting like this example does.
Transferring the Container Image
If you’ve built your own image but haven’t put it in an OCI registry, you will need to transfer the image to the execute point. Fortunately, this is only two extra lines. One line to tell HTCondor we want to transfer the container, and one tell HTCondor we want to ensure that files are transferred. The second line is only required because HTCondor knows that we have shared file storage and by default doesn’t want to transfer files.
1universe = container
2container_image = lolcow_latest.sif
3transfer_container = true
4should_transfer_files = yes
5
6executable = /bin/sh
7arguments = "-c 'fortune | cowsay | lolcat'"
8
9output = out.log
10error = err.log
11log = condor.log
12
13request_cpus = 1
14request_memory = 1gb
15
16queue
Using a GPU with the Container Universe
One of the nice things about the container universe is that because it invokes the Apptainer command for you, it also knows when it needs to tell the container about the GPU you requested. This means there is no need to do anything differently, except request the GPU.
1universe = container
2container_image = docker://godlovedc/lolcow
3
4executable = /bin/sh
5arguments = "-c 'fortune | cowsay | lolcat'"
6
7output = out.log
8error = err.log
9log = condor.log
10
11request_cpus = 8
12request_memory = 32gb
13request_gpus = 1
14
15queue
The above example will of start the container with access to 1 GPU, 8
cpu threads, and 32 GB of memory. The GPU that HTCondor has allocated
for you will be set with CUDA_VISIBLE_GPUS, which is not the same as
what is visible with nvidia-smi
. Apptainer is working on a fix
for this issue, but as of now it is not available.
And as above, please only request resource you intend to fully use. If your code does not use a GPU, please do not request one. They are far fewer GPUs available in the clusters than there are CPU cores.
Building Your Own containers
While there are hundreds of thousands of containers between Docker Hub and various registries, sometimes you can’t find one that does what you need it to do.
Writing an Apptainer Definition File (.def)
An Apptainer Definition File (.def) contains to parts: A header, and sections.
The header specifies what to use as the base of the image. This can be anything from a basic Linux environment, another .sif image, or a bare environment that you copy everything into.
The sections can be thought of as little scripts that define what to do when during the build process, as well as the runscript which defines what to do when the container is run.
The first line of the file will always contain Bootstrap: to define how to begin building the image. As to which bootstrap method to use, it depends on the use case. But it is perhaps easiest to use an image from the library, such that it’s one that you’re familiar with, and then modify it to suit your needs in the sections portion of the file. Please see the Header section of the Apptainer documentation to see all possible bootstrap options, as well as how to use them in their appendix sections.
Once the header is defined, the sections part will be one or more of the following sections:
To use the same example we’ve been using throughout this document, the lolcow image is built from the following:
1BootStrap: docker
2From: ubuntu:16.04
3
4%post
5 apt-get -y update
6 apt-get -y install fortune cowsay lolcat
7
8%environment
9 export LC_ALL=C
10 export PATH=/usr/games:$PATH
11
12%runscript
13 fortune | cowsay | lolcat
14
15%labels
16 Author GodloveD
The header is the two lines:
1BootStrap: docker
2From: ubuntu:16.04
This will use the existing Ubuntu 16.04 from Docker Hub as a base to work with. (Note: The Ubuntu 16.04 image is very old, and should not be used for anything modern. It’s kept here only as an example)
The sections then becomes the rest:
1%post
2 apt-get -y update
3 apt-get -y install fortune cowsay lolcat
4
5%environment
6 export LC_ALL=C
7 export PATH=/usr/games:$PATH
8
9%runscript
10 fortune | cowsay | lolcat
11
12%labels
13 Author GodloveD
The %post
section is the area where you can run commands to install software
inside the image while building it. Here we use Ubuntu’s apt
utility to update its cache, and install our three tools: fortune
,
cowsay
, and lolcat
.
The %environment section sets environment variables for when the container is run. While Apptainer will inherit your current environment values, these will set and/or ensure that they are set to their intended values. Here we ensure that our locale is defined as the C locale, and that the PATH is updated to include the directory that has the commands we intend to run.
The %runscript section defines what commands to run when you invoke the container. Either with apptainer run or ./lolcow.sif.
And finally, the %labels is where the David Godlove gives himself credit for creating this great example image.
There are many additional sections and things you can do in your images. Please see the full Apptainer Documentation for Definition Files to learn more. If you need help writing your .def files, please reach out to support for assistance.
Building the .sif File
The apptainer build command supports the fakeroot feature, which allows you to build container images without needed special privileges. We’ll re-use the above example to build our own lolcow.sif.
1BootStrap: docker
2From: ubuntu:16.04
3
4%post
5 apt-get -y update
6 apt-get -y install fortune cowsay lolcat
7
8%environment
9 export LC_ALL=C
10 export PATH=/usr/games:$PATH
11
12%runscript
13 fortune | cowsay | lolcat
14
15%labels
16 Author GodloveD
Saving the above as lolcow.def
, we can then issue the remote build
command, see the build status, and pull the built image.
apptainer build lolcow.sif lolcow.def
USER@c-X-X:~$ apptainer build lolcow.sif lolcow.def
INFO: User not listed in /etc/subuid, trying root-mapped namespace
INFO: The %post section will be run under fakeroot
INFO: Starting build...
Copying blob 3713021b0277 done |
Copying config 8a3cdc4d1a done |
Writing manifest to image destination
2024/08/05 16:17:51 info unpack layer: sha256:3713021b02770a720dea9b54c03d0ed83e03a2ef5dce2898c56a327fee9a8bca
INFO: Running post scriptlet
+ apt-get -y update
... CUT FOR SPACE ...
+ apt-get -y install fortune cowsay lolcat
... CUT FOR SPACE ...
done.
INFO: Adding labels
INFO: Adding environment to container
INFO: Adding runscript
INFO: Creating SIF file...
INFO: Build complete: lolcow.sif
This uploaded our lolcow.def file to their build server, showed us the build output (almost all of which was cut to keep the display small), and then downloaded the image to the requested lolcow.sif name.
Now we can run the image as we have the past:
USER@c-X-X:~$ apptainer run lolcow.sif
______________________________________
< Today is what happened to yesterday. >
--------------------------------------
\ ^__^
\ (oo)\_______
(__)\ )\/\
||----w |
|| ||
There are many more things you can do with remote builds, but that covers the basics. The full Apptainer documentation pages cover many of the additional options, but please contact support if you need additional help.
Building With a Personal System
While out of the scope of this document, it’s worth mentioning that due to the nature of Apptainer containing everything it needs in one file, you are able to build your .sif and upload it to be run on the cluster environment.
Unsupported Features
Apptainer supports many more features than are mentioned on this page. Unfortunately, some features can’t be used due to security concerns when operating in a shared environment because they require elevated privileges.
Fakeroot Containers
If you need to a run a process as root inside your container, it’s possible to map root’s ID inside the container back to that of your user outside the container. The process believes it is being run as root, and in reality it’s running as you. Depending on what the process needs the root privileges for this may or may not work as intended. The use of this feature requires specialized map files to be generated for each user in the environment, on each host that the container will run on. This can be done, but due to the number of users it will only be done on a case-by-case basis. If you think you need this feature, please reach out to support and we’ll see if we can help you get your code running.
Notes on Docker Compatibility
Because Docker initially runs as the root user (when a non-root user
is granted access to the Docker group they have root access), there
are certain features that it allows that are not compatible with
Apptainer. One of these features is dropping privileges to another
user. In a well crafted Docker file you may encounter the USER
directive to become a different user instead of root. Because
Apptainer runs as you, you can’t become someone else, only root can
do that. Docker containers that depend on this functionality will not
work without modification. Fortunately this should be a rare
occurrence with research containers, and is typically found in service
containers instead.
For a full list of things that function differently between Docker and Apptainer please see the best practices and troubleshooting section of the Apptainer user documentation.
Additional Notes / Tips
Apptainer Cache
The cluster environments have by default set your Apptainer cache directory to be inside the /scratch directory, which does not travel with you from node to node. All files in that directory will be deleted if they have not been accessed for more than a day.
Setting your cache path to a scratch directory helps ensure you do not fill up your home directory with parts of an Apptainer image.
Clearing Out Your Cache
If you’ve downloaded a bunch of Apptainer images and want to clean up the cache before the system cleans them up, you can run the apptainer cache clean command. Please see the linked documentation for additional information.