Docker Engine Utility for NVIDIA GPUs

This branch contains version 2.0 of the nvidia-docker utility.

Warning: Version 2.0 is in an alpha state and is not intended for use in production systems.

Differences from 1.0

  • It does not require wrapping the Docker CLI.
  • It does not require starting a separate daemon.
  • GPU isolation is now achieved with the NVIDIA_VISIBLE_DEVICES environment variable.
  • GPU support can be enabled for any Docker image, not just the ones based on our official CUDA images.
  • Package repositories are available for Ubuntu and CentOS.
  • It uses a new implementation based on libnvidia-container.

Removing nvidia-docker 1.0

Version 1.0 of the nvidia-docker package must be cleanly removed before continuing.
You must stop and remove all containers started with nvidia-docker 1.0.

Ubuntu distributions

docker volume ls -q -f driver=nvidia-docker | xargs -r -I{} -n1 docker ps -q -a -f volume={} | xargs -r docker rm -f
sudo apt-get purge nvidia-docker

CentOS distributions

docker volume ls -q -f driver=nvidia-docker | xargs -r -I{} -n1 docker ps -q -a -f volume={} | xargs -r docker rm -f
sudo yum remove nvidia-docker
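
On either distribution, the first half of the pipeline above doubles as a sanity check: once all containers started with nvidia-docker 1.0 have been removed, it should print nothing. This is only a quick check, not part of the official procedure:

# Lists any remaining containers that still use a 1.0 volume (expected: no output)
docker volume ls -q -f driver=nvidia-docker | xargs -r -I{} -n1 docker ps -q -a -f volume={}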

Installation

Ubuntu distributions

  1. Install the repository for your distribution by following the instructions here.
  2. Install the nvidia-docker2 package and reload the Docker daemon configuration:
sudo apt-get install nvidia-docker2
sudo pkill -SIGHUP dockerd

CentOS distributions

  1. Install the repository for your distribution by following the instructions here.
  2. Install the nvidia-docker2 package and reload the Docker daemon configuration:
sudo yum install nvidia-docker2
sudo pkill -SIGHUP dockerd
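
On either distribution, you can then confirm that the nvidia runtime has been registered with the daemon. This is only a quick check; the exact output format depends on your Docker version:

# "nvidia" should be listed alongside runc
docker info | grep -i runtime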

Usage

NVIDIA runtime

nvidia-docker 2.0 registers a new container runtime with the Docker daemon.
You must select the nvidia runtime when using docker run:

docker run --runtime=nvidia --rm nvidia/cuda nvidia-smi

GPU isolation

Set the NVIDIA_VISIBLE_DEVICES environment variable on the container to select which GPUs are visible:

docker run --runtime=nvidia -e NVIDIA_VISIBLE_DEVICES=0 --rm nvidia/cuda nvidia-smi
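
NVIDIA_VISIBLE_DEVICES is not limited to a single index. The commands below are a sketch; see the nvidia-container-runtime documentation for the complete list of accepted values:

# Expose the first two GPUs
docker run --runtime=nvidia -e NVIDIA_VISIBLE_DEVICES=0,1 --rm nvidia/cuda nvidia-smi

# Expose every GPU on the host
docker run --runtime=nvidia -e NVIDIA_VISIBLE_DEVICES=all --rm nvidia/cuda nvidia-smi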

Non-CUDA images

Setting NVIDIA_VISIBLE_DEVICES will enable GPU support for any container image:

docker run --runtime=nvidia -e NVIDIA_VISIBLE_DEVICES=all --rm debian:stretch nvidia-smi
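
Conversely, if the variable is not set and the image does not set it itself (as the official CUDA images do), the runtime injects nothing. As a sketch of that point, the following command is expected to fail, since the stock Debian image does not ship nvidia-smi on its own:

docker run --runtime=nvidia --rm debian:stretch nvidia-smi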

Advanced

Backward compatibility

To ease the transition from 1.0 to 2.0, a bash script is provided at /usr/bin/nvidia-docker for backward compatibility.
It will automatically inject the --runtime=nvidia argument and convert NV_GPU to NVIDIA_VISIBLE_DEVICES.
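
For example, under 2.0 the two invocations below should be equivalent, assuming the compatibility script installed by the nvidia-docker2 package is in place:

# 1.0-style invocation through the compatibility wrapper
NV_GPU=0 nvidia-docker run --rm nvidia/cuda nvidia-smi

# What the wrapper expands it to
docker run --runtime=nvidia -e NVIDIA_VISIBLE_DEVICES=0 --rm nvidia/cuda nvidia-smi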

Environment variables

The behavior of the runtime can be modified through environment variables (such as NVIDIA_VISIBLE_DEVICES).
Those environment variables are consumed by nvidia-container-runtime and are documented here.
Our official CUDA images use default values for these variables.
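
For example, NVIDIA_DRIVER_CAPABILITIES controls which driver libraries and utilities are mounted into the container. The command below is a sketch with a typical value; refer to the nvidia-container-runtime documentation for all supported variables and values:

docker run --runtime=nvidia \
  -e NVIDIA_VISIBLE_DEVICES=all \
  -e NVIDIA_DRIVER_CAPABILITIES=compute,utility \
  --rm debian:stretch nvidia-smi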

Default runtime

The default runtime used by the Docker® Engine is runc. Our runtime can be made the default by configuring the Docker daemon with --default-runtime=nvidia, which removes the need to pass the --runtime=nvidia argument to docker run and is also the only way to get GPU access during docker build.
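
A minimal sketch of what /etc/docker/daemon.json could look like for this is shown below. The nvidia-docker2 package already registers the nvidia runtime in this file, so only the default-runtime entry needs to be added; reload the daemon configuration afterwards (e.g. with sudo pkill -SIGHUP dockerd as above):

# /etc/docker/daemon.json (sketch)
{
    "default-runtime": "nvidia",
    "runtimes": {
        "nvidia": {
            "path": "nvidia-container-runtime",
            "runtimeArgs": []
        }
    }
}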

Issues and Contributing

A signed copy of the Contributor License Agreement needs to be provided to digits@nvidia.com before any change can be accepted.