- Overview
- Getting Started
- Building a release container
- Running TAO Deploy on Jetson devices
- Contribution Guidelines
- License
TAO Toolkit is a Python package hosted on the NVIDIA Python Package Index. It interacts with lower-level TAO dockers available from the NVIDIA GPU Accelerated Container Registry (NGC). The TAO containers come pre-installed with all dependencies required for training. The output of the TAO workflow is a trained model that can be deployed for inference on NVIDIA devices using DeepStream, TensorRT and Triton.
This repository contains the implementation of all the deep learning components and networks that use the TensorRT backend. These routines are packaged as part of the TAO Toolkit TensorRT container in the Toolkit package. The source code here is compatible with TensorRT versions <= 8.5.3.
As soon as the repository is cloned, source the scripts/envsetup.sh script to check that the build environment has the necessary dependencies and that the required environment variables are set.

```sh
source scripts/envsetup.sh
```

We recommend adding this command to your local ~/.bashrc file, so that every new terminal instance receives it.
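For example, assuming the repository was cloned to ~/tao_deploy (a hypothetical path), the following appends the command to your ~/.bashrc:

```sh
# Hypothetical clone location; adjust the path to match your checkout.
echo "source ~/tao_deploy/scripts/envsetup.sh" >> ~/.bashrc
```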
Minimum system configuration:

- 8 GB system RAM
- 4 GB of GPU RAM
- 8 core CPU
- 1 NVIDIA GPU
- 100 GB of SSD space
Recommended system configuration:

- 32 GB system RAM
- 32 GB of GPU RAM
- 8 core CPU
- 1 NVIDIA GPU
- 100 GB of SSD space
| Software | Version |
|---|---|
| Ubuntu LTS | >=18.04 |
| python | >=3.8.x |
| docker-ce | >19.03.5 |
| docker-API | 1.40 |
| nvidia-container-toolkit | >1.3.0-1 |
| nvidia-container-runtime | 3.4.0-1 |
| nvidia-docker2 | 2.5.0-1 |
| nvidia-driver | >525.85 |
| python-pip | >21.06 |
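A quick way to sanity-check these prerequisites on the host; each command simply prints the installed version for comparison against the table above:

```sh
# Print installed versions of the key host dependencies.
lsb_release -rs        # Ubuntu release
python3 --version
docker --version       # docker-ce
nvidia-smi --query-gpu=driver_version --format=csv,noheader
pip3 --version
```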
In order to maintain a uniform development environment across all users, TAO Toolkit provides a base environment docker that has been built and uploaded to NGC for developers. To instantiate the docker, simply run the tao_deploy CLI. The usage of the command line launcher is shown below.
```
usage: tao_deploy [-h] [--gpus GPUS] [--volume VOLUME] [--env ENV] [--mounts_file MOUNTS_FILE] [--shm_size SHM_SIZE] [--run_as_user] [--tag TAG] [--ulimit ULIMIT] [--port PORT]

Tool to run the TAO Toolkit Deploy container.

optional arguments:
  -h, --help            show this help message and exit
  --gpus GPUS           Comma separated GPU indices to be exposed to the docker.
  --volume VOLUME       Volumes to bind.
  --env ENV             Environment variables to bind.
  --mounts_file MOUNTS_FILE
                        Path to the mounts file.
  --shm_size SHM_SIZE   Shared memory size for docker
  --run_as_user         Flag to run as user
  --tag TAG             The tag value for the local dev docker.
  --ulimit ULIMIT       Docker ulimits for the host machine.
  --port PORT           Port mapping (e.g. 8889:8889).
```
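The --mounts_file argument expects a JSON file describing host-to-container bind mounts. The snippet below is only a hypothetical illustration, assuming a schema similar to the TAO launcher's ~/.tao_mounts.json; check the launcher source for the exact format:

```sh
# Hypothetical mounts file; verify the exact schema against the launcher.
cat > ~/.tao_mounts.json <<'EOF'
{
    "Mounts": [
        {
            "source": "/path/to/data/on/host",
            "destination": "/path/to/data/on/container"
        }
    ]
}
EOF
```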
A sample command to instantiate an interactive session in the base development docker is shown below.

```sh
tao_deploy --gpus all \
           --volume /path/to/data/on/host:/path/to/data/on/container \
           --volume /path/to/results/on/host:/path/to/results/in/container
```

There will be situations where developers need to update third-party dependencies to newer versions, upgrade CUDA, etc. In such cases, please follow the steps below:
The base dev docker is defined in $NV_TAO_DEPLOY_TOP/docker/Dockerfile. The python packages required for TAO development are defined in $NV_TAO_DEPLOY_TOP/docker/requirements-pip.txt, and the third-party apt packages are defined in $NV_TAO_DEPLOY_TOP/docker/requirements-apt.txt. Once you have made the required changes, please update the base docker using the build script in the same directory.
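For example, a typical dependency bump is a one-line edit to the pinned version (the package name and versions below are hypothetical):

```sh
# Hypothetical version bump in the pip requirements file;
# rebuild the base docker afterwards so the change takes effect.
sed -i 's/^pillow==9.2.0$/pillow==9.5.0/' $NV_TAO_DEPLOY_TOP/docker/requirements-pip.txt
```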
The unified build script supports multiple platforms:
- `--x86`: Build for x86_64/AMD64 (uses `Dockerfile`)
- `--arm`: Build for ARM64 (uses `Dockerfile`)
- `--multiplatform`: Build for both x86_64 and ARM64 simultaneously (requires `--push`)
- `--l4t`: Build for Jetson/L4T devices (uses `Dockerfile.l4t`)
The x86, ARM, and multiplatform builds use the same Dockerfile, differentiated by the --platform flag. L4T uses a separate Dockerfile due to its different base image.
```sh
git submodule update --init --recursive
git submodule foreach git pull origin main
cd $NV_TAO_DEPLOY_TOP/docker

# Build for x86_64/AMD64 (default on x86 hosts)
./build.sh --build --x86

# Or build for ARM64
./build.sh --build --arm

# Or build for Jetson/L4T
./build.sh --build --l4t
```

The build script tags the newly built base docker with the username of the account on the user's local machine. Therefore, developers may test their new docker by using the tao_deploy command with the --tag option.
```sh
tao_deploy --tag $USER -- script args
```

Once you are sufficiently confident about the newly built base docker, please do the following:
1. Push the newly built base docker to the registry (specify the target platform):

   ```sh
   # For x86_64/AMD64
   bash $NV_TAO_DEPLOY_TOP/docker/build.sh --build --x86 --push

   # For ARM64
   bash $NV_TAO_DEPLOY_TOP/docker/build.sh --build --arm --push

   # For both x86_64 and ARM64 (multiplatform)
   bash $NV_TAO_DEPLOY_TOP/docker/build.sh --build --multiplatform --push

   # For Jetson/L4T
   bash $NV_TAO_DEPLOY_TOP/docker/build.sh --build --l4t --push
   ```

2. The above step produces a digest associated with the docker. This is a unique identifier for the docker, so please note it and update the platform-specific digest in the repository. The digests are stored in `$NV_TAO_DEPLOY_TOP/docker/manifest.json`, with separate entries for the `x86` and `arm` platforms. One way to look up a pushed image's digest is sketched after these steps.

3. Push your final updated changes to the repository so that other developers can leverage and sync with the new dev environment.
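A possible way to retrieve the digest of an image you just pushed (the image reference below is a placeholder for the registry path and tag used by the build script):

```sh
# Placeholder image reference; substitute the registry path and tag
# that build.sh pushed.
docker inspect --format='{{index .RepoDigests 0}}' <registry>/<image>:<tag>
```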
Please note that if for some reason you would like to force the docker to build without using the cache from the previous build, you may do so with the --force option.

```sh
bash $NV_TAO_DEPLOY_TOP/docker/build.sh --build --x86 --push --force
```

The TAO container is built on top of the TAO Deploy base dev container by building a python wheel for the nvidia_tao_deploy module in this repository and installing that wheel via the Dockerfile defined in release/docker/Dockerfile. The whole build process is captured in a single shell script, which may be run as follows:
```sh
git lfs install
git lfs pull
source scripts/envsetup.sh
cd $NV_TAO_DEPLOY_TOP/release/docker
./deploy.sh --build --wheel
```

In order to build a new docker, please edit the deploy.sh file in $NV_TAO_DEPLOY_TOP/release/docker to update the patch version, and re-run the steps above.
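Once the build finishes, a quick sanity check is to confirm the wheel is importable inside the resulting image (the image reference below is a placeholder for whatever tag deploy.sh produces):

```sh
# Placeholder image reference; use the tag produced by deploy.sh.
docker run --rm <release-image>:<tag> pip show nvidia_tao_deploy
```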
The released TAO Deploy container is built for the x86 platform, so it will not work on aarch64 platforms. In order to run TAO Deploy on Jetson devices, please instantiate the TensorRT-L4T docker hosted on NGC using the command below. Note that this TensorRT version may not match the version from the released TAO container.

```sh
sudo docker run -it --rm --net=host --runtime nvidia -e DISPLAY=$DISPLAY -v /tmp/.X11-unix/:/tmp/.X11-unix nvcr.io/nvidia/l4t-tensorrt:xx
```
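Once inside the container, one way to confirm which TensorRT version the image ships (assuming a Debian-based L4T image):

```sh
# Query the TensorRT packages bundled in the l4t-tensorrt image.
dpkg -l | grep -i tensorrt
```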
After the container is instantiated, run the steps below to install the TAO Deploy wheel and the corresponding dependencies on your system.

```sh
apt install libopenmpi-dev
pip install nvidia_tao_deploy==5.0.0.423.dev0
pip install https://files.pythonhosted.org/packages/f7/7a/ac2e37588fe552b49d8807215b7de224eef60a495391fdacc5fa13732d11/nvidia_eff_tao_encryption-0.1.7-cp38-cp38-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
pip install https://files.pythonhosted.org/packages/0d/05/6caf40aefc7ac44708b2dcd5403870181acc1ecdd93fa822370d10cc49f3/nvidia_eff-0.6.2-py38-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
```
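A quick check that the installation succeeded (the module name matches the wheel installed above):

```sh
# Verify the TAO Deploy package imports cleanly.
python3 -c "import nvidia_tao_deploy; print('nvidia_tao_deploy OK')"
```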
The TAO Toolkit Deploy backend is not accepting contributions as part of the TAO 5.0 release, but will be open to contributions in the future.

This project is licensed under the Apache-2.0 License.