Commit 415eb95

feat: nvidia driver extension
This patch deprecates the NVIDIA toolkit extension and introduces a new nvidia-driver extension (in production/lts versions and open source/proprietary flavors). The NVIDIA container toolkit must be installed independently, via a future Talos extension, the NVIDIA GPU Operator, or by the cluster administrator.

The extension depends on the new glibc extension (#473) and participates in its filesystem subroot by installing all the NVIDIA components in it. Finally, the extension runs a service that will bind mount this glibc subroot at `/run/nvidia/driver` and run the `nvidia-persistenced` daemon. This careful setup allows the NVIDIA GPU Operator to utilize this extension as if it were a traditional NVIDIA driver container.

Signed-off-by: Jean-Francois Roy <[email protected]>
1 parent 30eb717 commit 415eb95
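
The service behavior described in the commit message (bind mount the glibc subroot at `/run/nvidia/driver`, then run `nvidia-persistenced`) can be sketched as an ordered, dry-run plan. This is only an illustration and not the code under `./nvidia-gpu/nvidia-driver/service`: the `step` type, the `plan` function, the subroot path `/usr/local/glibc`, and the location of the `nvidia-persistenced` binary inside the subroot are all assumptions made for the example.

```go
package main

import (
	"fmt"
	"path/filepath"
)

// step is one action a hypothetical driver service would take, in order.
type step struct {
	Kind string   // "mkdir", "bind-mount", or "exec"
	Args []string // operands for the action
}

// plan returns the ordering described in the commit message: make sure the
// target exists, bind mount the glibc subroot there, then start the daemon.
// The binary location under the subroot is an assumption for illustration.
func plan(subroot, target string) []step {
	daemon := filepath.Join(subroot, "usr/bin/nvidia-persistenced")
	return []step{
		{Kind: "mkdir", Args: []string{target}},
		{Kind: "bind-mount", Args: []string{subroot, target}},
		{Kind: "exec", Args: []string{daemon}},
	}
}

func main() {
	// "/usr/local/glibc" is a placeholder; the commit does not state the path.
	for i, s := range plan("/usr/local/glibc", "/run/nvidia/driver") {
		fmt.Printf("%d. %s %v\n", i+1, s.Kind, s.Args)
	}
}
```

Keeping the plan separate from its execution makes the ordering easy to check without root privileges. A real service would carry out the bind-mount step with `mount(2)` and the `MS_BIND` flag, and would supervise the daemon process rather than just listing it.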

69 files changed: +505 -1126 lines changed

.kres.yaml

Lines changed: 4 additions & 4 deletions
@@ -34,12 +34,12 @@ spec:
  - nebula
  - nfsd
  - nut-client
- - nvidia-container-toolkit-lts
- - nvidia-container-toolkit-production
  - nvidia-fabricmanager-lts
  - nvidia-fabricmanager-production
- - nvidia-open-gpu-kernel-modules-lts
- - nvidia-open-gpu-kernel-modules-production
+ - nvidia-driver-lts
+ - nvidia-driver-production
+ - nvidia-driver-proprietary-lts
+ - nvidia-driver-proprietary-production
  - nvme-cli
  - panfrost
  - qemu-guest-agent

Makefile

Lines changed: 4 additions & 4 deletions
@@ -89,12 +89,12 @@ TARGETS += metal-agent
 TARGETS += nebula
 TARGETS += nfsd
 TARGETS += nut-client
-TARGETS += nvidia-container-toolkit-lts
-TARGETS += nvidia-container-toolkit-production
 TARGETS += nvidia-fabricmanager-lts
 TARGETS += nvidia-fabricmanager-production
-TARGETS += nvidia-open-gpu-kernel-modules-lts
-TARGETS += nvidia-open-gpu-kernel-modules-production
+TARGETS += nvidia-driver-lts
+TARGETS += nvidia-driver-production
+TARGETS += nvidia-driver-proprietary-lts
+TARGETS += nvidia-driver-proprietary-production
 TARGETS += nvme-cli
 TARGETS += panfrost
 TARGETS += qemu-guest-agent

go.work

Lines changed: 1 addition & 2 deletions
@@ -2,6 +2,5 @@ go 1.23.0
 
 use (
     ./examples/hello-world-service/src
-    ./nvidia-gpu/nvidia-container-toolkit/nvidia-container-runtime-wrapper
-    ./nvidia-gpu/nvidia-container-toolkit/nvidia-persistenced-wrapper
+    ./nvidia-gpu/nvidia-driver/service
 )

nvidia-gpu/nonfree/kmod-nvidia/lts/manifest.yaml

Lines changed: 0 additions & 10 deletions
This file was deleted.

nvidia-gpu/nonfree/kmod-nvidia/production/files/nvidia.conf

Lines changed: 0 additions & 4 deletions
This file was deleted.

nvidia-gpu/nonfree/kmod-nvidia/production/manifest.yaml

Lines changed: 0 additions & 10 deletions
This file was deleted.

nvidia-gpu/nonfree/kmod-nvidia/production/pkg.yaml

Lines changed: 0 additions & 31 deletions
This file was deleted.

nvidia-gpu/nvidia-container-toolkit/DEVELOPMENT.md

Lines changed: 0 additions & 30 deletions
This file was deleted.

nvidia-gpu/nvidia-container-toolkit/README.md

Lines changed: 0 additions & 91 deletions
This file was deleted.

nvidia-gpu/nvidia-container-toolkit/lts/manifest.yaml

Lines changed: 0 additions & 10 deletions
This file was deleted.
