-
Notifications
You must be signed in to change notification settings - Fork 412
Open
Labels
bugIssue/PR to expose/discuss/fix a bugIssue/PR to expose/discuss/fix a bugneeds-triageissue or PR has not been assigned a priority-px labelissue or PR has not been assigned a priority-px label
Description
Describe the bug
When upgrading from GPU Operator 25.3.4 to 25.10.0, having the GDS option enabled in both versions, the nvidia-fs-ctr container fails to build the module due to gcc-12 not being found. The binary is available in the container under /run/nvidia/driver/usr/bin but this isn't part of the PATH.
To Reproduce
Install GPU Operator 25.10.0 and enable GDS with gds.enabled: true. Notice the nvidia-fs-ctr container of the driver pod goes into crashloop.
Expected behavior
The gcc binaries should be available by including /run/nvidia/driver/usr/bin in the PATH.
If I manually edit the args parameter for this container in the daemonset to augment the PATH variable, it works:
args:
- until [ -d /run/nvidia/driver/usr/src ] && lsmod | grep nvidia; do echo Waiting for nvidia-driver to be installed...; sleep 10; done; PATH=/run/nvidia/driver/usr/bin:$PATH; exec nvidia-gds-driver install
Environment (please provide the following information):
- GPU Operator Version: v25.10.0
- OS: Ubuntu22.04
- Kernel Version: 6.5.0-generic
- Container Runtime Version: containerd 2.1.4
- Kubernetes Distro and Version: Kubeadm 1.33.4
Metadata
Metadata
Assignees
Labels
bugIssue/PR to expose/discuss/fix a bugIssue/PR to expose/discuss/fix a bugneeds-triageissue or PR has not been assigned a priority-px labelissue or PR has not been assigned a priority-px label