Skip to content

Driver init fails in air-gapped clusters due to hard-coded mount of Red Hat subscription repo config #1501

@changhyuni

Description

@changhyuni

Summary

When deploying GPU Operator in an air-gapped (offline) cluster the nvidia-driver-daemonset init container fails to start.
Root cause: the driver image ships with a public YUM repo enabled by default, which triggers yum errors in offline environments.
Additionally, the pod spec tries to mount /etc/yum.repos.d/redhat.repo (HostPath, File) but the file is absent, so the kubelet rejects the volume with hostPath type check failed.

Node OS: RHEL 8.10

We rebuilt the driver image and **removed /etc/yum.repos.d/redhat.repo

volMountSubscriptionName := fmt.Sprintf("subscription-config-%d", num)

Related issue

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugIssue/PR to expose/discuss/fix a bug

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions