Skip to content

Manage lifecycle of compute-domain-daemon's service account outside of helm #586

@klueska

Description

@klueska

At present, if a DRA driver gets helm uninstalled, the service account created for any running daemonsets spawned by the compute domain controller gets removed. This means that the daemonset will fail to access any API service objects during the time that the DRA driver is uninstalled. In general, this only really affect upgrades / downgrades of the DRA driver, but makes things especially tricky when switching namespaces across an upgrade in the context of the GPU Operator. To fix this, we need to move the lifecylce management of this service account into the controller so that it gets created / deleted alongside the daemonset itself.

Metadata

Metadata

Assignees

Labels

robustnessissue/pr: edge cases & fault tolerancesecurity

Type

No type

Projects

Status

Backlog

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions