Description
According to the GPU Operator with KubeVirt documentation, we can assign labels like container, vm-passthrough, or vm-vgpu to nodes, and the GPU Operator deploys specific operands based on these labels.
Question:
If a node is initially labeled with container and the GPU Operator deploys corresponding components, will it automatically clean up the container-related components and deploy vm-passthrough-specific operands when an SRE modifies the node’s label to vm-passthrough? Conversely, if a node’s label is changed from vm-passthrough back to container, will the Operator also handle this transition automatically?
Context:
We need to confirm whether the GPU Operator’s controller actively reconciles node label changes to ensure only the correct operands remain on nodes, or if manual intervention (e.g., deleting old DaemonSets/Pods) is required after label updates.
Why This Matters
Understanding this behavior is critical for:
- Ensuring clean transitions between GPU workload modes (e.g., containerized vs. VM-passthrough).
- Avoiding resource conflicts or performance degradation due to orphaned components.