Prerequisites
Feature Summary
Service to perform remediation of node-level issues (e.g. rebooting the node, restarting some process on the host, or addressing hardware failures that require the node to be taken out of the Kubernetes cluster and return to the managed service operator)
Problem/Use Case
Using the project description as a guiding principle, currently NVSentinel does provide this capability.
Proposed Solution
Implementation TBD
Component
New Component