Open
Labels
bug (Something isn't working)
Description
Prerequisites
- I searched existing issues
- I can reproduce this issue
Bug Description
The platform connector panics with a nil pointer dereference (SIGSEGV) on shutdown.
Component
Health Monitor
Steps to Reproduce
- Start the platform connector
- Tail its logs
- Kill the pod and observe the panic in the logs
Environment
- NVSentinel version: 0.3.0
- Kubernetes version: 1.33
- Deployment method: helm
Logs/Output
panic: runtime error: invalid memory address or nil pointer dereference
[signal SIGSEGV: segmentation violation code=0x1 addr=0x10 pc=0x199d89d]
goroutine 30 [running]:
github.com/nvidia/nvsentinel/platform-connectors/pkg/connectors/kubernetes.(*K8sConnector).processHealthEvents(0xc000245bc0, {0x24b5a60, 0xc0002520f0}, 0x0)
	github.com/nvidia/nvsentinel/platform-connectors/pkg/connectors/kubernetes/process_node_events.go:327 +0x3d
github.com/nvidia/nvsentinel/platform-connectors/pkg/connectors/kubernetes.(*K8sConnector).FetchAndProcessHealthMetric(0xc000245bc0, {0x24b5a60, 0xc0002520f0})
	github.com/nvidia/nvsentinel/platform-connectors/pkg/connectors/kubernetes/k8s_connector.go:86 +0x7e
created by main.initializeK8sConnector in goroutine 1
	github.com/nvidia/nvsentinel/platform-connectors/main.go:109 +0x269