Skip to content

[Bug]: Platform connector panics on shutdown #366

@lalitadithya

Description

@lalitadithya

Prerequisites

  • I searched existing issues
  • I can reproduce this issue

Bug Description

Platform connector panics on shutdown

Component

Health Monitor

Steps to Reproduce

  1. Start platform connector
  2. Tail logs
  3. Kill pod and notice the error

Environment

  • NVSentinel version: 0.3.0
  • Kubernetes version: 1.33
  • Deployment method: helm

Logs/Output

panic: runtime error: invalid memory address or nil pointer dereference
[signal SIGSEGV: segmentation violation code=0x1 addr=0x10 pc=0x199d89d]

goroutine 30 [running]:
github.com/nvidia/nvsentinel/platform-connectors/pkg/connectors/kubernetes.(*K8sConnector).processHealthEvents(0xc000245bc0, {0x24b5a60, 0xc0002520f0}, 0x0)
	github.com/nvidia/nvsentinel/platform-connectors/pkg/connectors/kubernetes/process_node_events.go:327 +0x3d
github.com/nvidia/nvsentinel/platform-connectors/pkg/connectors/kubernetes.(*K8sConnector).FetchAndProcessHealthMetric(0xc000245bc0, {0x24b5a60, 0xc0002520f0})
	github.com/nvidia/nvsentinel/platform-connectors/pkg/connectors/kubernetes/k8s_connector.go:86 +0x7e
created by main.initializeK8sConnector in goroutine 1
	github.com/nvidia/nvsentinel/platform-connectors/main.go:109 +0x269

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions