Skip to content

[Bug]: Health monitor doesn't exit if it can't publish messages and this sometimes leads to dropped messages #196

@lalitadithya

Description

@lalitadithya

Prerequisites

  • I searched existing issues
  • I can reproduce this issue

Bug Description

When the health monitors cannot publish to the UDS socket after all the retries have been exhausted, it doesn't exit with an error and in some cases, this causes drops in health messages.

The health monitor should exit with status code 1 if all retires for the publish have failed.

Component

Health Monitor

Steps to Reproduce

  1. Start the health monitor
  2. Disable the platform connector on that host
  3. Trigger a health event

Environment

  • NVSentinel version: all
  • Kubernetes version: all
  • Deployment method: helm

Logs/Output

No response

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions