Add periodic cleanup for orphaned Celery pidbox queues #3085
majamassarini wants to merge 1 commit into packit:main
Conversation
Force-pushed: 98dfdbf to 69a55e2
Code Review
This pull request implements a maintenance task to clean up orphaned Celery pidbox reply queues in Redis by assigning a TTL to keys without one. The changes include a centralized Redis configuration utility, a new Prometheus metric for monitoring total Redis keys, and corresponding unit tests. Review feedback identifies a typo in a Redis environment variable and suggests using Redis pipelines to optimize the cleanup process by batching network operations.
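The pipeline optimization the review suggests could look roughly like the sketch below: instead of issuing a TTL check and an EXPIRE per key, each batch of scanned keys costs one round-trip to read all TTLs and one to expire the orphans. This is a hypothetical illustration assuming a redis-py style client; the function names, batch size, and TTL value are not taken from the PR.

```python
# Hypothetical sketch of the reviewer's pipeline suggestion.
# Assumes a redis-py style client (scan_iter, pipeline, ttl, expire).

def expire_orphans_pipelined(redis_client, match="*.reply.celery.pidbox",
                             ttl=3600, batch_size=500):
    """Expire pidbox reply queues that have no TTL, in batches."""
    expired = 0
    batch = []
    for key in redis_client.scan_iter(match=match, count=batch_size):
        batch.append(key)
        if len(batch) >= batch_size:
            expired += _expire_batch(redis_client, batch, ttl)
            batch = []
    if batch:
        expired += _expire_batch(redis_client, batch, ttl)
    return expired


def _expire_batch(redis_client, keys, ttl):
    # One round-trip to read every TTL in the batch...
    with redis_client.pipeline(transaction=False) as pipe:
        for key in keys:
            pipe.ttl(key)
        ttls = pipe.execute()
    # ...and one round-trip to expire only keys with no TTL (-1).
    orphans = [k for k, t in zip(keys, ttls) if t == -1]
    with redis_client.pipeline(transaction=False) as pipe:
        for key in orphans:
            pipe.expire(key, ttl)
        pipe.execute()
    return len(orphans)
```

With ~1,700 orphaned keys, this reduces the network cost from thousands of round-trips to a handful, at the price of slightly more code.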
Force-pushed: dc8a923 to 14d3b83
Build succeeded. ✔️ pre-commit SUCCESS in 1m 48s
Problem: Celery workers create pidbox (control) reply queues for worker management commands (inspect, ping, stats, etc.). These queues accumulate when workers crash or restart improperly, leading to:
- 1,693+ orphaned *.reply.celery.pidbox keys in production
- Keys with no TTL (TTL = -1) that persist indefinitely

Root cause: Celery's Redis transport does not provide a native way to set a TTL on pidbox reply queues when they are created. These queues are internal implementation details of Celery's broadcast/control mechanism, and there is no configuration option to expire them automatically.

Solution: heartbeat cleanup task. Since we cannot tell Celery to set a TTL on pidbox messages natively, we implement a periodic heartbeat task that:
- Runs nightly at 12:30 AM via Celery beat
- Scans for *.reply.celery.pidbox keys without a TTL
- Sets a 1-hour expiration on orphaned queues
- Tracks the total number of Redis keys via Prometheus for monitoring

Related to: packit/deployment#701
Should fix: packit#2983

Assisted-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Assisted-By: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
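The scan-and-expire step of the heartbeat task described above can be sketched as follows. This is a minimal illustration assuming a redis-py style client; the function name, constant names, and the 1-hour TTL constant mirror the description but are not the PR's actual code.

```python
# Minimal sketch of the nightly cleanup described in the commit message.
# Assumes a redis-py style client; names are illustrative.

PIDBOX_PATTERN = "*.reply.celery.pidbox"
ORPHAN_TTL_SECONDS = 3600  # 1-hour expiration, per the PR description


def cleanup_pidbox_queues(redis_client, pattern=PIDBOX_PATTERN,
                          ttl=ORPHAN_TTL_SECONDS):
    """Set a TTL on pidbox reply queues that have none.

    Intended to run from a Celery beat schedule (e.g. nightly at
    12:30 AM); returns the number of keys that were given a TTL.
    """
    expired = 0
    # SCAN iterates incrementally, so a large keyspace is not blocked.
    for key in redis_client.scan_iter(match=pattern):
        # TTL of -1 means the key exists but never expires.
        if redis_client.ttl(key) == -1:
            redis_client.expire(key, ttl)
            expired += 1
    return expired
```

Orphaned queues are only expired, not deleted outright, so a reply that is still being awaited by a live worker survives for another hour before Redis reclaims it.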
Force-pushed: 14d3b83 to d63d7dc
Build succeeded. ✔️ pre-commit SUCCESS in 1m 50s