[codex] fix orchestrator routed experts memory retention by samsja · Pull Request #2623 · PrimeIntellect-ai/prime-rl

samsja · 2026-05-25T02:11:28Z

Summary

Fixes bounded routed-experts memory retention in the orchestrator step loop.

The routed replay path copies each rollout's tokens["routed_experts"] payload into TrainingSample.routed_experts, but the original rollout sidecars stayed attached to train_rollouts until end-of-step cleanup. The results list from interleave_rollout(...) also kept packed samples alive after the orchestrator had already extracted train_examples.

This change:

clears routed-expert sidecars from rollout trajectories after conversion into training samples
deletes the intermediate results list once samples have been extracted
includes filter_df and timing_df in the explicit per-step cleanup before malloc_trim(0)

This reduces per-step RSS retention and peak memory in router replay runs. It does not claim to fix every possible monotonic production leak; ZMQ backpressure and monitor futures remain separate things to inspect if RSS still slopes upward.

Validation

uv run pytest tests/unit/orchestrator/test_trajectories.py tests/unit/orchestrator/test_batch.py -q
uv run ruff check src/prime_rl/orchestrator/orchestrator.py src/prime_rl/orchestrator/trajectories.py
uv run ruff format --check src/prime_rl/orchestrator/orchestrator.py src/prime_rl/orchestrator/trajectories.py
synthetic RSS probe comparing pre-patch-like retention vs patched cleanup:
- pre-patch-like retained about +53.3 MB after cleanup for the probe payload
- patched cleanup returned to baseline (+0.0 MB mean over baseline)

fix orchestrator routed experts memory retention

fe7a209

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[codex] fix orchestrator routed experts memory retention#2623

[codex] fix orchestrator routed experts memory retention#2623
samsja wants to merge 1 commit into
mainfrom
fix/orchestrator-routed-experts-memory

samsja commented May 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

samsja commented May 25, 2026

Summary

Validation

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant