fix: support SAC multi-GPU replay IPC #628

Merged
TATP-233 merged 1 commit into main from fix/sac-multigpu-replay-ipc on Jun 21, 2026

@TATP-233

Collaborator

Summary

  • add a rank-local multi-GPU CPU-pinned replay pipeline for off-policy replay batches
  • route per-rank replay pack request/ready queues and shared host slots through the collector pack service
  • make the multi-GPU off-policy runner learner-class driven and use learner-provided initial parameter sync
  • gate this validation round to SAC and add regression coverage for ranked replay IPC
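
The request/ready queue and host-slot handshake described in the summary can be pictured with a small standard-library sketch. This is not the code from this PR: the names `pack_service`, `run_rank`, `NUM_RANKS`, `SLOTS_PER_RANK`, and `BATCH_SIZE` are hypothetical, threads stand in for the collector processes, and plain Python lists stand in for the CPU-pinned shared host slots. It only illustrates the assumed pattern in which each rank owns its own pair of queues and recycles a fixed set of slots.

```python
import queue
import threading

NUM_RANKS = 2        # hypothetical: one learner per GPU rank
SLOTS_PER_RANK = 2   # hypothetical: host staging slots owned by each rank
BATCH_SIZE = 4       # hypothetical: transitions per replay batch


def pack_service(rank, slots, request_q, ready_q):
    """Fill each requested slot with a batch, then announce it as ready."""
    step = 0
    while True:
        slot_id = request_q.get()
        if slot_id is None:  # sentinel: the learner has finished
            break
        # Stand-in for sampling from replay; real code would write into
        # pinned host memory shared between processes.
        slots[slot_id][:] = [(rank, step, i) for i in range(BATCH_SIZE)]
        ready_q.put(slot_id)
        step += 1


def run_rank(rank, num_batches):
    """Learner side: request slots, consume ready batches, recycle slots."""
    slots = [[None] * BATCH_SIZE for _ in range(SLOTS_PER_RANK)]
    request_q, ready_q = queue.Queue(), queue.Queue()  # rank-local queues
    worker = threading.Thread(
        target=pack_service, args=(rank, slots, request_q, ready_q)
    )
    worker.start()
    # Prime the pipeline so packing overlaps with consumption.
    for slot_id in range(SLOTS_PER_RANK):
        request_q.put(slot_id)
    consumed = []
    for _ in range(num_batches):
        slot_id = ready_q.get()
        consumed.append(list(slots[slot_id]))  # stand-in for the host-to-device copy
        request_q.put(slot_id)                 # hand the slot back for refilling
    request_q.put(None)
    worker.join()
    return consumed
```

Because a slot is only refilled after the learner hands it back through the request queue, the packer never overwrites a batch that is still being copied. Keeping the queues rank-local means one rank's stalls do not reorder another rank's batches.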

Validation

  • make test-all
  • Remote 2-GPU smoke test on A800 cards 0 and 7 completed successfully (FastSAC_x2GPU, 5000 iterations): CUDA_VISIBLE_DEVICES=0,7 uv run train --algo sac --task g1_walk_flat --sim mujoco training.num_gpus=2 algo.obs_normalization=false algo.use_symmetry=false
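
For readers reproducing the smoke test on other cards: `CUDA_VISIBLE_DEVICES=0,7` exposes exactly two devices to the process, enumerated as logical devices 0 and 1 in the order listed. The helper below is a hypothetical illustration of that remapping (`visible_device_map` is not part of the repository), and the idea that rank i binds to logical device i is an assumption rather than something stated in this PR.

```python
def visible_device_map(cuda_visible_devices: str) -> dict[int, str]:
    """Map each logical device index to the physical ID listed in the variable."""
    ids = [tok.strip() for tok in cuda_visible_devices.split(",") if tok.strip()]
    return dict(enumerate(ids))


# Assumption: rank i uses logical device i, so here rank 1 would run on physical card 7.
mapping = visible_device_map("0,7")
```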

TATP-233 requested a review from caozx1110 as a code owner on June 21, 2026 14:58
TATP-233 force-pushed the fix/sac-multigpu-replay-ipc branch from abbe1f8 to 75ccb93 on June 21, 2026 15:12
TATP-233 merged commit 54e8469 into main on Jun 21, 2026
6 checks passed
TATP-233 deleted the fix/sac-multigpu-replay-ipc branch on June 21, 2026 15:18