How is this issue impacting you?
Lower performance than expected
Share Your Debug Logs
No response
Steps to Reproduce the Issue
No response
NCCL Version
2.28.3+cu129
Your platform details
I find that when NCCL_GRAPH_MIXING_SUPPORT is enabled, NCCL launches multiple streams inside the CUDA graph:
When I turn it off, the graph looks reasonable:
Error Message & Behavior
No response