Question
When NIC bandwidth is low, the channel search algorithm ends up with PXN channels, so ring alteration won't execute.
I would prefer ring alteration for rail optimization. Also, compared with PXN, crossNic channels have fewer hops.
After some investigation, I found that ncclTopoCompareGraphs() doesn't check the hop count when the graphs have different crossNic values:
Line 436 in b17addf
`if (graph->pattern == refGraph->pattern && graph->crossNic == refGraph->crossNic && graph->nHops < refGraph->nHops) *copy = 1;`
As a result, it prefers the PXN graph over the crossNic graph here.
So the questions are:
- Why is PXN preferred over crossNic?
- Should we prefer fewer hops here, especially for the ring pattern?
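
To make the case concrete, here is a minimal sketch that isolates the quoted condition. `GraphSketch`, `fewerHopsWins` and `fewerHopsWinsIgnoringCrossNic` are hypothetical names made up for illustration; the real `ncclTopoGraph` has many more fields and `ncclTopoCompareGraphs()` does other checks before reaching this line. The second function only illustrates the second question and is not a proposed patch.

```cpp
// Hypothetical stand-in holding only the fields used by the quoted condition.
// The real ncclTopoGraph struct has many more members.
struct GraphSketch {
  int pattern;   // search pattern id (e.g. ring)
  int crossNic;  // crossNic setting the graph was searched with
  int nHops;     // total hop count of the graph
};

// Mirrors only the quoted hop-count condition: the candidate replaces the
// reference when pattern and crossNic match and it has fewer hops.
bool fewerHopsWins(const GraphSketch& graph, const GraphSketch& ref) {
  return graph.pattern == ref.pattern &&
         graph.crossNic == ref.crossNic &&
         graph.nHops < ref.nHops;
}

// Hypothetical variant for the second question: compare hop counts even
// when the two graphs were found with different crossNic values.
bool fewerHopsWinsIgnoringCrossNic(const GraphSketch& graph, const GraphSketch& ref) {
  return graph.pattern == ref.pattern && graph.nHops < ref.nHops;
}
```

Assuming the PXN graph is the reference with `crossNic == 0` and the candidate was found with `crossNic == 1`, `fewerHopsWins` returns false no matter how many fewer hops the candidate has, which is the behavior described above. The variant would accept the candidate in that case.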