Commit 6e5af1e
authored
Revert tracking of Work status for FlightRecorder in ProcessGroupXCCL (#2076)
The callback used to track the work status in ProcessGroupXCCL was
causing an unintended memory leak by maintaining the work objects and
therefor the stashed tensors. For now, I'm removing the callback and I
have added a unit test to ensure this memory leak isn't returning.
Fix #20841 parent bc52e63 commit 6e5af1e
File tree
3 files changed
+36
-17
lines changed- src/xccl
- test/xpu/distributed
3 files changed
+36
-17
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
437 | 437 | | |
438 | 438 | | |
439 | 439 | | |
440 | | - | |
441 | | - | |
442 | | - | |
443 | | - | |
444 | | - | |
445 | | - | |
446 | | - | |
447 | | - | |
448 | | - | |
449 | | - | |
450 | | - | |
451 | 440 | | |
452 | 441 | | |
453 | 442 | | |
| |||
777 | 766 | | |
778 | 767 | | |
779 | 768 | | |
| 769 | + | |
780 | 770 | | |
781 | | - | |
| 771 | + | |
| 772 | + | |
| 773 | + | |
| 774 | + | |
782 | 775 | | |
783 | 776 | | |
784 | 777 | | |
| |||
889 | 882 | | |
890 | 883 | | |
891 | 884 | | |
892 | | - | |
893 | | - | |
894 | | - | |
| 885 | + | |
| 886 | + | |
| 887 | + | |
| 888 | + | |
| 889 | + | |
| 890 | + | |
895 | 891 | | |
896 | 892 | | |
897 | 893 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
424 | 424 | | |
425 | 425 | | |
426 | 426 | | |
427 | | - | |
428 | | - | |
429 | 427 | | |
430 | 428 | | |
431 | 429 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
365 | 365 | | |
366 | 366 | | |
367 | 367 | | |
| 368 | + | |
| 369 | + | |
| 370 | + | |
| 371 | + | |
| 372 | + | |
| 373 | + | |
| 374 | + | |
| 375 | + | |
| 376 | + | |
| 377 | + | |
| 378 | + | |
| 379 | + | |
| 380 | + | |
| 381 | + | |
| 382 | + | |
| 383 | + | |
| 384 | + | |
| 385 | + | |
| 386 | + | |
| 387 | + | |
| 388 | + | |
| 389 | + | |
| 390 | + | |
| 391 | + | |
| 392 | + | |
368 | 393 | | |
369 | 394 | | |
370 | 395 | | |
| |||
0 commit comments