[NVCC] Modify template for FP8 type casting to be compilable in C++20 #1771

jushg · 2025-07-08T10:19:24Z

Context

As mentioned in #1770, we are having trouble compiling NCCL v2.27.5 with C++20 (it compiles with C++11 through C++17). We traced the root cause to this section of device code in reduce_kernel.h, which does not compile under C++20. My understanding is that the rules for aggregate initialization and implicit conversions became stricter in C++20.

In other words, the line `return toPack(VecB(fromPack<VecA>(a)));` relies on being able to construct VecB from a VecA (or vice versa) through an implicit or aggregate conversion. In C++20 this is not allowed unless there is an explicit constructor or conversion operator.
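The implicit-versus-explicit distinction this paragraph leans on can be checked with standard type traits. Below is a minimal host-side sketch, assuming hypothetical stand-ins `SrcVec` and `DstVec` (they are not the CUDA types, and their byte layout is invented for illustration) where the only conversion path is an explicit operator. It does not reproduce the NVCC failure itself; it only shows which initialization forms can reach an explicit operator.

```cpp
#include <type_traits>

// Hypothetical stand-in for a two-float vector type such as float2.
struct DstVec {
  float x;
  float y;
};

// Hypothetical stand-in for a packed two-element type. The only way to
// get a DstVec out of it is the explicit conversion operator below.
struct SrcVec {
  unsigned char lo;
  unsigned char hi;
  explicit operator DstVec() const {
    DstVec out;
    out.x = static_cast<float>(lo);
    out.y = static_cast<float>(hi);
    return out;
  }
};

// Copy-initialization (DstVec d = s;) never considers explicit operators.
static_assert(!std::is_convertible<SrcVec, DstVec>::value,
              "no implicit conversion exists");
// Direct-initialization (DstVec d(s);) is allowed to use them.
static_assert(std::is_constructible<DstVec, SrcVec>::value,
              "explicit conversion is reachable");
```

With standard host compilers both assertions are expected to hold from C++11 onward, so this sketch alone does not explain why NVCC rejects the original line under C++20.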

Fix Detail

I attempted a localized fix for this particular reduce_kernel.h code snippet by converting between the types explicitly and manually, element by element.

Update:

It seems I was being a bit too manual; the compiler just needs a push to use the correct operator. Alternatively, `return toPack((VecB)(fromPack<VecA>(a)));` works as well, since it would similarly invoke the explicit operator: https://docs.nvidia.com/cuda/cuda-math-api/cuda_math_api/struct____nv__fp8x2__e5m2.html#_CPPv4NK15__nv_fp8x2_e5m2cv6float2Ev
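To illustrate the cast forms mentioned in the update, here is a minimal host-side sketch, not the NCCL code. `Float2Like`, `PackedPair`, `castVec`, and `castVecCStyle` are hypothetical names, and the byte layout is invented for the example; the only thing borrowed from the CUDA header is the shape of an `explicit operator float2() const`.

```cpp
#include <cassert>

// Hypothetical stand-in for float2.
struct Float2Like {
  float x;
  float y;
};

// Hypothetical stand-in for a packed FP8 pair. Element 0 lives in the
// low byte and element 1 in the high byte (an assumption for this sketch).
struct PackedPair {
  unsigned short bits;
  explicit operator Float2Like() const {
    Float2Like out;
    out.x = static_cast<float>(bits & 0xFF);
    out.y = static_cast<float>((bits >> 8) & 0xFF);
    return out;
  }
};

// Named cast: requests the conversion explicitly, so the explicit
// operator is eligible.
template <typename To, typename From>
To castVec(const From& from) {
  return static_cast<To>(from);
}

// C-style cast: for class types like these it resolves to the same
// static_cast and therefore the same explicit operator.
template <typename To, typename From>
To castVecCStyle(const From& from) {
  return (To)(from);
}
```

Under these assumptions, `castVec<Float2Like>(PackedPair{0x0302})` and the C-style variant both yield `x == 2.0f` and `y == 3.0f`. Whether NVCC treats the two spellings identically in device code is something this sketch does not verify.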

Have compiled successfully on C++14, C++17 and C++20.

I would appreciate help verifying whether the fix makes sense for this particular case or, if not, what the correct fix would be in order to compile with C++20.

jushg added 2 commits July 8, 2025 11:05

Update reduce_kernel.h

6bfa483

Change the casting template for FP8 types to be compatible with c++20

Update reduce_kernel.h (simpler)

51f5174

Turn out need much less change
