Skip to content

Conversation

@stotko
Copy link
Member

@stotko stotko commented Jun 7, 2025

In #57, the extra CUDA flags generated from PyTorch were transferred from the CUDA_NVCC_FLAGS variable to the proper target. However, the same modification for the CMAKE_CUDA_FLAGS variable was overlooked, resulting in duplicated flags. Furthermore, when multiple CUDA architectures are present, the flags were not correctly passed to the compiler. Make the extraction logic more robust and add a respective test case to cover this case as well.

@stotko stotko added the fixed label Jun 7, 2025
@stotko stotko merged commit d84ed96 into main Jun 7, 2025
24 checks passed
@stotko stotko deleted the cuda_flags_fix branch June 7, 2025 16:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants