Fix propagation of PyTorch CUDA flags for multiple archs #59

stotko · 2025-06-07T14:23:22Z

In #57, the extra CUDA flags generated from PyTorch were transferred from the CUDA_NVCC_FLAGS variable to the proper target. However, the same modification for the CMAKE_CUDA_FLAGS variable was overlooked, resulting in duplicated flags. Furthermore, when multiple CUDA architectures are present, the flags were not correctly passed to the compiler. Make the extraction logic more robust and add a respective test case to cover this case as well.

stotko added the fixed label Jun 7, 2025

Fix propagation of PyTorch CUDA flags for multiple archs

f04a474

stotko force-pushed the cuda_flags_fix branch from 2bbeae5 to f04a474 Compare June 7, 2025 16:01

stotko merged commit d84ed96 into main Jun 7, 2025
24 checks passed

stotko deleted the cuda_flags_fix branch June 7, 2025 16:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix propagation of PyTorch CUDA flags for multiple archs #59

Fix propagation of PyTorch CUDA flags for multiple archs #59

Uh oh!

stotko commented Jun 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix propagation of PyTorch CUDA flags for multiple archs #59

Fix propagation of PyTorch CUDA flags for multiple archs #59

Uh oh!

Conversation

stotko commented Jun 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants