Skip to content

fix(dflash): auto-detect CUDA architecture for Blackwell#98

Closed
easel wants to merge 1 commit intoLuce-Org:mainfrom
easel:feat/consumer-blackwell-auto-detect
Closed

fix(dflash): auto-detect CUDA architecture for Blackwell#98
easel wants to merge 1 commit intoLuce-Org:mainfrom
easel:feat/consumer-blackwell-auto-detect

Conversation

@easel
Copy link
Copy Markdown
Contributor

@easel easel commented May 4, 2026

Split out Blackwell auto-detection from workaround/consumer-specific logic so the workaround can be reviewed and merged independently.

What this changes

  • Query nvidia-smi at configure time when CMAKE_CUDA_ARCHITECTURES is unset or native.
  • Convert the reported compute capability (for example, 12.0) into CMAKE_CUDA_ARCHITECTURES.
  • Keep the change isolated to dflash/CMakeLists.txt.

@easel easel force-pushed the feat/consumer-blackwell-auto-detect branch from 27f128e to d7d150c Compare May 4, 2026 20:22
@easel easel force-pushed the feat/consumer-blackwell-auto-detect branch from d7d150c to 73091fb Compare May 4, 2026 20:24
@easel
Copy link
Copy Markdown
Contributor Author

easel commented May 4, 2026

Superseded by PR #48 (auto-detect) and PR #99 (workaround dependency).

@easel easel closed this May 4, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant