Skip to content

[CUDA] PagedAttention: early-return on empty query input (token_count…

7375578
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Open

[CUDA] PagedAttention: add SM<80 fp16 fallback via memory-efficient attention #28200

[CUDA] PagedAttention: early-return on empty query input (token_count…
7375578
Select commit
Loading
Failed to load commit list.

Annotations

4 warnings
5. Build Extended Minimal
succeeded Apr 25, 2026 in 7m 30s