Skip to content

Conversation

@kdamaszk
Copy link
Contributor

@kdamaszk kdamaszk commented Jul 8, 2025

Cherry-pick from main #163

* Add option to call block_softmax_adjustment op

* Enable block_softmax_adjustment by default for testing

* Add additional type conversion and checks for fp32_softmax

* Change default for VLLM_FUSED_BLOCK_SOFTMAX_ADJUSTMENT

* Reorder version checks and reorganize kernel specification

---------

Co-authored-by: Michal Adamczyk <[email protected]>
czhu15 added a commit that referenced this pull request Jul 16, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants