Skip to content

Conversation

@hlin99
Copy link

@hlin99 hlin99 commented Nov 21, 2025

The change is to resolve HPU(Gaudi) specific requirement that padding in chunked prefill mode can streamline the entire solution.

…ith padding-aware scheduling

The change is to resolve HPU(Gaudi) specific requirement that padding
in chunked prefill mode can streamline the entire solution.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant