Skip to content

Pull requests: huggingface/text-generation-inference

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: expose GPU energy consumption (mJ) in responses
#3315 opened Aug 28, 2025 by JulienDelavande Loading…
2 of 5 tasks
chore: prepare version 3.3.5
#3314 opened Aug 27, 2025 by tengomucho Loading…
Add missing backslash
#3311 opened Aug 12, 2025 by philsupertramp Loading…
2 of 5 tasks
support qwen3 on nvidia
#3302 opened Jul 23, 2025 by icyxp Loading…
Attempt to fix CI errors
#3292 opened Jul 8, 2025 by danieldk Loading…
5 tasks
fix: enable defs references in tool calls
#3291 opened Jul 7, 2025 by drbh Loading…
Update quantization kernels
#3288 opened Jul 7, 2025 by danieldk Draft
5 tasks
feat: allow json_schema in response format and add test
#3276 opened Jun 25, 2025 by drbh Loading…
Disable mamba in CPU platform
#3266 opened Jun 13, 2025 by casassg Loading…
3 of 5 tasks
feat: improve llava next pooling for granite vision
#3255 opened Jun 4, 2025 by drbh Loading…
Trtllm backend improvements
#3231 opened May 17, 2025 by leejuyuu Loading…
1 of 5 tasks
Fix typos
#3210 opened May 6, 2025 by omahs Loading…
1 of 5 tasks
feat: lock updated kernel versions
#3201 opened Apr 29, 2025 by drbh Loading…
Set uv UV_PYTHON_INSTALL_DIR explicitly
#3197 opened Apr 27, 2025 by sebastianliebscher Loading…
1 of 5 tasks
2
README: minimum Python version is 3.10
#3194 opened Apr 25, 2025 by Frenzie Loading…
1 of 5 tasks
feat: support logit bias in chat request
#3186 opened Apr 22, 2025 by drbh Loading…
Fix flashinfer plan call to use positional arguments for #3165
#3166 opened Apr 11, 2025 by ruckc Loading…
2 of 5 tasks
Update to flashinfer 0.2.5
#3164 opened Apr 11, 2025 by danieldk Draft
5 tasks
Add chunked attn for L4
#3162 opened Apr 10, 2025 by mht-sharma Draft
2 of 7 tasks
Update links Inferentia refer docs
#3154 opened Apr 9, 2025 by guspan-tanadi Loading…
1 of 5 tasks
feat: align function id with tool call response
#3111 opened Mar 13, 2025 by drbh Loading…
wip: comment out prepend full_text
#3079 opened Mar 7, 2025 by jrc2139 Draft
1 of 5 tasks
ProTip! Type g i on any issue or pull request to go back to the issue listing page.