Pull requests: Luce-Org/lucebox-hub
Pull requests list
Gemma4 support: pFlash + DFlash + chunked prefill, daemon mode, server routing
#131
opened May 8, 2026 by
dusterbloom
Contributor
4 of 5 tasks
draft: sliding window attention for Qwen3.6 draft model
#129
opened May 8, 2026 by
howard0su
Contributor
fix(dflash): expose qwen reasoning in streaming and non-streaming response
#124
opened May 8, 2026 by
jkyamog
bench(dflash,pflash): add CUDA/HIP mixed backend placement
#122
opened May 7, 2026 by
weicj
Contributor
Add HIP/ROCm support for Strix Halo (gfx1151)
#119
opened May 7, 2026 by
smpurkis
3 of 4 tasks
feat(dflash): support Qwen3.6-27B-DFlash draft (SWA layers) — 106 t/s on RTX 4090
#94
opened May 4, 2026 by
Quitetall
Contributor
perf(pflash): add SM75 target-resident TTFT path
#72
opened May 1, 2026 by
weicj
Contributor
dflash: split target/draft StepGraphs to fix ggml_gallocr realloc per spec-decode step (issue #55)
#62
opened Apr 29, 2026 by
dusterbloom
Contributor
4 of 5 tasks
fix(dflash): auto-detect GPU arch to prevent sm_120a on consumer Blackwell
#48
opened Apr 27, 2026 by
easel
Contributor
2 tasks
feat(dflash): MoE 35B-A3B support + DDTree CUDA graph reuse
#39
opened Apr 27, 2026 by
dusterbloom
Contributor
4 of 5 tasks