Pull requests: SemiAnalysisAI/InferenceX
[codex] update MiniMax M3 FP8 MI355X vLLM image
full-sweep-fail-fast
#1942
opened Jun 26, 2026 by
functionstackx
Collaborator
[AMD] Add MiniMax-M3-FP4 MI355X ATOMESH update 0623
AMD
evals-only
Suppress throughput and run only eval jobs; combine with all-evals to expand selection
full-sweep-enabled
#1940
opened Jun 26, 2026 by
seungrokj
Collaborator
8 tasks
[NV] initial submission for minimax-m3 fp8 GB200 disagg recipes
full-sweep-enabled
#1938
opened Jun 25, 2026 by
richardhuo-nv
Collaborator
Add MiniMax-M3 MXFP8 B300 1k/1k sweep and update image
#1937
opened Jun 25, 2026 by
RohitNagraj
Collaborator
Add MiniMax-M3 NVFP4 B200 single-node vLLM benchmark (EAGLE3 spec decode)
full-sweep-enabled
#1933
opened Jun 25, 2026 by
Ankur-singh
Collaborator
Add MiniMax-M3 NVFP4 B200 single-node aggregated vLLM benchmark
full-sweep-enabled
#1932
opened Jun 25, 2026 by
Ankur-singh
Collaborator
Add MiniMax-M3 NVFP4 B300 Dynamo vLLM benchmarks
full-sweep-enabled
#1931
opened Jun 25, 2026 by
Oseltamivir
Collaborator
[AMD] Add MiniMax-M3-FP8 MI355X ATOMESH update 0623
all-evals
Expand eval selection to every fixed-sequence config
AMD
evals-only
Suppress throughput and run only eval jobs; combine with all-evals to expand selection
#1930
opened Jun 25, 2026 by
seungrokj
Collaborator
8 tasks
[codex] Add golden AL distributions
#1926
opened Jun 24, 2026 by
functionstackx
Collaborator
[NV] Refresh Minimax M3 FP8 submission with new recipes for GB300
full-sweep-enabled
#1925
opened Jun 24, 2026 by
richardhuo-nv
Collaborator
[WIP][NV] dsv4-fp4-b200-sglang image to SGLang nightly 20260624
full-sweep-enabled
#1923
opened Jun 24, 2026 by
hshrivastava-droid
Collaborator
Update B300 FP4 SGLang (non-MTP) image to latest nightly
full-sweep-enabled
#1913
opened Jun 24, 2026 by
hshrivastava-droid
Collaborator
Add GLM-5-FP8 GB300 multinode dynamo-sglang MTP benchmark
full-sweep-enabled
#1907
opened Jun 23, 2026 by
hshrivastava-droid
Collaborator
glm5.1-fp4-mi355x-sglang: bump image to v0.5.13.post1-20260622 + enable aiter allreduce fusion
full-sweep-enabled
#1905
opened Jun 23, 2026 by
jiacao-amd
Collaborator
CollectiveX: experimental cross-vendor collective/EP benchmark
#1896
opened Jun 23, 2026 by
Oseltamivir
Collaborator
Add GLM-5-FP8 GB200 dynamo-sglang multinode benchmark
full-sweep-enabled
#1895
opened Jun 23, 2026 by
hshrivastava-droid
Collaborator
[AMD] dsv4 atom-disagg eval sweep — validate reduced ATOM logging
all-evals
Expand eval selection to every fixed-sequence config
evals-only
Suppress throughput and run only eval jobs; combine with all-evals to expand selection
full-sweep-enabled
#1882
opened Jun 22, 2026 by
Oseltamivir
Collaborator
[CI] Validate aggregate benchmark results before upload
#1881
opened Jun 21, 2026 by
edwingao28
[codex] Enforce complete eval validation and quiet ATOM logs
#1878
opened Jun 21, 2026 by
Oseltamivir
Collaborator
Draft
[Klaud Cold] MI300X MiniMax-M3 nightly image and FP8 KV cache
full-sweep-fail-fast
#1858
opened Jun 19, 2026 by
cquil11
Collaborator
[AMD] Add DSv4-FP4-MI355X ATOMMESH MTP
AMD
#1855
opened Jun 19, 2026 by
seungrokj
Collaborator
2 tasks
[AMD] Optimize MiniMax M3 sparse index scoring on MI300X
sweep-enabled
#1840
opened Jun 18, 2026 by
Oseltamivir
Collaborator