-
Notifications
You must be signed in to change notification settings - Fork 139
Pull requests: SemiAnalysisAI/InferenceX
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: improve benchmark serving to work better under high concurrency
#1102
opened Apr 20, 2026 by
cquil11
Collaborator
Loading…
Add B300 config: kimi-k2.5-fp4-vllm
sweep-enabled
#1100
opened Apr 20, 2026 by
cquil11
Collaborator
Loading…
2 tasks
[AMD/ROCM] GLM5.1 FP4 (MXFP4) MI355X Support
AMD
#1098
opened Apr 20, 2026 by
ajith-sirra-amd
Contributor
Loading…
Trigger H200 multinode evals & revert MI355X image to mori-0227-3
sweep-enabled
#1094
opened Apr 19, 2026 by
Oseltamivir
Collaborator
Loading…
[AMD][MI300X] Extend GPT-OSS FP4 TP=8 search to conc=1 (extends interactivity frontier to ~249 tps/user)
AMD
#1092
opened Apr 19, 2026 by
ramineroane
Collaborator
Loading…
[SGLang broken] Add MI355X config: glm5-fp4-sglang-mtp
vllm/sglang release broken -need to wait
#1091
opened Apr 18, 2026 by
functionstackx
Contributor
•
Draft
4 of 5 tasks
[sglang broken] Add MI355X config: qwen3.5-fp4-sglang-mtp
vllm/sglang release broken -need to wait
#1078
opened Apr 18, 2026 by
functionstackx
Contributor
Loading…
3 of 4 tasks
Add B300 config: kimi-k2.5-int4-vllm
#1071
opened Apr 17, 2026 by
cquil11
Collaborator
Loading…
2 tasks
[WIP][NV] update minimaxm2.5 fp4 b200 vllm flag
NVIDIA
sweep-enabled
#1069
opened Apr 17, 2026 by
hshrivastava-droid
Collaborator
Loading…
[WIP][NV] update minimaxm2.5-fp8-b200-vllm
NVIDIA
sweep-enabled
#1068
opened Apr 17, 2026 by
hshrivastava-droid
Collaborator
Loading…
[Do Not Merge] Upgrade Kimi-K2.5-INT4-MI355X-vLLM image to upstream daily image bcc2306cefa4179c548d3e638e7a22a88d281733
sweep-enabled
#1066
opened Apr 17, 2026 by
chunfangamd
Collaborator
Loading…
[WIP][NV] qwen35 b200 MTP update sglang config
NVIDIA
sweep-enabled
#1065
opened Apr 17, 2026 by
hshrivastava-droid
Collaborator
Loading…
Add options to override default extra_body and num_prompts when profiler is enabled
#1044
opened Apr 16, 2026 by
devalshahamd
Loading…
[WIP] [AMD/ROCM] atom glm5 fp4 on mi355x
AMD
#1043
opened Apr 16, 2026 by
seungrokj
Collaborator
Loading…
[WIP] [AMD/ROCM] atom minimaxm2.5 fp4 on mi355x
AMD
#1042
opened Apr 16, 2026 by
seungrokj
Collaborator
Loading…
[WIP] [AMD/ROCM] atom qwen fp8/bf16 on mi355x
AMD
#1040
opened Apr 16, 2026 by
seungrokj
Collaborator
Loading…
Add full-sweep-enabled label to split sweep tiers
#1039
opened Apr 16, 2026 by
n0madsky
Loading…
2 of 5 tasks
Locks python dependencies for CI into a requirements.txt file and cache the dependencies
#1037
opened Apr 16, 2026 by
n0madsky
Loading…
[Do Not Merge][NV] GLM5 fp8 update sglang container
NVIDIA
sweep-enabled
#1033
opened Apr 15, 2026 by
hshrivastava-droid
Collaborator
Loading…
[experimental] add multi-turn KV cache stress benchmark traces
#1032
opened Apr 15, 2026 by
OCWC22
Loading…
4 tasks
[Do Not Merge] update glm5 fp8 b200 sglang container
NVIDIA
sweep-enabled
#1030
opened Apr 14, 2026 by
hshrivastava-droid
Collaborator
Loading…
1 task
Add vLLM dynamic scheduler reconfigure for single-server sweeps
#1029
opened Apr 14, 2026 by
JordanNanos
Collaborator
Loading…
3 of 6 tasks
[WIP] Update Qwen3.5 FP8 B200 SGLang
sweep-enabled
#1027
opened Apr 13, 2026 by
Ankur-singh
Collaborator
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.