Skip to content

Pull requests: SemiAnalysisAI/InferenceX

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[WIP] agentx integration
#1103 opened Apr 20, 2026 by cquil11 Collaborator Draft
feat: improve benchmark serving to work better under high concurrency
#1102 opened Apr 20, 2026 by cquil11 Collaborator Loading…
Add B300 config: kimi-k2.5-fp4-vllm sweep-enabled
#1100 opened Apr 20, 2026 by cquil11 Collaborator Loading…
2 tasks
[AMD/ROCM] GLM5.1 FP4 (MXFP4) MI355X Support AMD
#1098 opened Apr 20, 2026 by ajith-sirra-amd Contributor Loading…
Add B300 config: kimi-k2.5-int4-vllm
#1071 opened Apr 17, 2026 by cquil11 Collaborator Loading…
2 tasks
[WIP] [AMD/ROCM] atom glm5 fp4 on mi355x AMD
#1043 opened Apr 16, 2026 by seungrokj Collaborator Loading…
[WIP] [AMD/ROCM] atom minimaxm2.5 fp4 on mi355x AMD
#1042 opened Apr 16, 2026 by seungrokj Collaborator Loading…
[WIP] [AMD/ROCM] atom qwen fp8/bf16 on mi355x AMD
#1040 opened Apr 16, 2026 by seungrokj Collaborator Loading…
Add full-sweep-enabled label to split sweep tiers
#1039 opened Apr 16, 2026 by n0madsky Loading…
2 of 5 tasks
[experimental] add multi-turn KV cache stress benchmark traces
#1032 opened Apr 15, 2026 by OCWC22 Loading…
4 tasks
Add vLLM dynamic scheduler reconfigure for single-server sweeps
#1029 opened Apr 14, 2026 by JordanNanos Collaborator Loading…
3 of 6 tasks
[WIP] Update Qwen3.5 FP8 B200 SGLang sweep-enabled
#1027 opened Apr 13, 2026 by Ankur-singh Collaborator Loading…
ProTip! no:milestone will show everything without a milestone.