Minimax M2.5 MXFP4 benchmark for MI355x vLLM v0.19.1 (TP=1,2,4)#827
functionstackx wants to merge 17 commits into main
|
@claude can u comment out TP=4 and only do TP=2 for now since TP=4 is bugged vllm-project/vllm#35637 |
|
Claude finished @functionstackx's task in 53s
Commented out TP=4 for MiniMax M2.1 MXFP4
Due to vllm-project/vllm#35637, TP=4 has been disabled. Only TP=2 will run.
Changes made:
|
|
@chunfangamd I am seeing good mxfp4 performance compared to FP8. Just waiting for AMD to release an mxfp4 version of MiniMax M2.5 now. Currently AMD only has an mxfp4 version of MiniMax M2.1.
|
bd10495 to e849d65
86cc700 to b82116b
b82116b to 7dd6063
Add error handling for model download failure
Use sudo for permission changes and directory removal.
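The two commit messages above describe handling a failed model download and cleaning up the directory afterwards. The PR's actual script is not shown in this conversation, so the following is only a minimal sketch of that pattern. The function name `download_with_cleanup` and the injected `download` callable are hypothetical; in a real setup the callable would wrap whatever download tool the benchmark uses.

```python
import shutil
import time
from pathlib import Path
from typing import Callable


def download_with_cleanup(
    download: Callable[[Path], None],  # hypothetical: wraps the real download step
    target_dir: Path,
    attempts: int = 2,
    wait_seconds: float = 0.0,
) -> bool:
    """Try a download a few times, deleting partial files after each failure."""
    for attempt in range(1, attempts + 1):
        try:
            download(target_dir)
            return True
        except Exception as exc:  # broad on purpose: any failure triggers cleanup
            print(f"download attempt {attempt} failed: {exc}")
            # Remove the partially written directory so the next attempt,
            # or the next CI run, does not start from a corrupted checkpoint.
            shutil.rmtree(target_dir, ignore_errors=True)
            if attempt < attempts:
                time.sleep(wait_seconds)
    return False
```

This sketch removes files as the current user. If the files were written by another user (for example root inside a container), `shutil.rmtree` would fail silently here because of `ignore_errors=True`, which may be why the commit switches to sudo for permission changes and removal.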
|
@benenzhu vllm/vllm-openai-rocm:v0.19.1 is ready, can you submit a PR ? cc. @ajith-sirra-amd @functionstackx |
|
Feel free to ping me for quick review when u have a PR for minimax mxfp4 that is working and has passed pr validation and there is a vllm-project/recipes update (if needed) |
|
https://github.com/SemiAnalysisAI/InferenceX/actions/runs/24647578553/job/72063351980?pr=827 @seungrokj Hi, I retriggered the CI, seems the |
|
@chunfangamd can u take a look at debugging this? Probably previous runs have corrupted the huggingface model cache from @benenzhu testing. The fix is just rm this minimax ckpt folder on the cluster |
@functionstackx Hi I think that's some gh runner problem in this machine, the docker command also fails. |
Both @chunfangamd and/or @cquil11 have cluster access to fix it. @chunfangamd can u help? |
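For the suggested fix of removing the cached checkpoint folder, here is a minimal sketch of locating and deleting a model's entry in the Hugging Face hub cache. It assumes the standard cache layout (`models--<org>--<name>` under `$HF_HUB_CACHE`, or under `$HF_HOME/hub`, defaulting to `~/.cache/huggingface/hub`). The cluster's actual cache location and the exact checkpoint that was corrupted are not stated in this thread, so the repo id in the usage note is only an example, and the helper names are hypothetical.

```python
import os
import shutil
from pathlib import Path
from typing import Mapping, Optional


def hf_cache_dir_for(repo_id: str, env: Optional[Mapping[str, str]] = None) -> Path:
    """Return the hub cache folder for a model repo, e.g. 'org/name'."""
    env = os.environ if env is None else env
    if "HF_HUB_CACHE" in env:
        hub = Path(env["HF_HUB_CACHE"])
    else:
        default_home = str(Path.home() / ".cache" / "huggingface")
        hub = Path(env.get("HF_HOME", default_home)) / "hub"
    # Model repos are stored as models--<org>--<name>.
    return hub / ("models--" + repo_id.replace("/", "--"))


def remove_cached_model(repo_id: str, env: Optional[Mapping[str, str]] = None) -> bool:
    """Delete the cached copy of a model; return True if something was removed."""
    path = hf_cache_dir_for(repo_id, env)
    if not path.exists():
        return False
    shutil.rmtree(path)
    return True
```

For example, `remove_cached_model("amd/MiniMax-M2.5-MXFP4")` would force a fresh download on the next run. The sketch ignores `XDG_CACHE_HOME`, which can also move the default cache location.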
|
@functionstackx Hi, I have disabled the bad machine and the actions have passed. Could you help review it?
|
|
@cquil11 or @Oseltamivir can u review this pr |
|
I am the original author of this pull request so GitHub doesn't let me approve my own pr |
|
@benenzhu one issue I see before I go to sleep is that vllm-project/recipes doesn't have a recipe for this minimax M2.5 mxfp4. We wanna ensure that the entire ml community benefits from ur hard work. Can u please create a recipe for it in that documentation |
Yeah, thanks for the review. I will raise one PR for this. FP4 uses a model path of Have a good night. |
|
Thanks @benenzhu ! Can u please merge this Kimi K2.5 mxfp4 recipe? It's been stuck unmerged for 3 weeks. |







Add MiniMax M2.5 MXFP4 benchmark config for MI355x with vLLM v0.19.1, now that AMD's MXFP4 checkpoint is out: https://huggingface.co/amd/MiniMax-M2.5-MXFP4
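As a rough illustration of what a tensor-parallel sweep over this checkpoint could look like, the sketch below builds one `vllm serve` command per TP size, with an optional skip set for sizes that are temporarily disabled. It is not the config added by this PR: the helper names, the port, and the skip mechanism are assumptions, and only the model id, the TP sizes from the title, and the `--tensor-parallel-size` and `--port` flags are taken as given.

```python
import shlex
from typing import Iterable, List

MODEL = "amd/MiniMax-M2.5-MXFP4"  # checkpoint linked in the PR description
TP_SIZES = [1, 2, 4]              # sizes named in the PR title


def build_serve_command(model: str, tp_size: int, port: int = 8000) -> List[str]:
    """Build the argv for one vLLM server launch at a given TP size."""
    return [
        "vllm", "serve", model,
        "--tensor-parallel-size", str(tp_size),
        "--port", str(port),  # port value is an arbitrary choice for this sketch
    ]


def planned_commands(
    model: str = MODEL,
    tp_sizes: Iterable[int] = TP_SIZES,
    skip: Iterable[int] = (),
) -> List[List[str]]:
    """Return one command per TP size, leaving out any size listed in skip."""
    skipped = set(skip)
    return [build_serve_command(model, tp) for tp in tp_sizes if tp not in skipped]


if __name__ == "__main__":
    # Example: leave out TP=4 while an upstream bug is open.
    for cmd in planned_commands(skip={4}):
        print(shlex.join(cmd))
```

Keeping the skip set separate from the list of sizes makes it easy to re-enable a size later without editing the sweep itself.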
Closes #826
Generated with Claude Code