Add Llama 3.1 405B 128gpus FP8CS recipe for A4X #recipebot #210

Merged
tonyjohnchen merged 1 commit into AI-Hypercomputer:main from weikuo0506:weikuo-llama31-405b-fp8cs-recipe on May 5, 2026

Conversation

@weikuo0506
Contributor

Adds a pretraining recipe for Llama 3.1 405B on 32 A4X (GB200) nodes, 128 GPUs in total, using FP8CS. Run ID: megatron_bridge_training-nemo2602/llama31_405b_128gpus_fp8cs_seq8192_gbs64-2026-04-19_110753-e40edbf2-1b55-4581-b0b5-ebc2ac9f2de6

@weikuo0506 marked this pull request as ready for review on April 28, 2026, 11:38
Collaborator

@tonyjohnchen left a comment

LGTM

@tonyjohnchen merged commit b1e3bb9 into AI-Hypercomputer:main on May 5, 2026
1 check passed