Skip to content

[ROCm]: fix: reduce pipeline temp memory — replace ppermute collectives with lax.slice/pad (PR2) #4192

Open
cj401-amd wants to merge 3 commits into
AI-Hypercomputer:mainfrom
cj401-amd:cj/tmem-fixes-clean-2-pipeline-tmem
Open

[ROCm]: fix: reduce pipeline temp memory — replace ppermute collectives with lax.slice/pad (PR2) #4192
cj401-amd wants to merge 3 commits into
AI-Hypercomputer:mainfrom
cj401-amd:cj/tmem-fixes-clean-2-pipeline-tmem

fix: pipeline tmem reduction — replace ppermute collectives, expose p…

62907cf
Select commit
Loading
Failed to load commit list.
Google CLA / cla/google succeeded Jun 18, 2026 in 7s

✅ All contributors are covered under a CLA with Google

See https://cla.developers.google.com/ for more info about Google's Contributor License Agreement (CLA).

ℹ️ Googlers: Go here to view more details and manage scans for this pull request.

Details

The following contributors were found for this pull request:

62907cf Author: @cj401-amd <ch****in​@amd.com>

(Only the first commit for a unique contributor is listed.)