hooks/pyramid_attention_broadcast: remove redundant iteration==0 guard and fix stale cache VRAM leak #13497
Open
GitGlimpse895 wants to merge 1 commit into huggingface:main from
What does this PR do?
Fixes two bugs in `PyramidAttentionBroadcastHook.new_forward`:

1. **Redundant `iteration == 0` condition.** After every `reset_state()`, `iteration` and `cache` are reset to `0` and `None` respectively, so at iteration 0 `self.state.cache is None` is always `True`. That makes `self.state.iteration == 0` permanently dead code that creates the misleading impression of two independent invariants.

2. **Stale cache leaking GPU VRAM.** When outside the active timestep range, the hook unconditionally wrote `self.state.cache = output`, holding a full hidden-state activation tensor on the GPU until the next generation's `reset_state()` call. For video transformers with dozens of PAB-hooked layers, this accumulates hundreds of MBs of unreleased VRAM. The fix sets `self.state.cache = None` immediately when outside the range; see the sketch below.

This is a re-submission of #13467 (accidentally self-closed). Apologies for the noise, @sayakpaul!
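For reference, a minimal sketch of the fixed control flow. Names like `fn_ref.original_forward`, `timestep_skip_range`, `block_skip_range`, and `current_timestep_callback` follow the hook's existing conventions but are illustrative here, not the verbatim patch:

```python
def new_forward(self, module, *args, **kwargs):
    low, high = self.timestep_skip_range
    is_within_timestep_range = low < self.current_timestep_callback() < high

    if not is_within_timestep_range:
        # Bug 2 fix: outside the active range, recompute and drop any stale
        # cache instead of pinning a full activation tensor on the GPU.
        output = self.fn_ref.original_forward(*args, **kwargs)
        self.state.cache = None
    else:
        # Bug 1 fix: `cache is None` already covers iteration 0, since
        # reset_state() clears `cache` and `iteration` together, so the
        # separate `iteration == 0` check is dropped as dead code.
        should_compute_attention = (
            self.state.cache is None
            or self.state.iteration % self.block_skip_range == 0
        )
        output = (
            self.fn_ref.original_forward(*args, **kwargs)
            if should_compute_attention
            else self.state.cache
        )
        self.state.cache = output

    self.state.iteration += 1
    return output
```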
Checklist
Who can review?
@sayakpaul @DN6