
Rewrite ShapeFeature to not hold live variables #2056

Draft

ricardoV94 wants to merge 4 commits into pymc-devs:main from ricardoV94:shape_feature

Conversation

@ricardoV94
Member

ricardoV94 commented Apr 17, 2026

Closes pymc-devs/pymc-extras#673

ShapeFeature reintroducing variables we have already lowered/rewritten away is no bueno

Replace the eager per-variable dicts (shape_of, shape_of_reverse_index, scheduled) with a lazy FrozenFunctionGraph-based shape-kernel cache. For each Apply, a kernel built from dummy clones of node.inputs is stored in self._cache[node] and materialized against the current live inputs on demand via a custom frozen-graph walker (graph_replace would mutate the globally-interned FrozenApply inputs).

The kernel holds only NominalVariables and Constants, so no live
variable can leak between tests or across rewrites, eliminating by
construction the stale-XRV class of bugs.
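
A minimal sketch of the idea, with an invented name (`LazyShapeKernelCache`) and `graph_replace` standing in for the PR's custom frozen-graph walker, and assuming this PR's `infer_shape(node, input_shapes)` signature:

```python
from pytensor.graph.replace import graph_replace


class LazyShapeKernelCache:
    """Toy version of the cache: one shape kernel per Apply, built
    against dummy inputs and rebound to the live inputs at query time."""

    def __init__(self):
        # Apply -> (dummy inputs, per-output shape tuples)
        self._cache = {}

    def get_shape(self, node, out_idx, dim):
        if node not in self._cache:
            # Build the kernel from dummy clones so the cache never
            # captures a live variable from the fgraph under rewrite.
            dummies = [inp.type() for inp in node.inputs]
            dummy_shapes = [
                tuple(d.shape[i] for i in range(d.type.ndim)) for d in dummies
            ]
            # infer_shape(node, input_shapes): signature per this PR.
            kernel = node.op.infer_shape(node, dummy_shapes)
            self._cache[node] = (dummies, kernel)
        dummies, kernel = self._cache[node]
        # Materialize on demand: rebind the dummies to today's inputs.
        replacements = dict(zip(dummies, node.inputs))
        [live_dim] = graph_replace(
            [kernel[out_idx][dim]], replacements, strict=False
        )
        return live_dim
```

Since the kernel graphs reference only the dummies (NominalVariables in the real implementation) and Constants, discarding or rewriting the host fgraph cannot leave stale references in the cache.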

Back-compat surface (_LazyShapeTuple, _ShapeOfProxy, update_shape,
shape_ir, init_r) is retained and marked as temporary. A regression
test for the stale-XRV scenario replaces the prior xfail.

shape_of_variables switches to builders.infer_shape, so it goes back to
taking scalar dims as inputs instead of allocating a full array per input.
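
For reference, a usage sketch of `shape_of_variables` (API as in current PyTensor; the exact formatting of the returned dims may change with this PR):

```python
import pytensor.tensor as pt
from pytensor.graph import FunctionGraph
from pytensor.tensor.utils import shape_of_variables

x = pt.matrix("x")
y = pt.dot(x.T, x)
# clone=False so the dict below can be keyed by the original variables
fg = FunctionGraph([x], [y], clone=False)
# Maps every variable in fg to its numeric shape, given the input shapes.
shapes = shape_of_variables(fg, {x: (5, 3)})
print(shapes[y])  # -> (3, 3)
```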

local_track_shape_i no longer depends on the deleted scheduled dict;
it rewrites Shape_i(v, i) to get_shape(v, i) whenever the kernel
produces something other than the trivial fallback.
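
Roughly what the rewrite looks like after this change (a simplified sketch, not the in-tree code; `get_shape` lazily materializes the cached kernel per this PR):

```python
from pytensor.graph.rewriting.basic import node_rewriter
from pytensor.tensor.shape import Shape_i


@node_rewriter([Shape_i])
def local_track_shape_i(fgraph, node):
    shape_feature = getattr(fgraph, "shape_feature", None)
    if shape_feature is None:
        return None
    [var] = node.inputs
    # Lazily materialize the cached kernel for this dimension.
    dim = shape_feature.get_shape(var, node.op.i)
    # The trivial fallback is Shape_i(var, i) itself; only rewrite when
    # the kernel produced something better.
    if dim.owner is not None and dim.owner.op == node.op and dim.owner.inputs == [var]:
        return None
    return [dim]
```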

on_change_input carries r's inferred shape onto new_r as an override
when new_r's Op has no infer_shape, preserving the legacy behavior
where a well-inferred shape survives through a replacement with an
opaque op.
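
A sketch of that override logic, assuming a hypothetical `_overrides` store (the PR's actual bookkeeping may differ; `shape_tuple` is the existing ShapeFeature helper):

```python
def on_change_input(self, fgraph, node, i, r, new_r, reason=None):
    # If new_r's Op cannot infer shapes itself, pin r's already-inferred
    # shape onto new_r so the replacement does not degrade shape info.
    # `_overrides` is a hypothetical store, not the PR's actual attribute.
    if new_r.owner is not None and not hasattr(new_r.owner.op, "infer_shape"):
        self._overrides[new_r] = self.shape_tuple(r)
```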

Benchmarks (cxx enabled):

  • radon_repeat       0.78s -> 0.55s (-30%)
  • radon_variants (8) 7.9s  -> 7.2s  (-9%)
  • fusion_large       0.22s -> 0.22s (noise)
  • fusion_deep        13ms  -> 13ms  (noise)

ricardoV94 added 4 commits May 1, 2026 18:00
`Alloc.do_constant_folding` listed `Elemwise | DimShuffle | Alloc | Join`
and batched-`Blockwise` as protected client ops, but not `Subtensor`.
`local_subtensor_of_alloc` rewrites `alloc(val, *shape)[idx]` into
`alloc(val[...], *new_shape)` — preserving the Alloc structure that
downstream rewrites like `local_blockwise_alloc_inputs` depend on.
Folding the Alloc here short-circuited that lift and produced
broadcast-equivalent `Constant` matrices whose batch dim was no longer
type-broadcastable, so `local_blockwise_reshape` couldn't unwrap the
surrounding `Blockwise(Reshape)`.

Surfaced by the lazy-kernel `ShapeFeature` (which resolves
`Subtensor(Shape(out), const)` to a scalar `Constant` earlier and
makes more upstream Allocs constant-foldable), but the fix belongs
here — the protection was too narrow.
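
The shape of the fix, as a simplified sketch (the in-tree method also protects inplace IncSubtensor-family clients, elided here):

```python
from pytensor.tensor.basic import Alloc, Join
from pytensor.tensor.blockwise import Blockwise
from pytensor.tensor.elemwise import DimShuffle, Elemwise
from pytensor.tensor.subtensor import Subtensor


def do_constant_folding(self, fgraph, node):
    for client, _ in fgraph.clients[node.outputs[0]]:
        if client == "output":
            # Folded output constants get deep-copied on every call.
            return False
        # Clients through which the Alloc can still be lifted; folding
        # it into a Constant here would block those rewrites.
        # Subtensor is the newly added entry.
        if isinstance(client.op, (Elemwise, DimShuffle, Alloc, Join, Subtensor)):
            return False
        if isinstance(client.op, Blockwise) and client.op.batch_ndim(client):
            return False
    return True
```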
Breaking API change: the `fgraph` argument was unused by every
in-tree `infer_shape` implementation. Removing it makes
`infer_shape` a pure function of `(node, input_shapes)`: simpler
to call from outside an fgraph context (e.g. ShapeFeature's lazy
kernel build) and a tighter contract.

External Ops with custom `infer_shape(self, fgraph, node, input_shapes)`
must drop the `fgraph` parameter.
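
A minimal migration example (the Op body is illustrative):

```python
from pytensor.graph.op import Op


class MyOp(Op):
    # Before: def infer_shape(self, fgraph, node, input_shapes): ...
    # After, a pure function of the node and its input shapes:
    def infer_shape(self, node, input_shapes):
        # Illustrative: output shape matches the first input's shape.
        return [input_shapes[0]]
```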
Add `break_aliasing_cycles` to `pytensor.graph.replace`. When an inplace
Op overwrites input `x` and a single Apply ends up reading both `x` and
a transitive dependent of the destroyer's output, no valid schedule
exists. The helper re-routes such inputs through `deep_copy_op` to lift
the conflict.

Expose it via a `ShapeFeature.get_shape_no_cycle` convenience method,
and use it from `introduce_explicit_core_shape_rv` and
`introduce_explicit_core_shape_blockwise`, where lazy shape
materialization can otherwise produce that pattern.
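
A hedged usage sketch; the call shape is assumed from this description, since the helper is new in this PR:

```python
import pytensor.tensor as pt
from pytensor.graph.replace import break_aliasing_cycles  # new in this PR

x = pt.vector("x")
outputs = [x + 1]  # placeholder graph; a real conflict needs an inplace destroyer of x
# Re-routes any input that reads both a destroyed variable and a
# dependent of the destroyer's output through deep_copy_op; a graph
# with no such cycle should come back unchanged.
safe_outputs = break_aliasing_cycles(outputs)
```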


Development

Successfully merging this pull request may close these issues.

Prior.create_variable(xdist=True) fails compile_logp for centered priors with nested Prior parameters that have dims
