
pullback reorganization #208

Merged
lkdvos merged 13 commits into main from jh/svdpullback
Apr 23, 2026
Conversation

@Jutho (Member) commented Apr 17, 2026

This PR does a number of pullback-related things:

  1. It moves the remove_x_gauge_dependence! functions from the tests to the main repository, so that they can be reused by higher-level packages (TensorKit), and so that it is easier to keep them in sync with the functions that test the gauge dependence.

  2. Because testing the gauge dependence in the adjoints requires some of the same intermediate calculations as the actual computation of the pullback (so some computations were previously performed twice), there has been some restructuring, resulting in a rename of check_x_cotangents to check_and_prepare_x_cotangents.

  3. The svd_pullback implementation has been fixed to accept svd_full pullbacks, and should also be more robust against pullbacks resulting from an svd_trunc with arbitrary ind.

  4. The implementation of the svd_trunc_pullback that computes the pullback without depending on the full SVD of A has been changed: the old one was reportedly (thanks @pbrehmer) very slow for larger matrices, and the Sylvester solver is not available on GPU hardware. Instead, the Sylvester equation is now explicitly solved as a geometric series. This requires an additional parameter, called maxiter (the number of terms kept in the geometric series). It also requires that the largest singular values are the ones being kept, as the convergence rate is essentially determined by the ratio "largest truncated singular value" / "smallest kept singular value". This implementation is only tested in the chainrules tests, where I already noticed random cases that needed quite a large number of iterations to converge (100 was not enough; I bumped it to 1000, which does seem sufficient for the tests). This could be useful in combination with randomized SVD, where, because of oversampling, you probably do have an explicit value for the "largest truncated singular value", so that you can estimate the required number of iterations beforehand.
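The geometric-series construction in point 4 can be sketched as follows (an illustrative Python/NumPy sketch under my own assumptions, not the package's actual Julia implementation; all names are made up). For an equation of the form X S − A X = R with S diagonal and ‖A‖ < min(S), repeated substitution X = (R + A X) S⁻¹ gives X = Σ_{k≥0} Aᵏ R S^{−(k+1)}, with each successive term smaller by roughly the ratio ‖A‖ / min(S):

```python
import numpy as np

def sylvester_geometric(A, s, R, maxiter=1000, tol=1e-14):
    """Solve X @ diag(s) - A @ X = R as a geometric (Neumann) series.

    Converges when norm(A) < min(s), i.e. when the largest truncated
    singular value is smaller than the smallest kept one; the error
    shrinks linearly with ratio norm(A) / min(s).
    """
    Sinv = 1.0 / s                # diag(s) is trivially invertible
    T = R * Sinv                  # k = 0 term: R @ diag(s)^-1
    X = T.copy()
    for _ in range(maxiter):
        T = (A @ T) * Sinv        # next term: A^k @ R @ diag(s)^-(k+1)
        X += T
        if np.linalg.norm(T) < tol * np.linalg.norm(X):
            break
    return X
```

The stopping criterion mirrors the role of maxiter in the PR: when the ratio of truncated to kept singular values is close to 1, many terms are needed, which is exactly the regime where 100 iterations were not enough.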

Things to do:

  • Actually test new svd_trunc_pullback on GPU in combination with randomized SVD

  • The following suggestion could probably dramatically speed up convergence, as it changes linear convergence into quadratic convergence (the typical rate at which dense eigenvalue decompositions / SVDs converge): the error is squared in every iteration and reaches sub-machine precision in a handful of iterations. I will try to implement this instead asap:

[Screenshot of the suggested quadratically convergent iteration, 2026-04-18]

@Jutho (Member, Author) commented Apr 18, 2026

Ok, I've now implemented the quadratic approach, and it is indeed much faster, typically converging in fewer than 10 iterations for double precision. It is called svd_trunc_pullback2!, but should replace svd_trunc_pullback!. I'd be interested in hearing about some real case timings from @pbrehmer 😄 .
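One way such an "error squared in every iteration" scheme can work (a hedged Python/NumPy sketch of a plausible doubling construction, not necessarily the exact update from the screenshot or the package's implementation): the partial sums X_n = Σ_{k<n} Aᵏ R S^{−(k+1)} of the geometric series for X S − A X = R satisfy X_{2n} = X_n + Aⁿ X_n S^{−n}, so squaring Aⁿ and S^{−n} at every step squares the error ratio:

```python
import numpy as np

def sylvester_doubling(A, s, R, maxiter=60, tol=1e-14):
    """Solve X @ diag(s) - A @ X = R with a doubling (squaring) scheme.

    The partial sums X_n = sum_{k<n} A^k R diag(s)^-(k+1) of the
    geometric series satisfy X_{2n} = X_n + A^n X_n diag(s)^-n, so
    keeping P = A^n and q = s^-n and squaring them each step makes the
    error ratio (norm(A)/min(s))^n square at every iteration.
    """
    X = R * (1.0 / s)         # X_1 = R @ diag(s)^-1
    P = A.copy()              # P = A^n, starting at n = 1
    q = 1.0 / s               # q encodes diag(s)^-n
    for _ in range(maxiter):
        dX = (P @ X) * q      # A^n @ X_n @ diag(s)^-n
        X += dX
        P = P @ P             # A^(2n)
        q = q * q             # diag(s)^-(2n)
        if np.linalg.norm(dX) < tol * np.linalg.norm(X):
            break
    return X
```

With the error ratio squared each step, sub-machine precision is reached after O(log log(1/ε)) doublings, consistent with the handful of iterations reported above.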

@pbrehmer commented Apr 19, 2026

Thanks for the upgraded pullbacks! I have some real case timings, but using my own implementation, where the Sylvester equation is solved using KrylovKit's linsolve. It still might be interesting to see, since the scaling should be comparable, I think. My real test case is the fixed-point gradient linear problem of asymmetric CTMRG (without spatial symmetries) for the Fermi-Hubbard model. The benchmarks are run on multiple intermediately-optimized PEPS and then averaged. I plot the timings on a log-log scale as a function of $D^2\chi$ (the matrix that is decomposed is $D^2\chi \times D^2\chi$-dimensional) and normalize them by the CTMRG contraction times (CTMRG runs use a hot start):

[Plot: pullback timings vs. $D^2\chi$ on a log-log scale, normalized by the CTMRG contraction time]

It's pretty clear that linsolve outperforms the LAPACK Sylvester solver already at moderate dimensions, and that its scaling is much better. It's also interesting to see that the Sylvester solver indeed has approximately the same scaling as the CTMRG contraction step, i.e. the full SVD.

@Jutho (Member, Author) commented Apr 19, 2026

Which pullback is this now using? Is that the previous svd_trunc_pullback! (current main branch or latest release)? Or one of the two pullbacks svd_trunc_pullback! or svd_trunc_pullback2! in this branch?

@Jutho (Member, Author) commented Apr 19, 2026

Also, thanks for the very quick weekend response.

@Jutho (Member, Author) commented Apr 19, 2026

This PR seems to have broken the GPU extensions (both CUDA and AMD). Not sure what is causing this.

@pbrehmer commented

Which pullback is this now using? Is that the previous svd_trunc_pullback! (current main branch or latest release)? Or one of the two pullbacks svd_trunc_pullback! or svd_trunc_pullback2! in this branch?

It was using the svd_trunc_pullback! from the latest release (v0.6.5). What I should mention again is that linsolve will only converge this fast if it is preconditioned properly with the inverse singular values. Here are the lines of code that I changed in svd_trunc_pullback! to implement this preconditioning:

```julia
# replace XY = _sylvester(ÃÃ, -Smat, rhs) with linsolve
Smat⁻¹ = diagm(inv_safe.(S, degeneracy_atol))
# linear map whose solution satisfies XY = ÃÃ * XY * Smat⁻¹ + rhs * Smat⁻¹
f(xy) = ÃÃ * xy * Smat⁻¹ - xy
XY₀ = zeros(scalartype(ÃÃ), size(ÃÃ, 2), size(Smat⁻¹, 1))
XY, info = linsolve(f, -rhs * Smat⁻¹, XY₀, solver_alg)
```

@Jutho (Member, Author) commented Apr 20, 2026

It was using the svd_trunc_pullback! from the latest release (v0.6.5). What I should mention again is that linsolve will only converge this fast if it is preconditioned properly with the inverse singular values. Here are the lines of code that I changed in svd_trunc_pullback! to implement this preconditioning:

@pbrehmer , would it be much work for you to run the same benchmark with svd_trunc_pullback! and svd_trunc_pullback2! from this branch (especially the latter) and add the result to that plot?

For very large matrix sizes, I agree that Krylov will still be the preferred option, but then the svd_trunc itself should also be done with Krylov methods. It is also a fact that if you did compute the full SVD, then simply using svd_pullback! will still be the fastest. So I think these svd_trunc_pullback(2)! methods really only make sense in combination with something like a randomized SVD, but it would still be interesting to see the relative performance of each of them.
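The beforehand iteration estimate mentioned in the PR description can be made concrete (a hypothetical helper, illustrative Python, not part of MatrixAlgebraKit): with linear convergence ratio r = (largest truncated singular value) / (smallest kept singular value), reaching a tolerance tol takes roughly log(tol) / log(r) terms of the geometric series, which is exactly the kind of bound oversampled randomized SVD lets you compute in advance:

```python
import math

def estimate_maxiter(s_trunc_max, s_kept_min, tol=1e-14):
    """Estimate the number of geometric-series terms needed to reach `tol`.

    Each extra term shrinks the error by r = s_trunc_max / s_kept_min,
    so r**n <= tol gives n >= log(tol) / log(r).
    """
    r = s_trunc_max / s_kept_min
    if r >= 1.0:
        raise ValueError("series diverges: need s_trunc_max < s_kept_min")
    return math.ceil(math.log(tol) / math.log(r))
```

For r close to 1 this estimate grows very quickly, consistent with the random test cases above where 100 iterations were not enough.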

@pbrehmer commented

@pbrehmer , would it be much work for you to run the same benchmark with svd_trunc_pullback! and svd_trunc_pullback2! from this branch (especially the latter) and add the result to that plot?

I can do that, but currently my computational resources on the cluster are blocked by some other simulations, so it will take a bit!

For very large matrix sizes, I agree that Krylov will still be the preferred option, but then the svd_trunc itself should also be done with Krylov methods. It is also a fact that if you did compute the full SVD, then simply using svd_pullback! will still be the fastest. So I think these svd_trunc_pullback(2)! methods really only make sense in combination with something like a randomized SVD, but it would still be interesting to see the relative performance of each of them.

Yes, indeed. For these benchmarks I explicitly wanted to check the performance of the truncated pullbacks, but it was still more convenient to just compute the forward pass using a full SVD.

@lkdvos (Member) left a review comment

It is probably also worth adding the remove_f_gauge_dependence functions to the public list, given that we explicitly intend them to be used by TensorKit.

Overall this looks like a great PR though; this should hopefully stabilize some of the issues we've been having 🥳

One remaining question that definitely doesn't have to be addressed here but is worth bringing up:

  • Is it worth refactoring the Sylvester solver into its own function, or is this one too hand-crafted for this specific purpose? This might make it more convenient to swap out the Krylov-based solver, but I don't mean to add too much burden to this PR/this package for that either.

@Jutho (Member, Author) commented Apr 20, 2026

  • Is it worth refactoring the Sylvester solver into its own function, or is this one too hand-crafted for this specific purpose? This might make it more convenient to swap out the Krylov-based solver, but I don't mean to add too much burden to this PR/this package for that either.

I think it is pretty specific; where else do you think we might use this?

@lkdvos (Member) commented Apr 20, 2026

I really didn't look at this carefully; as in, is this just a regular Sylvester solver? In that case it shows up in the Clebsch-Gordan equations for finite groups.

@Jutho (Member, Author) commented Apr 20, 2026

I really didn't look at this carefully; as in, is this just a regular Sylvester solver? In that case it shows up in the Clebsch-Gordan equations for finite groups.

No, not at all; it uses the fact that one of the two matrices is diagonal (or more generally, easily invertible), and furthermore assumes that the smallest singular value of that matrix is larger than the largest singular value of the other matrix (or more generally: opnorm(inv(B)) * opnorm(A) < 1).

@Jutho (Member, Author) commented Apr 20, 2026

I am happy to discuss the Clebsch-Gordan-for-finite-groups case though; maybe there are some other tricks that can be used.

@Jutho (Member, Author) commented Apr 20, 2026

Enzyme tests seem to be taking an extreme amount of time / GC / allocations. Not sure if this is a consequence of the changes here; I will have to check.

@kshyatt (Member) commented Apr 22, 2026

Enzyme tests seem to be taking an extreme amount of time / GC / allocations. Not sure if this is a consequence of the changes here; I will have to check.

I think it's a consequence of how the Enzyme tests work. Basically, they use autodiff_thunk to generate the functions for forward and reverse pass, then carefully check all the memory properties and correctness. This is great and very thorough but also expensive, so there's a large compilation burden associated with each test.

codecov bot commented Apr 22, 2026

Codecov Report

❌ Patch coverage is 96.83099% with 9 lines in your changes missing coverage. Please review.

| Files with missing lines | Patch % | Lines |
| --- | --- | --- |
| src/pullbacks/svd.jl | 95.00% | 8 Missing ⚠️ |
| ...ixAlgebraKitEnzymeExt/MatrixAlgebraKitEnzymeExt.jl | 0.00% | 1 Missing ⚠️ |

| Files with missing lines | Coverage | Δ |
| --- | --- | --- |
| ...gebraKitMooncakeExt/MatrixAlgebraKitMooncakeExt.jl | 63.38% <100.00%> | (-0.93%) ⬇️ |
| src/MatrixAlgebraKit.jl | 100.00% <ø> | (ø) |
| src/pullbacks/eig.jl | 87.50% <100.00%> | (+1.29%) ⬆️ |
| src/pullbacks/eigh.jl | 83.14% <100.00%> | (+1.89%) ⬆️ |
| src/pullbacks/lq.jl | 97.75% <100.00%> | (+0.45%) ⬆️ |
| src/pullbacks/qr.jl | 96.73% <100.00%> | (+0.68%) ⬆️ |
| ...ixAlgebraKitEnzymeExt/MatrixAlgebraKitEnzymeExt.jl | 1.37% <0.00%> | (+0.08%) ⬆️ |
| src/pullbacks/svd.jl | 90.00% <95.00%> | (+0.74%) ⬆️ |

@kshyatt (Member) commented Apr 22, 2026

GPU CI now passing thanks to a Buildkite fix.

@Jutho (Member, Author) commented Apr 22, 2026

Great. Not sure why it is failing on Ubuntu latest; probably a timeout? I will already swap svd_trunc_pullback! with svd_trunc_pullback2!, as the latter should be much faster (even though these are only tested in the chainrules tests, if I am correct).

@kshyatt (Member) commented Apr 23, 2026

TBH I think the speed of the pullback itself won't change things much; it's entirely compilation time in autodiff_thunk in Enzyme.Compiler, and the GC.

@kshyatt (Member) commented Apr 23, 2026

Just as a note: this should close #150, right?

Jutho and others added 6 commits April 23, 2026 08:09
Co-authored-by: Lukas Devos <ldevos98@gmail.com>
@lkdvos lkdvos linked an issue Apr 23, 2026 that may be closed by this pull request
github-actions bot commented Apr 23, 2026

Your PR no longer requires formatting changes. Thank you for your contribution!

@lkdvos (Member) left a review comment

@Jutho I rebased on top of the latest main and added the gauge dependence removal to the public list. I don't think the performance question for the new SVD pullback needs to be settled before merging this, mostly so that the fix for the full SVD gets out as soon as possible?

@Jutho (Member, Author) commented Apr 23, 2026

Ok. It wasn't really my intention to keep svd_trunc_pullback2! around once some real-world benchmarks confirmed that it is indeed much faster; it should simply replace svd_trunc_pullback!. So the question is whether we merge as is and remove it later? As long as it is not marked public, that should be fine without causing breaking changes, right?

@Jutho (Member, Author) commented Apr 23, 2026

One final change could be to replace Sinv with S⁻¹ in svd_trunc_pullback!.

@lkdvos lkdvos merged commit 54f6021 into main Apr 23, 2026
10 checks passed
@lkdvos lkdvos deleted the jh/svdpullback branch April 23, 2026 19:05
@lkdvos lkdvos mentioned this pull request Apr 24, 2026
lkdvos referenced this pull request Apr 24, 2026
* Update changelog for v0.6.6

* Bump version to v0.6.6


Development

Successfully merging this pull request may close these issues.

  • SVD pullback of rank-deficient matrix
  • svd_pullback doesn't handle output from svd_full

4 participants