Fix gradient calculation for Kalman #213
Conversation
There were two issues:

1. Using `logdet(D_inv)` caused some failures in its JVP. I'm still not sure why; it was related to `solve(x, b)` for a matrix `x` that was actually perfectly fine. I have replaced this with a better calculation of the determinant anyway; see the derivation comment before it.
2. The QR decomposition failed to compute gradients when `A` had degenerate rank. This is expected: the gradient is not defined where `A` is degenerate, because the QR decomposition is non-unique there. However, what we use in probabilistic terms are gradients with respect to parameters of `R @ R.T`, which is unique and encodes the same information as `cov = chol @ chol.T`. So I fixed the `tria` JVP by making sure the gradient calculation was correct on the span of `R`, and otherwise invariant across decompositions giving the same `R @ R.T`.
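For issue 1, a log-determinant can be computed from a triangular factor rather than via `logdet(D_inv)` and a solve. A minimal sketch of that idea (`logdet_spd` is a hypothetical stand-in name, not necessarily the calculation the PR actually uses): for an SPD matrix `S` with Cholesky factor `L`, `logdet(S) = 2 * sum(log(diag(L)))`.

```python
import jax
import jax.numpy as jnp

jax.config.update("jax_enable_x64", True)

# Hypothetical sketch: log-determinant of an SPD matrix from its
# Cholesky factor, avoiding logdet of an explicitly inverted matrix.
def logdet_spd(S):
    L = jnp.linalg.cholesky(S)  # S = L @ L.T, L lower triangular
    return 2.0 * jnp.sum(jnp.log(jnp.diag(L)))

S = jnp.array([[4.0, 1.0], [1.0, 3.0]])
assert jnp.allclose(logdet_spd(S), jnp.linalg.slogdet(S)[1])
```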
|
@AdrienCorenflos isn't the following test supposed to pass (it doesn't)?

```python
def test_tria_jvp_preserves_gram_matrix_for_rank_deficient_input():
    A = jnp.array([[1.0, 0.0], [1.0, 0.0], [2.0, 3.0]])
    dA = jnp.array([[0.0, 0.0], [1.0, 2.0], [3.0, 4.0]])

    def gram_via_tria(x):
        R = tria(x)
        return R @ R.T

    def gram_direct(x):
        return x @ x.T

    gram_jvp_from_tria = jax.jvp(gram_via_tria, (A,), (dA,))[1]
    gram_jvp_direct = jax.jvp(gram_direct, (A,), (dA,))[1]
    assert jnp.allclose(gram_jvp_from_tria, gram_jvp_direct)
```
|
Co-authored-by: Sahel Iqbal <sahel13miqbal@proton.me>
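For context, a minimal sketch of a `tria`-style factorization, assuming the definition commonly used in square-root Kalman filtering (the repo's actual `tria` may differ): the transposed R-factor of a QR decomposition of `A.T`, which is lower triangular and preserves the Gram matrix `A @ A.T`.

```python
import jax
import jax.numpy as jnp

jax.config.update("jax_enable_x64", True)

# Hypothetical tria: lower-triangular factor with the same Gram matrix,
# i.e. tria(A) @ tria(A).T == A @ A.T, via QR of A.T.
def tria(A):
    r = jnp.linalg.qr(A.T, mode="r")  # A.T = Q @ r, r upper triangular
    return r.T                        # lower triangular, same Gram matrix

A = jnp.array([[1.0, 0.0], [1.0, 0.0], [2.0, 3.0]])
R = tria(A)
assert jnp.allclose(R @ R.T, A @ A.T)
```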
How come the tests are passing? |
Not sure tbh, still checking. Edit: this one is a new test that I added, btw |
|
I think that should pass though yes. What's the output? |
```
E       assert Array(False, dtype=bool)
E        +  where Array(False, dtype=bool) = <PjitFunction of <function allclose at 0x7f757c4d6480>>(
E               Array([[ 1.,  1.,  7.],
E                      [ 1.,  1.,  7.],
E                      [ 7.,  7., 36.]], dtype=float64),
E               Array([[ 0.,  1.,  3.],
E                      [ 1.,  2., 11.],
E                      [ 3., 11., 36.]], dtype=float64))
E        +  where <PjitFunction of <function allclose at 0x7f757c4d6480>> = jnp.allclose
```
|
|
Oh I see, I think your dA vectors are not in the span of A. Edit: you can't have a tangent vector that's not compatible with the input. That's just not possible. |
|
Ok, yeah, good point. Edit: where is this requirement for dA to be in the span of A coming from, though? Can't the input tangent be anything that has a valid shape? I can see that dR is constrained |
|
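A minimal check of this point: the JVP of the direct map `x ↦ x @ x.T` is `dA @ A.T + A @ dA.T`, which is defined for any tangent of matching shape, whatever the rank of `A`. Using the same `A` and `dA` as in the test above:

```python
import jax
import jax.numpy as jnp

jax.config.update("jax_enable_x64", True)

A = jnp.array([[1.0, 0.0], [1.0, 0.0], [2.0, 3.0]])
dA = jnp.array([[0.0, 0.0], [1.0, 2.0], [3.0, 4.0]])  # the tangent flagged as "not in the span of A"

# Forward-mode differentiation of gram(x) = x @ x.T gives the
# product rule dA @ A.T + A @ dA.T, with no span restriction on dA.
gram = lambda x: x @ x.T
_, jvp_val = jax.jvp(gram, (A,), (dA,))
assert jnp.allclose(jvp_val, dA @ A.T + A @ dA.T)
```

This matches the second array in the pytest output above, so the direct Gram JVP itself is well defined for this tangent.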
I'll have a look. I think you're right and I ignored a term in the kernel of R that I shouldn't have |
|
@Sahel13 I had forgotten the null-space part of the gradient because of a stupid mistake! I had written |
|
I put in the test you suggested, @Sahel13, and also increased precision for the Kalman gradient test: what I thought was a numerical issue in finite differences was actually a plain bug!
Co-authored-by: Sahel Iqbal <sahel13miqbal@proton.me>
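On checking gradients against finite differences: a sketch of the kind of comparison involved, with a stand-in smooth scalar function (not the repo's Kalman log-likelihood). In float64, central differences agree with autodiff to roughly `eps**2`, so a tight tolerance distinguishes discretization error from a genuine gradient bug.

```python
import jax
import jax.numpy as jnp

jax.config.update("jax_enable_x64", True)

# Stand-in smooth scalar function; the real test would use the
# Kalman log-likelihood instead.
def f(x):
    return jnp.sum(jnp.sin(x) * x)

def fd_grad(f, x, eps=1e-6):
    # Central finite differences, one coordinate at a time.
    def partial(i):
        e = jnp.zeros_like(x).at[i].set(eps)
        return (f(x + e) - f(x - e)) / (2 * eps)
    return jnp.array([partial(i) for i in range(x.size)])

x = jnp.array([0.3, 1.2, -0.7])
assert jnp.allclose(jax.grad(f)(x), fd_grad(f, x), atol=1e-8)
```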
Closes #211