
Flamefire
Collaborator

Remove an optimization that could pass a transposed matrix as input to linalg_lu_factor_ex_out.

Depending on whether the input memory layout is contiguous, this may lead to slightly different results. Those small differences can grow in subsequent steps and ultimately cause test failures in e.g. test_vmapvjp_linalg_tensorsolve_cpu_float32 and test_vmapvjpvjp_linalg_tensorsolve_cpu_float32.
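
For illustration, here is a minimal sketch of the layout dependence described above. This is not code from the PR; it uses torch.linalg.solve rather than the affected linalg.tensorsolve tests, and whether the two results actually differ bit-for-bit depends on the PyTorch build and BLAS backend:

```python
import torch

torch.manual_seed(0)
A = torch.randn(5, 5)
b = torch.randn(5, 3)

# Same logical values, different memory layouts: A_noncontig stores the data
# transposed, so it is a non-contiguous tensor with the same contents as A.
A_contig = A.clone()
A_noncontig = A.t().contiguous().t()
assert A_contig.is_contiguous() and not A_noncontig.is_contiguous()

x1 = torch.linalg.solve(A_contig, b)
x2 = torch.linalg.solve(A_noncontig, b)

# With the old optimization the non-contiguous input could be factorized via a
# transposed code path, so x1 and x2 could differ in the last few bits; such
# tiny differences were then amplified by the vmap/vjp transforms in the tests.
print(torch.equal(x1, x2), (x1 - x2).abs().max().item())
```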

The intended optimization no longer applies after 59bc76f, so this code can be removed, which also resolves the accuracy issues observed in those tests.

With this change the code paths used for the "regular" and "vmap" cases are identical: a batched tensor is iterated over along the batch dimension in lu_solve and lu_factor.
Prior to this that was not necessarily the case, as either tensor could have been non-contiguous, leading to a transposed tensor being used for the LU factorization instead.

So the (CPU) results should now be identical.
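
As an illustration of the now-shared code path, here is a minimal sketch comparing the vmap-transformed solve against an explicit per-sample loop (again using torch.linalg.solve for brevity; the exact-equality check mirrors the expectation stated above for CPU):

```python
import torch
from torch.func import vmap

torch.manual_seed(0)
A = torch.randn(4, 5, 5)  # batch of coefficient matrices
b = torch.randn(4, 5)     # batch of right-hand sides

def solve(A_i, b_i):
    return torch.linalg.solve(A_i, b_i)

# vmap path: the batched tensor is handled by the same per-matrix LU
# factorization that the explicit loop below uses.
x_vmap = vmap(solve)(A, b)
x_loop = torch.stack([solve(A[i], b[i]) for i in range(A.shape[0])])

# With identical code paths the CPU results are expected to match exactly,
# not merely within a tolerance.
print(torch.equal(x_vmap, x_loop))
```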

Fixes #151440
Fixes #114868


pytorch-bot bot commented Jun 3, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/154983

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures

As of commit 80295cd with merge base 81dbeb0:

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@Flamefire changed the title from "Solve accuracy fix" to "Avoid differing results in linalg.(tensor_)solve" on Jun 3, 2025
@Flamefire added the release notes: nn label (release notes category) on Jun 3, 2025
@lezcano
Collaborator

lezcano commented Jun 3, 2025

@pytorchbot merge

@pytorch-bot bot added the ciflow/trunk label (Trigger trunk jobs on your pull request) on Jun 3, 2025
@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here

@lezcano
Collaborator

lezcano commented Jun 3, 2025

the mps failures are real. @malfet is there anything that needs changing in the mps path?

@pytorchmergebot
Collaborator

Merge failed

Reason: 3 jobs have failed, first few of them are: trunk / macos-py3-arm64 / test (mps, 1, 1, macos-m2-15), trunk / macos-py3-arm64 / test (mps, 1, 1, macos-m1-14), trunk / macos-py3-arm64 / test (mps, 1, 1, macos-m1-13)

Details for Dev Infra team: raised by workflow job

Collaborator

@albanD left a comment


Sounds good.
Is the MPS failure still a problem? Can you rebase to get fresh CI signal?