Open
Labels
- module: 64-bit (Problems related to incorrectly using 32-bit integers when 64-bit is needed, e.g., 8G tensors)
- module: crash (Problem manifests as a hard crash, as opposed to a RuntimeError)
- module: mps (Related to Apple Metal Performance Shaders framework)
- triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
Description
🐛 Describe the bug
Random ATen operations that use the MPS backend can fail with Error: total bytes of NDArray > 2**32, or with similar errors that expect either the tensor size to be less than 4 GB or the total number of elements to be indexable by a 32-bit index.
This is an umbrella issue to track those (they should be searchable by the labels module: mps + module: 64-bit).
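To make the two limits above concrete, here is a rough sketch that checks a shape and element size against them without allocating anything. The helper name mps_size_risk and both constants are hypothetical, not part of PyTorch. The byte limit is read off the quoted error message; treating the index limit as a signed 32-bit value is an assumption, and individual ops may use a different bound.

```python
import math

# Assumed limits, inferred from the error messages quoted in this issue.
MAX_NDARRAY_BYTES = 2**32      # "total bytes of NDArray > 2**32"
MAX_INT32_INDEX = 2**31 - 1    # assumption: ops index with a signed 32-bit int


def mps_size_risk(shape, itemsize):
    """Return (numel, nbytes, over_bytes, over_index) for a hypothetical tensor.

    shape    -- iterable of dimension sizes
    itemsize -- bytes per element (e.g. 4 for float32, 2 for float16)
    """
    numel = math.prod(shape)
    nbytes = numel * itemsize
    over_bytes = nbytes > MAX_NDARRAY_BYTES
    over_index = numel > MAX_INT32_INDEX
    return numel, nbytes, over_bytes, over_index


if __name__ == "__main__":
    # float32 tensor slightly larger than 4 GiB: trips the byte limit only.
    print(mps_size_risk((32769, 32768), 4))
    # int8 tensor with 2**31 elements: trips the index limit only.
    print(mps_size_risk((2**31,), 1))
```

Since only shapes and element sizes are involved, a check like this can run on machines that cannot hold the tensors themselves. It only flags inputs that fall in the risky range; it does not tell whether a given op actually mishandles them.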
We need to figure out some tooling to detect broken ops and a way to run tests, as allocating large tensors on the machines we have right now is simply not going to work.
Example of existing issues:
- MPS Error: NDArray > 2^32 bytes in scaled_dot_product_attention #149261
- On the Apple M1 GPU, torch.where() finds tensor entries in the wrong place #140570
- MPS torch.where() is giving objectively incorrect results, leading to critical calculation errors #122916
- MPS MatMul (even 2d) broken #116769
- MPSNDArray limits the memory size of a single NDArray to 4GB #143859
- [MPS] "Can't be indexed using 32-bit iterator" error as of 20250430 nightly cpu MacOS build #154828
Versions
CI