yangshiqi
The current code returns the user-configured limit as *total directly, without checking it against the device's actual memory capacity.

After the fix, even if CUDA_DEVICE_MEMORY_LIMIT=15360MB is set, the returned total will not exceed the memory actually available on the T4 card (about 14917MB), which avoids the "No execution plan worked!" error.
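
For illustration, the clamping idea can be sketched roughly as below. This is a minimal standalone sketch, not the code changed in this PR: the helper name clamp_reported_total, the MIB macro, and the convention that a limit of 0 means "no limit configured" are all assumptions made for the example.

```c
#include <assert.h>
#include <stdint.h>

#define MIB (1024ULL * 1024ULL)

/*
 * Hypothetical helper (not the function touched by this PR).
 * Returns the total memory to report to the application:
 * the configured limit, but never more than the physical capacity.
 */
uint64_t clamp_reported_total(uint64_t configured_limit, uint64_t physical_total)
{
    /* Assumption: a configured limit of 0 means no limit was set. */
    uint64_t reported = physical_total;

    /* Only honor the limit when it is smaller than the hardware capacity. */
    if (configured_limit != 0 && configured_limit < physical_total) {
        reported = configured_limit;
    }

    /* Postcondition: the reported total never exceeds what the device has. */
    assert(reported <= physical_total);
    return reported;
}
```

With the numbers from this description, clamp_reported_total(15360 * MIB, 14917 * MIB) yields 14917 * MIB, while a limit below the physical capacity (say 8192 * MIB) is returned unchanged.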

hami-robott bot commented Jun 11, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: yangshiqi

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@hami-robott hami-robott bot added the size/S label Jun 11, 2025