-
Notifications
You must be signed in to change notification settings - Fork 3.1k
Support deepseek v3 #9835
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: develop
Are you sure you want to change the base?
Support deepseek v3 #9835
Conversation
support append_attn c16 for deep-seek-v3
Fix rope&fix precision
…nto support-deepseek-v3
Thanks for your contribution! |
…nto support-deepseek-v3
…nto support-deepseek-v3
…nto support-deepseek-v3
support mla for speculate
Codecov ReportAttention: Patch coverage is
❌ Your patch check has failed because the patch coverage (0.64%) is below the target coverage (80.00%). You can increase the patch coverage or adjust the target coverage. Additional details and impacted files@@ Coverage Diff @@
## develop #9835 +/- ##
===========================================
- Coverage 49.95% 49.89% -0.06%
===========================================
Files 757 757
Lines 122540 122691 +151
===========================================
+ Hits 61215 61219 +4
- Misses 61325 61472 +147 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
root seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account. You have signed the CLA already but the status is still pending? Let us recheck it. |
This Pull Request is stale because it has been open for 60 days with no activity. 当前Pull Request 60天内无活动,被标记为stale。 |
Before submitting
tests
folder. If there are codecov issues, please add tests cases first.PR types
PR changes
Description