Skip to content

Conversation

@lizhenyun01
Copy link
Collaborator

PR types

New features

PR changes

Others

Description

support mla for speculate

@lizhenyun01 lizhenyun01 merged commit 4b11328 into yuanlehome:support-deepseek-v3 Feb 12, 2025
2 checks passed
yuanlehome pushed a commit that referenced this pull request Mar 4, 2025
* fix ppo and grpo v1

* update grpo

* delete notes and modify argument (#10)

* [RL] Fix PPO and add GRPO  (#11)

* delete notes and modify argument

* delete ppo_config.json

* modify format

* lint

* fix model config set

* fix grpo (#12)

* [New Features] support json file data (#13)

* delete notes and modify argument

* delete ppo_config.json

* modify format

* support json data

* modify argument

* fix

* fix ci

* fix

* fix datapath (#14)

* delete notes and modify argument

* delete ppo_config.json

* modify format

* support json data

* modify argument

* fix data

---------

Co-authored-by: greycooker <[email protected]>
Co-authored-by: gongel <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant