SylarTiaNII
Contributor

PR types

Others

PR changes

Others

Description

  • ZCC supports saving RNG states
  • Optimize ZCC save time
  • Qwen-A3B supports TP-MoE
  • Qwen-A3B supports fused FFN and attention
  • Increase gate precision of Qwen-A3B to FP32
  • Fix normal save and load
  • Support disabling the Paddle API monkey patch via a FLAG
  • Use safer gradient synchronization when sequence parallelism is enabled
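
To illustrate why saving RNG states alongside a checkpoint matters, here is a minimal sketch using only Python's standard `random` module. It is not the ZCC implementation and does not use any Paddle API; the helper names `save_checkpoint`, `load_checkpoint`, and `demo`, as well as the checkpoint layout, are hypothetical and chosen for illustration only. The idea shown is that a resumed run draws the same random sequence as an uninterrupted one only if the generator state is restored.

```python
import pickle
import random


def save_checkpoint(step: int, rng: random.Random) -> bytes:
    """Serialize the step counter together with the generator state.

    Hypothetical layout: a real checkpoint would also hold model and
    optimizer state.
    """
    return pickle.dumps({"step": step, "rng_state": rng.getstate()})


def load_checkpoint(blob: bytes, rng: random.Random) -> int:
    """Restore the generator state in place and return the saved step."""
    ckpt = pickle.loads(blob)
    rng.setstate(ckpt["rng_state"])
    return ckpt["step"]


def demo() -> bool:
    """Check that a resumed run reproduces the uninterrupted random sequence."""
    rng = random.Random(1234)
    for _ in range(3):
        rng.random()  # stand-in for three training steps that consume randomness

    blob = save_checkpoint(3, rng)

    # Values an uninterrupted run would draw after the checkpoint.
    expected = [rng.random() for _ in range(5)]

    # Simulate a restart: a fresh generator with an unrelated seed.
    resumed_rng = random.Random(0)
    step = load_checkpoint(blob, resumed_rng)
    resumed = [resumed_rng.random() for _ in range(5)]

    return step == 3 and resumed == expected
```

Without the `setstate` call, the resumed generator would continue from its own seed, so dropout masks or data shuffling after the restart would differ from the original run.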

paddle-bot bot commented Sep 2, 2025

Thanks for your contribution!

@SylarTiaNII SylarTiaNII changed the title from "Cherry-pick useful PR from fleety" to "Cherry-pick useful PRs from fleety" Sep 2, 2025
Collaborator

@From00 From00 left a comment

LGTM

@From00 From00 merged commit 68645cc into PaddlePaddle:develop Sep 5, 2025
6 of 7 checks passed
3 participants