-
verl Public
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedDec 23, 2025 -
mbridge Public
Forked from ISEEKYAN/mbridgeBridge Megatron-Core to Hugging Face/Reinforcement Learning
Python Other UpdatedNov 13, 2025 -
AReaL Public
Forked from inclusionAI/AReaLDistributed RL System for LLM Reasoning
Python Apache License 2.0 UpdatedJun 13, 2025 -
alphaxiv-open Public
Forked from AsyncFuncAI/alphaxiv-openAlphaXIV open-source alternative: Chat with any arXiv paper.
Python MIT License UpdatedMay 24, 2025 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedApr 16, 2025 -
DualPipe Public
Forked from deepseek-ai/DualPipeA bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
Python MIT License UpdatedMar 10, 2025 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedFeb 26, 2025 -
LLaMA-Factory Public
Forked from hiyouga/LlamaFactoryUnified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Python Apache License 2.0 UpdatedFeb 19, 2025