Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: fix mcore train_iters in grpo CI:L1 Run doctests, unit tests, and functional tests
#1383 opened Oct 17, 2025 by yuki-97 Loading…
chore: use pydantic for yaml test validation documentation Improvements or additions to documentation r0.4.0
#1382 opened Oct 17, 2025 by terrykong Loading…
feat: add capability to update weights inflight during generation CI:L1 Run doctests, unit tests, and functional tests
#1381 opened Oct 16, 2025 by parthchadha Loading…
4 tasks
feat: Overlap param iteration and broadcast in non-colocated refit CI:L1 Run doctests, unit tests, and functional tests Performance Related to improving performance
#1379 opened Oct 16, 2025 by youngeunkwon0405 Draft
4 tasks
feat: additional validation losses for preference data documentation Improvements or additions to documentation
#1367 opened Oct 15, 2025 by jveronvialard Draft
4 tasks
Update RL to use megatron-bridge tot
#1358 opened Oct 14, 2025 by yaoyu-33 Loading…
4 tasks
feat: GSPO-token
#1357 opened Oct 14, 2025 by pjin-nvidia Draft
4 tasks
feat: add kl penalty k1, k2 CI:L1 Run doctests, unit tests, and functional tests
#1349 opened Oct 13, 2025 by yuki-97 Draft
feat: support truncated importance sampling CI:L1 Run doctests, unit tests, and functional tests
#1348 opened Oct 13, 2025 by yuki-97 Loading…
fix: Fix policy worker placement when using unified placement group CI:L1 Run doctests, unit tests, and functional tests r0.4.0
#1341 opened Oct 11, 2025 by guyueh1 Loading…
4 tasks
Docs: Refactor Home Page and New About Section documentation Improvements or additions to documentation r0.4.0
#1338 opened Oct 10, 2025 by jgerh Loading…
feat: Integrate Penguin env
#1336 opened Oct 10, 2025 by bxyu-nvidia Loading…
4 tasks
chore: big version bump
#1334 opened Oct 10, 2025 by terrykong Draft
feat: add Megatron support for on-policy distillation CI:L1 Run doctests, unit tests, and functional tests r0.4.0
#1324 opened Oct 9, 2025 by zpqiu Loading…
4 tasks done
feat: Onboard perf recipes in tests
#1322 opened Oct 8, 2025 by guyueh1 Draft
4 tasks
feat: Add debugger flag that can be turned on via RAY_DEBUG=legacy CI:L0 Run doctests and unit tests documentation Improvements or additions to documentation
#1312 opened Oct 8, 2025 by guyueh1 Loading…
4 tasks
feat: [Draft Do Not merge] Kitchen interface
#1310 opened Oct 8, 2025 by guyueh1 Draft
4 tasks
fix: more robust fp8 rollout metric check CI:L0 Run doctests and unit tests r0.4.0
#1307 opened Oct 8, 2025 by terrykong Loading…
4 tasks
User/joyang/megatron cmp
#1298 opened Oct 7, 2025 by joyang-nv Draft
4 tasks
feat: Generation sampling params
#1290 opened Oct 6, 2025 by pjin-nvidia Draft
4 tasks
ProTip! Filter pull requests by the default branch with base:main.