Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

chore: bump Megatron-Bridge to latest main (7110a96) documentation Improvements or additions to documentation
#2223 opened Apr 7, 2026 by yuki-97 Draft
Full Dynamo integration
#2222 opened Apr 6, 2026 by jthomson04 Draft
4 tasks
chore: upgrade Python from 3.12 to 3.13 CI:L1 Run doctests, unit tests, and functional tests documentation Improvements or additions to documentation
#2220 opened Apr 6, 2026 by kajalj22 Draft
3 tasks
Fix preserving dataset merge for SFT community-request needs-follow-up Issue needs follow-up
#2218 opened Apr 6, 2026 by Bungmint Loading…
feat: add SDPO algorithm community-request
#2217 opened Apr 6, 2026 by celineltan Loading…
1 of 4 tasks
chore(beep boop 🤖): bump FW-CI-templates workflow pins to v0.88.0 CI Relating to CI
#2205 opened Apr 3, 2026 by svcnvidia-nemo-ci Loading…
1 task
fix: NVML memory query fallback for DGX Spark community-request needs-follow-up Issue needs follow-up
#2203 opened Apr 3, 2026 by dbuos Loading…
docs: fix typos, grammar, and table issues from QA documentation Improvements or additions to documentation
#2193 opened Apr 2, 2026 by anwithk Loading…
4 tasks
feat: CISPO implementation
#2187 opened Apr 1, 2026 by slikhite-1 Loading…
4 tasks
fix: skip loading reference model when KL penalty is zero
#2178 opened Mar 31, 2026 by yfw Loading…
4 tasks
fix: use prompt token length for advantage group extraction CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version) super-v3
#2176 opened Mar 30, 2026 by yfw Loading…
4 tasks
feat: Merge megatron checkpoints with lora adapters and convert to HF format CI:Lfast Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version) community-request documentation Improvements or additions to documentation needs-follow-up Issue needs follow-up
#2173 opened Mar 30, 2026 by pengdurice Loading…
4 tasks done
ProTip! What’s not been updated in a month: updated:<2026-03-07.