-
Notifications
You must be signed in to change notification settings - Fork 228
Pull requests: NVIDIA-NeMo/RL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: Mask sequences with high logprob error
CI:L1
Run doctests, unit tests, and functional tests
#1838
opened Jan 29, 2026 by
yfw
Loading…
4 tasks
fix: allow multi epoch training for async grpo
#1836
opened Jan 28, 2026 by
parthchadha
Loading…
4 tasks
feat: Allow loading of more general data types
CI:L1
Run doctests, unit tests, and functional tests
community-request
#1834
opened Jan 28, 2026 by
nathan-az
Loading…
Mcore dp coordinator implementation initial
#1833
opened Jan 27, 2026 by
shanmugamr1992
Loading…
4 tasks
docs: Add notes for FP8 recipe in docs/fp8.md
CI:docs
Run doctest
documentation
Improvements or additions to documentation
#1829
opened Jan 26, 2026 by
guyueh1
Loading…
4 tasks
feat: add lora config for dpo dtensor backend
CI:L1
Run doctests, unit tests, and functional tests
#1826
opened Jan 26, 2026 by
RayenTian
Loading…
4 tasks
ci: Allow repo to self publish docs
CI
Relating to CI
#1821
opened Jan 23, 2026 by
chtruong814
Loading…
4 tasks
perf: Update cudnn to 9.14
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1820
opened Jan 23, 2026 by
guyueh1
Loading…
4 tasks
fix: fix statistic of probs_ratio_clamped_min/max
CI:L1
Run doctests, unit tests, and functional tests
#1818
opened Jan 23, 2026 by
yuki-97
Loading…
fix: Unify custom model logits extraction across all inference methods
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1815
opened Jan 23, 2026 by
zpqiu
Loading…
4 tasks
feat: Implement ProRLv2 recipe
CI:L1
Run doctests, unit tests, and functional tests
#1809
opened Jan 22, 2026 by
hijkzzz
Loading…
chore: cuda13 support
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1803
opened Jan 21, 2026 by
guyueh1
Loading…
4 tasks
feat: Timer for the data sharding and job submission
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#1802
opened Jan 21, 2026 by
guyueh1
Loading…
4 tasks
feat: Support lora in dtensor grpo workflow by merging weight
CI:L1
Run doctests, unit tests, and functional tests
#1797
opened Jan 20, 2026 by
RayenTian
Loading…
feat: add speculative decoding during post-training
#1785
opened Jan 15, 2026 by
isomap
Loading…
2 of 4 tasks
feat: NeMo Gym GRPO on-policy fix params; Per-agent group-level rewards
CI:L1
Run doctests, unit tests, and functional tests
#1779
opened Jan 15, 2026 by
bxyu-nvidia
Loading…
4 tasks
refactor: split train and val dataset in preference dataset
CI:L1
Run doctests, unit tests, and functional tests
documentation
Improvements or additions to documentation
#1763
opened Jan 13, 2026 by
yuki-97
Loading…
[docs] Document Gym + RL integration design
documentation
Improvements or additions to documentation
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.