fix(ppo): exclude no-eos rows from reward normalization#1351
Open
haoyang9804 wants to merge 2 commits into
Open
fix(ppo): exclude no-eos rows from reward normalization#1351haoyang9804 wants to merge 2 commits into
haoyang9804 wants to merge 2 commits into