We don't look at NaN afair.
I've created an auto-resetting transformed environment where observations at done steps are marked as NaN. When I used the bundled GAE on the rollout data from my env, I found NaNs in the GAE output entries. Here is my transformed env.
And GAE is used like this (btw, am I selecting the right data?):
Printing "advantage", "state_value", and "value_target" then showed NaNs in those tensors. So my question is: does the GAE implementation account for auto-resetting envs? If it does, then either there shouldn't be NaNs or I am using it wrongly. Could you please help me diagnose the problem?
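For reference, here is a minimal sketch of how a NaN can leak through a GAE-style recursion even when the done mask zeroes out the bootstrap term, because in IEEE 754 arithmetic `NaN * 0` is still `NaN`. It is plain Python and not the library's implementation: the helper name `gae`, the hyperparameters, and the sample numbers are all hypothetical, and it assumes the NaN observation at the done step ends up in the next-state value estimate.

```python
import math

def gae(rewards, values, next_values, dones, gamma=0.99, lmbda=0.95):
    """Textbook GAE over one trajectory (hypothetical helper, not the library's code)."""
    advantages = [0.0] * len(rewards)
    running = 0.0
    for t in reversed(range(len(rewards))):
        not_done = 1.0 - float(dones[t])
        # TD error: the bootstrap term is multiplied by not_done, but NaN * 0.0 is NaN
        delta = rewards[t] + gamma * next_values[t] * not_done - values[t]
        # The running advantage carries any NaN backward to earlier steps
        running = delta + gamma * lmbda * not_done * running
        advantages[t] = running
    return advantages

rewards = [1.0, 1.0, 1.0]
values = [0.5, 0.5, 0.5]
dones = [False, True, False]

# Assumption: the value of the NaN observation at the done step is itself NaN
nan_next = [0.5, math.nan, 0.5]
adv_nan = gae(rewards, values, nan_next, dones)

# Same data with the NaN replaced by 0.0 before the call
clean_next = [0.0 if math.isnan(v) else v for v in nan_next]
adv_clean = gae(rewards, values, clean_next, dones)

print(adv_nan)
print(adv_clean)
```

In this toy case the NaN contaminates the done step and every step before it, while the step after the reset stays finite. Replacing the NaN entries with a finite value before the advantage computation (for example with `torch.nan_to_num` on tensors) keeps the masked term at zero; whether that is the right fix for a given env is a separate question.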