How to draw the score curve of PPO? #78

AceChuse · 2018-12-19T07:26:57Z

Hello,

I don't know how to draw the PPO score curve shown in the PPO paper. How should I handle the case where the sample pool is full but the game is not over? If we end the game at that point, we cannot measure the score for tasks in which the agent needs more than Horizon (T) interactions with the environment to finish an episode, such as Walker2d-v1. But if we don't end the game when the sample pool is full, the number of samples may be larger than Horizon (T).

I don't know how to deal with this problem. How did you handle it in the PPO experiments? I really care about this. Thanks for your help.
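To make the question concrete, here is a minimal sketch of one possible approach. It is an assumption on my part, not something confirmed by the paper's authors or this repository: stop collecting after exactly T steps, but do not reset the environment. The unfinished episode continues in the next rollout, and its return is accumulated separately from the sample pool, so the pool never exceeds T while full episode scores are still logged. `ToyEnv` and `RolloutCollector` are hypothetical names used only for illustration; `ToyEnv` stands in for a Gym-style environment.

```python
class ToyEnv:
    """Hypothetical stand-in for a Gym-style environment (not a real library class)."""

    def __init__(self, episode_len=5):
        self.episode_len = episode_len
        self.t = 0

    def reset(self):
        self.t = 0
        return 0.0

    def step(self, action):
        self.t += 1
        done = self.t >= self.episode_len
        # Returns (observation, reward, done); reward is fixed at 1.0 per step.
        return float(self.t), 1.0, done


class RolloutCollector:
    """Collects fixed-size batches of T steps without cutting episodes short."""

    def __init__(self, env, horizon):
        self.env = env
        self.horizon = horizon
        self.obs = env.reset()       # environment state persists across batches
        self.running_return = 0.0    # return of the episode currently in progress
        self.total_steps = 0
        self.finished = []           # (total_steps, episode_return) points for the curve

    def collect(self, policy):
        batch = []
        for _ in range(self.horizon):
            action = policy(self.obs)
            next_obs, reward, done = self.env.step(action)
            batch.append((self.obs, action, reward, done))
            self.running_return += reward
            self.total_steps += 1
            if done:
                # Log the score only when the episode really ends.
                self.finished.append((self.total_steps, self.running_return))
                self.running_return = 0.0
                next_obs = self.env.reset()
            self.obs = next_obs
        return batch


collector = RolloutCollector(ToyEnv(episode_len=5), horizon=3)
batches = [collector.collect(lambda obs: 0) for _ in range(4)]
```

In this sketch every batch has exactly T = 3 samples, even though each episode lasts 5 steps, and `collector.finished` holds the (timestep, score) pairs that could be plotted as the score curve. For the segment that is cut off at the end of a batch, the advantage estimate would presumably bootstrap from the value function at the last state, but that part is not shown here.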
