How to plot uncertainty for episodic setting? (different random seeds lead to different x-values) #110

Open
zuoxingdong opened this issue Feb 13, 2019 · 0 comments


zuoxingdong commented Feb 13, 2019

When we deal with episodic environments (say CartPole), episodes terminate at different lengths in different random runs, due to both learning progress and randomness.

What would be a good way to plot uncertainty over different runs when the x-values are total timesteps (interactions with the environment), which is the standard curve for showing sample complexity in the community? Because episode lengths differ, each run records its data points at different x-values.

E.g., a curve like this:
[image]

A naive solution is to fit a polynomial to each curve and generate new data points at consistent x-values. However, this might be very misleading, because some of the curves might have very sparse data points.
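To make the resampling idea concrete, here is a minimal sketch. It swaps the polynomial fit for plain linear interpolation (`np.interp`) onto a shared timestep grid. The function name `resample_runs`, the `num_points` parameter, and the synthetic data are all hypothetical and only for illustration; this is not something the library provides, and it does not remove the concern about sparse curves.

```python
import numpy as np


def resample_runs(runs, num_points=50):
    """Resample runs onto a shared timestep grid by linear interpolation.

    runs: list of (timesteps, returns) pairs, one per random seed, where
    timesteps is increasing. (Hypothetical helper, not a library API.)
    """
    # Use only the timestep range covered by every run, so no run is extrapolated.
    start = max(t[0] for t, _ in runs)
    stop = min(t[-1] for t, _ in runs)
    grid = np.linspace(start, stop, num_points)
    # One row per run, all evaluated at the same x-values.
    values = np.stack([np.interp(grid, t, r) for t, r in runs])
    return grid, values.mean(axis=0), values.std(axis=0)


# Synthetic example (assumed data): three seeds, each logging one point at the
# end of every episode, with random episode lengths.
rng = np.random.default_rng(0)
runs = []
for _ in range(3):
    lengths = rng.integers(10, 200, size=40)
    timesteps = np.cumsum(lengths)      # x-values differ from seed to seed
    returns = lengths.astype(float)     # stand-in for the episode return
    runs.append((timesteps, returns))

grid, mean, std = resample_runs(runs)
```

The resulting `mean` and `std` share the x-values in `grid`, so they could be drawn as a line plus a shaded band, for example with matplotlib's `fill_between`. Linear interpolation between sparse points can still be misleading, just less wildly than a high-degree polynomial.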

What do you think, @jachiam?
