Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] Implement MBPO algorithm #579

Closed
dtch1997 opened this issue Sep 22, 2021 · 2 comments
Closed

[Feature Request] Implement MBPO algorithm #579

dtch1997 opened this issue Sep 22, 2021 · 2 comments
Labels
enhancement New feature or request

Comments

@dtch1997
Copy link

Important Note: We do not do technical support, nor consulting and don't answer personal questions per email.
Please post your question on the RL Discord, Reddit or Stack Overflow in that case.

🚀 Feature

I would like to implement a model-based RL algorithm, MBPO proposed here.

Motivation

The proposed algorithm claims to be simpler and up to 10x as sample efficient as some other baselines like SAC.
This would be helpful in my own work too.

### Checklist

  • [ x] I have checked that there is no similar issue in the repo (required)
@dtch1997 dtch1997 added the enhancement New feature or request label Sep 22, 2021
@Miffyli
Copy link
Collaborator

Miffyli commented Sep 22, 2021

This should be discussed in the contrib repo. Can you open up an issue here?

@dtch1997
Copy link
Author

dtch1997 commented Sep 27, 2021

Okay I have opened an issue in the contrib repo here: Stable-Baselines-Team/stable-baselines3-contrib#43
Closing this issue now

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants