[Feature Request] Implement MBPO algorithm #579

dtch1997 · 2021-09-22T16:02:24Z

Important Note: We do not provide technical support or consulting, and we do not answer personal questions by email.
Please post your question on the RL Discord, Reddit or Stack Overflow in that case.

🚀 Feature

I would like to implement MBPO (Model-Based Policy Optimization), a model-based RL algorithm proposed here.

Motivation

The authors claim that the algorithm is simpler and up to 10x more sample-efficient than baselines such as SAC.
This would be helpful in my own work too.

Checklist

[x] I have checked that there is no similar issue in the repo (required)

Miffyli · 2021-09-22T16:14:09Z

This should be discussed in the contrib repo. Can you open up an issue here?

dtch1997 · 2021-09-27T07:51:24Z

Okay, I have opened an issue in the contrib repo here: Stable-Baselines-Team/stable-baselines3-contrib#43
Closing this issue now.

dtch1997 added the "enhancement" (New feature or request) label Sep 22, 2021

dtch1997 closed this as completed Sep 27, 2021

araffin mentioned this issue Apr 23, 2023

[Feature Request] AlphaZero development #1464

Closed

1 task
