Description
Please describe the purpose of the feature. Is it related to a problem?
Add support for goal-conditioned reinforcement learning algorithms, such as contrastive reinforcement learning (https://arxiv.org/pdf/2206.07568).
Describe the solution you'd like
I am willing to implement contrastive reinforcement learning myself, but I would appreciate some guidance from the author on how best to integrate it with the rest of the library. In particular, we need a standardized way of representing goals so that integration is smooth across environments.
Describe alternatives you've considered
I have implemented my own version using a wrapper around the desired environments that returns the goal in the extras of each timestep. However, I am currently unsatisfied with how it integrates with the rest of the library.
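For reference, a minimal sketch of the wrapper approach could look like the following. `TimeStep`, `PointEnv`, `GoalWrapper`, and the `"goal"` key are hypothetical stand-ins for illustration and are not the library's actual types or API.

```python
from dataclasses import dataclass, field, replace
from typing import Any, Callable, Dict, List


@dataclass
class TimeStep:
    """Hypothetical stand-in for the library's timestep type."""
    observation: List[float]
    reward: float
    done: bool
    extras: Dict[str, Any] = field(default_factory=dict)


class PointEnv:
    """Toy 1-D environment, used only to exercise the wrapper."""

    def __init__(self, target: float = 3.0) -> None:
        self.target = target
        self.pos = 0.0

    def reset(self) -> TimeStep:
        self.pos = 0.0
        return TimeStep([self.pos], 0.0, False)

    def step(self, action: float) -> TimeStep:
        self.pos += action
        done = abs(self.pos - self.target) < 0.5
        return TimeStep([self.pos], float(done), done)


class GoalWrapper:
    """Attaches an environment-specific goal to `extras["goal"]` on every timestep."""

    def __init__(self, env: Any, goal_fn: Callable[[Any], List[float]]) -> None:
        self._env = env
        self._goal_fn = goal_fn  # assumption: the goal can be read from the wrapped env

    def _attach(self, ts: TimeStep) -> TimeStep:
        # Copy extras so the wrapped env's timestep is not mutated.
        return replace(ts, extras={**ts.extras, "goal": self._goal_fn(self._env)})

    def reset(self) -> TimeStep:
        return self._attach(self._env.reset())

    def step(self, action: float) -> TimeStep:
        return self._attach(self._env.step(action))
```

The only environment-specific part here is `goal_fn`, which is the piece that would need a standard convention across environments.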
How do we know when implementation of this feature is complete?
Checklist:
- Working implementation of the Monte Carlo contrastive reinforcement learning algorithm
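To make the target of this checklist item more concrete, here is a rough sketch of an InfoNCE-style critic loss, which is one of several contrastive objectives that could be used. It assumes the state-action and goal representations have already been computed, and that row `i` of each batch comes from the same trajectory (the positive pair) while all other rows act as negatives. The function name and argument layout are illustrative only, and the exact objective in the final implementation may differ.

```python
import math
from typing import Sequence


def dot(u: Sequence[float], v: Sequence[float]) -> float:
    return sum(a * b for a, b in zip(u, v))


def infonce_loss(
    sa_repr: Sequence[Sequence[float]],
    goal_repr: Sequence[Sequence[float]],
) -> float:
    """Mean cross-entropy of picking the matching goal for each state-action pair.

    The critic value for pair (i, j) is the inner product of sa_repr[i] and
    goal_repr[j]; the diagonal entries are treated as positives.
    """
    n = len(sa_repr)
    total = 0.0
    for i in range(n):
        logits = [dot(sa_repr[i], goal_repr[j]) for j in range(n)]
        # Numerically stable log-sum-exp over the row.
        m = max(logits)
        log_z = m + math.log(sum(math.exp(x - m) for x in logits))
        total += log_z - logits[i]
    return total / n
```

The loss is near zero when each state-action representation aligns with its own goal, and equals `log(n)` when all critic values are identical.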
Additional context
The goal depends on the environment, so a wrapper for each environment seems inevitable. There is also a need for new evaluators that take a goal as input, and possibly for other changes.
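As a starting point for discussion, a goal-aware evaluator might look roughly like the sketch below. Everything here is an assumption made for illustration: `ToyGoalEnv`, the `reset(goal)` and `step(action)` signatures, the `policy(obs, goal)` call convention, and the reported metrics are not part of the library.

```python
from typing import Callable, Dict, Sequence, Tuple

Goal = Tuple[float, ...]
Obs = Tuple[float, ...]


class ToyGoalEnv:
    """Toy 1-D environment (illustrative only): move along a line toward a goal."""

    def reset(self, goal: Goal) -> Obs:
        self.goal = goal
        self.pos = 0.0
        return (self.pos,)

    def step(self, action: float) -> Tuple[Obs, bool]:
        self.pos += action
        reached = abs(self.pos - self.goal[0]) < 0.5
        return (self.pos,), reached


def evaluate_goal_conditioned(
    env: ToyGoalEnv,
    policy: Callable[[Obs, Goal], float],
    goals: Sequence[Goal],
    max_steps: int = 20,
) -> Dict[str, float]:
    """Runs one episode per goal and reports success rate and mean episode length."""
    successes = 0
    total_steps = 0
    for goal in goals:
        obs = env.reset(goal)
        reached = False
        steps = 0
        while not reached and steps < max_steps:
            obs, reached = env.step(policy(obs, goal))
            steps += 1
        successes += int(reached)
        total_steps += steps
    return {
        "success_rate": successes / len(goals),
        "mean_episode_length": total_steps / len(goals),
    }
```

For example, with a policy that always steps one unit toward the goal, `evaluate_goal_conditioned(ToyGoalEnv(), policy, [(3.0,), (-2.0,)])` reaches both goals. Whether goals should be passed at reset time or read from the timestep extras is exactly the kind of convention that needs to be settled.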