-
Notifications
You must be signed in to change notification settings - Fork 182
Open
Labels
discussionDiscussion of a typical issue or conceptDiscussion of a typical issue or concept
Description
Hello, thanks for your work!
We have implemented a custom environment with discrete action spaces. We’ve observed that after reaching a certain level of performance (in terms of reward or success rate), the results begin to degrade during further training (we’re training with UniZero).
We have also encountered similar behavior with the original EfficientZero and EfficientZeroV2 repositories when running other custom environments.
Have you encountered performance degradation after a plateau? Are there any specific hyperparameters or strategies for solving it?
Thank you in advance for your response.
Metadata
Metadata
Assignees
Labels
discussionDiscussion of a typical issue or conceptDiscussion of a typical issue or concept