Understanding the Significance of Loss Change in Conservative Q-Learning Training #70

suiniara · 2024-05-15T04:26:20Z

suiniara
May 15, 2024

Hello Cryolite,

Is the change in loss during Conservative Q-Learning training of any reference value?
In my attempts at training, the loss in CQL is always increasing. It remains very stable until 1 million training steps, after which it starts to increase linearly.

Thank you.

suiniara · 2024-05-25T13:20:39Z

suiniara
May 25, 2024
Author

I understand that there is an error in the reward function I wrote.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Understanding the Significance of Loss Change in Conservative Q-Learning Training #70

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Understanding the Significance of Loss Change in Conservative Q-Learning Training #70

Uh oh!

suiniara May 15, 2024

Replies: 1 comment

Uh oh!

suiniara May 25, 2024 Author

suiniara
May 15, 2024

suiniara
May 25, 2024
Author