DeepSpeedPlugin: non-float16 metrics #7476
Unanswered
leezu asked this question in DDP / multi-GPU / multi-node
Replies: 1 comment 5 replies
We introduced float16 metrics in v0.3.0.
The PyTorch Lightning (PL) DeepSpeedPlugin passes the whole LightningModule to the DeepSpeedEngine, which casts the provided model to float16 during initialization. This cast also applies to the floating-point states of any torchmetrics metrics attached to the module.
It would be helpful to be able to keep non-float16 states for metrics when using DeepSpeedPlugin.
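One reason this may matter (an assumption on my part, illustrated without DeepSpeed or torchmetrics): a metric state that accumulates in float16 stops being exact fairly quickly. Below is a minimal, standard-library-only sketch in which the hypothetical helpers `to_float16` and `count_updates` stand in for a metric's running count. It rounds the accumulator to IEEE 754 half precision after every update, which is what storing the state in float16 amounts to.

```python
import struct


def to_float16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    # The struct format "e" is a 2-byte half-precision float.
    return struct.unpack("e", struct.pack("e", x))[0]


def count_updates(n_updates: int, half_precision: bool) -> float:
    """Hypothetical stand-in for a metric state that counts update() calls."""
    total = 0.0
    for _ in range(n_updates):
        total = total + 1.0
        if half_precision:
            # Storing the state in float16 rounds it after every update.
            total = to_float16(total)
    return total


full = count_updates(5000, half_precision=False)
half = count_updates(5000, half_precision=True)
print(full, half)  # prints 5000.0 2048.0
```

Above 2048, float16 can only represent every second integer, so adding 1 to the accumulator rounds back to 2048 and the count stops growing. A state kept in float32 or float64 would not hit this limit for counts of this size.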
cc @SeanNaren