ModelPruning callback - hard pruning #19347
Unanswered
ilya-SX asked this question in code help: CV
Replies: 0 comments
Hi all,
I am performing structured pruning using the pruning callback (pytorch_lightning.callbacks.ModelPruning). Since it only performs soft pruning (replacing weights with zeros), I see almost no difference in model latency after converting to ONNX. It seems that hard pruning (actually removing the zeroed weights) is needed to reduce latency. My questions are:
Thanks
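For reference, here is a minimal sketch of the soft vs. hard pruning distinction described above, for two consecutive linear layers. It uses torch.nn.utils.prune directly rather than the Lightning callback, and the helper name hard_prune_linear_pair is hypothetical (it is not part of PyTorch or PyTorch Lightning). It also assumes the activation between the layers maps 0 to 0 (e.g. ReLU), so dropped neurons contribute nothing downstream.

```python
import torch
from torch import nn
from torch.nn.utils import prune


def hard_prune_linear_pair(fc1: nn.Linear, fc2: nn.Linear):
    """Hypothetical helper: physically remove fc1 output neurons that are
    entirely zero, together with the matching input columns of fc2."""
    # A neuron is kept if any of its weights (or its bias) is non-zero.
    keep = fc1.weight.detach().abs().sum(dim=1) != 0
    if fc1.bias is not None:
        keep |= fc1.bias.detach() != 0
    idx = keep.nonzero(as_tuple=True)[0]

    new_fc1 = nn.Linear(fc1.in_features, idx.numel(), bias=fc1.bias is not None)
    new_fc2 = nn.Linear(idx.numel(), fc2.out_features, bias=fc2.bias is not None)
    with torch.no_grad():
        new_fc1.weight.copy_(fc1.weight[idx])       # keep surviving rows
        if fc1.bias is not None:
            new_fc1.bias.copy_(fc1.bias[idx])
        new_fc2.weight.copy_(fc2.weight[:, idx])    # keep matching columns
        if fc2.bias is not None:
            new_fc2.bias.copy_(fc2.bias)
    return new_fc1, new_fc2


torch.manual_seed(0)
fc1, fc2 = nn.Linear(8, 6, bias=False), nn.Linear(6, 3)

# Soft structured pruning: zero out half of fc1's output rows (lowest L2 norm).
prune.ln_structured(fc1, name="weight", amount=0.5, n=2, dim=0)
prune.remove(fc1, "weight")  # bake the mask into the weight tensor

# Hard pruning: build smaller layers without the zeroed neurons.
small_fc1, small_fc2 = hard_prune_linear_pair(fc1, fc2)

x = torch.randn(4, 8)
dense_out = fc2(torch.relu(fc1(x)))
small_out = small_fc2(torch.relu(small_fc1(x)))
print(tuple(fc1.weight.shape), "->", tuple(small_fc1.weight.shape))
print("outputs match:", torch.allclose(dense_out, small_out, atol=1e-6))
```

After soft pruning the tensors keep their original shapes, so an exported ONNX graph still does the same amount of dense work. Only the rebuilt, smaller layers reduce the actual computation. Real models (convolutions, batch norm, skip connections) need the same bookkeeping across every layer that consumes the pruned channels, which this sketch does not attempt.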