-
Notifications
You must be signed in to change notification settings - Fork 754
Open
Labels
feature requestFunctionality does not currently exist, would need to be created as a new feature (type)Functionality does not currently exist, would need to be created as a new feature (type)
Description
🌱 Describe Feature Request
I trained a Transformer model. When I converted it as a whole into an mlmodel, I found that its intelligence could only be processed on the cpu. After splitting it into encode and decode, I discovered that encode could be normally accelerated using the NPU, but decode could only be processed on the cpu. Is it because decode is self-decoding, not a static issue? If decode can be accelerated by NPU, could a method for converting pt to mlmodel or mlpackage be provided
thanks
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
feature requestFunctionality does not currently exist, would need to be created as a new feature (type)Functionality does not currently exist, would need to be created as a new feature (type)