QS8-IGEMM support NR for Hexagon HVX #8219

Open
wants to merge 1 commit into
base: master

copybara-service[bot]
Contributor

QS8-IGEMM support NR for Hexagon HVX

  • Allow NR=32, 64 or 128
  • Remove acc2 main loop
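For readers unfamiliar with the tile naming, the following is a rough scalar sketch of what a row of an NR-wide, c4 int8 tile computes. It is not the HVX kernel from this change: the function names, the packed weight layout `[k/4][nr][4]`, and the assumption of 128-byte HVX vectors (32 int32 lanes each) are illustrative only.

```c
#include <stddef.h>
#include <stdint.h>

// Hypothetical scalar reference, not XNNPACK code.
// Accumulates one output row of `nr` columns, consuming K in groups of 4 ("c4").
// Assumed (illustrative) weight layout: w[k/4][n][0..3]; kc must be a multiple of 4.
static void ref_row_c4(size_t nr, size_t kc, const int8_t* a, const int8_t* w,
                       int32_t* acc) {
  for (size_t k = 0; k < kc; k += 4) {
    for (size_t n = 0; n < nr; n++) {
      const int8_t* wn = w + (k / 4) * nr * 4 + n * 4;
      int32_t sum = 0;
      // 4-element int8 dot product widened to int32.
      for (size_t i = 0; i < 4; i++) {
        sum += (int32_t) a[k + i] * (int32_t) wn[i];
      }
      acc[n] += sum;
    }
  }
}

// Number of int32 accumulator vectors one row would occupy, assuming
// 128-byte vectors, i.e. 32 int32 lanes per vector.
static size_t acc_vectors_per_row(size_t nr) {
  return nr / 32;
}
```

Under that assumption, NR=32, 64 and 128 would correspond to 1, 2 and 4 accumulator vectors per row.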

QS8MobileNetV2 is 3.348x faster (939510 us / 280613 us):
Now, with xnn_qs8_qc8w_igemm_minmax_fp32_ukernel_8x32c4__hvx:
QS8MobileNetV2/real_time 280613 us 280614 us

Before, with the scalar kernel:
QS8MobileNetV2/real_time 939510 us 939512 us

PiperOrigin-RevId: 744774232