-
Notifications
You must be signed in to change notification settings - Fork 408
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support Sparse Inference for RVV #8081
base: master
Are you sure you want to change the base?
Conversation
@oliIMG @ken-unger If you don't mind, I believe we should drop #7116 since this PR is aligned better to current upstream status. |
No issues from me @NonerKao. I'm not sure if @oliIMG is contributing here any longer and as you say #7116 is stale now. I'm doing likewise with some of the stale Q* rvv PRs. Good to have your new PR and I'm happy to take a look at the changes, but I'm not a maintainer and so can't do any approvals. |
Hi @dsharlet , would you mind taking a look when you get a chance? |
c313deb
to
0d03661
Compare
The baseline implementation of SPMM kernel was mostly inherited PR google#7116. This patch futher simplifies the leftover handling, aligns the template and naming to other rvv kernels, and also contains miscellaneous fixes according to recent refactor.
0d03661
to
7de15ab
Compare
@dsharlet Sorry for the inconvenience, but here are some updates:
All tests for these kernels can be generated. Also they pass: 39 for |
Hi @dsharlet Any concerns on this PR? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the PR. In the future, we'd prefer to see changes like this broken up into smaller changes, this looks like it should be at least 3 separate PRs (conv, dwconv, spmm).
@dsharlet we'll need your help to merge this PR. Thank you. |
-- 60d5e0b by Gary Yi-Hung Chen <[email protected]>: Add RVV f32 SPMM kernel The baseline implementation of SPMM kernel was mostly inherited PR #7116. This patch futher simplifies the leftover handling, aligns the template and naming to other rvv kernels, and also contains miscellaneous fixes according to recent refactor. -- f7e7eba by Gary Yi-Hung Chen <[email protected]>: Add RVV f32-conv-hwc2chw-3x3s2p1c3 kernel -- 7de15ab by Gary Yi-Hung Chen <[email protected]>: Add RVV f32-dwconv2d-chw kernel FUTURE_COPYBARA_INTEGRATE_REVIEW=#8081 from NonerKao:dev-alan-contribute-spmm 7de15ab PiperOrigin-RevId: 745899104
Hi @dsharlet and @ken-unger Thanks for pushing this patch further. Since the test reveals a failed case, are we supposed to fix that fail?
|
No description provided.