Replies: 1 comment 2 replies
-
those A and B values are read by more than just one thread. We simply show the first one. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
For gtc2025 slides example:
I think thread[0,1,...7] holds MxK Amatrix, and thread[0, 8, 16] hold another matrix, and I'm not very clear that why output matrix looks like this.
In the black box T4V0 and T32V0 generate output T36V0, does it mean the output data hold in thread36 V0?
Beta Was this translation helpful? Give feedback.
All reactions