Why split matrix A/B between CTAs for 2SM MMA instruction? Code and documentation seem contradictory? #2169
Context: I’m trying to understand how matrices A and B are stored in shared memory for the 2SM MMA instruction, but the code and documentation seem contradictory:
- Code
- Documentation
Question: Am I misunderstanding something about how the 2SM MMA instruction works? Any clarification on how the data is stored and used or how the full result is obtained would be greatly appreciated. Thanks!
Replies: 1 comment 2 replies
The two CTAs are indeed cooperating to produce the full MxN tile.
Where is the documentation incorrect? Your link does not point to a specific line.
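
For readers following along, here is a toy sketch of what "cooperating" could look like. It is not the actual kernel or any library API. It assumes, and this thread does not confirm it, that each CTA stages half of the M rows of A and half of the N columns of B in its own shared memory, that the instruction reads the B halves from both CTAs, and that each CTA ends up holding an (M/2 x N) slice of the accumulator. The names `matmul` and `two_cta_tile` are hypothetical, and plain Python lists stand in for shared memory.

```python
# Toy model only: Python lists stand in for shared memory and accumulators.
# ASSUMPTION (not confirmed in this thread): each CTA stages half of the
# M rows of A and half of the N columns of B, and the paired MMA reads the
# B halves from both CTAs.

def matmul(a, b_t):
    """Reference product of a (m x k) with b_t (n x k, B stored K-major)."""
    return [[sum(x * y for x, y in zip(row_a, row_b)) for row_b in b_t]
            for row_a in a]

def two_cta_tile(a, b_t):
    """Hypothetical helper: build the M x N tile from two per-CTA halves."""
    half_m, half_n = len(a) // 2, len(b_t) // 2
    # Per-CTA staging: neither CTA holds all of A or all of B.
    smem = [
        {"a": a[:half_m], "b": b_t[:half_n]},   # CTA 0
        {"a": a[half_m:], "b": b_t[half_n:]},   # CTA 1
    ]
    # Cooperative step: both B halves together cover the full N extent.
    full_b = smem[0]["b"] + smem[1]["b"]
    # Each CTA accumulates its own (M/2 x N) slice of the output.
    acc = [matmul(cta["a"], full_b) for cta in smem]
    # Stacking the two slices gives the full M x N tile.
    return acc[0] + acc[1]
```

Under these assumptions, splitting A and B between the CTAs does not lose any of the result: the stacked per-CTA slices equal the ordinary product of the unsplit operands.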