Skip to content

Commit 243c6f1

Browse files
authored
Add AMXTRANSPOSE detection (#156)
cpuid: Add AMXTRANSPOSE detection Signed-off-by: Oleg Zhurakivskyy <oleg.zhurakivskyy@intel.com>
1 parent 22ab8b9 commit 243c6f1

3 files changed

Lines changed: 221 additions & 217 deletions

File tree

README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -285,6 +285,7 @@ Exit Code 1
285285
| AMXCOMPLEX | Tile computational operations on complex numbers |
286286
| AMXTILE | Tile architecture |
287287
| AMXTF32 | Matrix Multiplication of TF32 Tiles into Packed Single Precision Tile |
288+
| AMXTRANSPOSE | Tile multiply where the first operand is transposed |
288289
| APX_F | Intel APX |
289290
| AVX | AVX functions |
290291
| AVX10 | If set the Intel AVX10 Converged Vector ISA is supported |

cpuid.go

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -85,6 +85,7 @@ const (
8585
AMXTILE // Tile architecture
8686
AMXTF32 // Tile architecture
8787
AMXCOMPLEX // Matrix Multiplication of TF32 Tiles into Packed Single Precision Tile
88+
AMXTRANSPOSE // Tile multiply where the first operand is transposed
8889
APX_F // Intel APX
8990
AVX // AVX functions
9091
AVX10 // If set the Intel AVX10 Converged Vector ISA is supported
@@ -1290,6 +1291,7 @@ func support() flagSet {
12901291
// CPUID.(EAX=7, ECX=1).EDX
12911292
fs.setIf(edx1&(1<<4) != 0, AVXVNNIINT8)
12921293
fs.setIf(edx1&(1<<5) != 0, AVXNECONVERT)
1294+
fs.setIf(edx1&(1<<6) != 0, AMXTRANSPOSE)
12931295
fs.setIf(edx1&(1<<7) != 0, AMXTF32)
12941296
fs.setIf(edx1&(1<<8) != 0, AMXCOMPLEX)
12951297
fs.setIf(edx1&(1<<10) != 0, AVXVNNIINT16)

0 commit comments

Comments
 (0)