-
Notifications
You must be signed in to change notification settings - Fork 105
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
hipblaslt flow support fp8 swizzle #1648
Conversation
09c3cea
to
2bbe58f
Compare
UseInitialStridesAB: false | ||
UseInitialStridesCD: false | ||
UseScaleAB: Scalar | ||
UseScaleAlphaVec: 0 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
should enable SAV for gridbase
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fixed, thx.
2bbe58f
to
c0c89c9
Compare
clients/gtest/matmul_gtest.yaml
Outdated
transB: N | ||
alpha: 1 | ||
beta: [ 0.0, 2.0 ] | ||
scaleA: [2] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
scaleA: [2] is scale vector, which should not be used.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
both scaleA:[2] and scaleB:[2] have been deleted, thanks.
clients/gtest/matmul_gtest.yaml
Outdated
bias_vector: [0, 1] | ||
bias_type: f16_r | ||
unit_check: 1 | ||
gpu_arch: '94[0-2]' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
942 only
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fixed, thanks
c0c89c9
to
13cb63b
Compare
UseE: false | ||
UseInitialStridesAB: false | ||
UseInitialStridesCD: false | ||
UseScaleAB: Vector |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We need to support Scalar but not Vector
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fixed, thanks
clients/gtest/matmul_gtest.yaml
Outdated
M: [128] | ||
N: [128] | ||
K: [128] | ||
swizzle_a: true |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"swizzle_a: true" has been set in real_precisions_swizzleA_support, isn't it?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes, the redundant ones have been removed.
638b95c
to
dc3f395
Compare
dc3f395
to
5170880
Compare
clients/gtest/matmul_gtest.yaml
Outdated
category: pre_checkin | ||
function: | ||
matmul: *real_precisions_swizzleA_support | ||
M: [128] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
add some odd size
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fixed, thanks
5170880
to
0148ca9
Compare
0148ca9
to
0028758
Compare
0028758
to
c08b1cb
Compare
add logic yamls, sample add gtest for fp16,fp8 swizzle assert1
c08b1cb
to
8eaf586
Compare
[----------] Global test environment tear-down
[==========] 53119 tests from 13 test suites ran. (6789418 ms total)
[ PASSED ] 53119 tests.
./build/release/clients/staging/hipblaslt-bench -r f16_r --swizzleA -m 128 -n 128 -k 128 -v --transA T --transB N --initialization rand_int
T,N,0,1,128,128,128,1,128,16384,0,128,16384,128,16384,128,16384,f16_r,f16_r,f16_r,f16_r,f32_r,0,0,0,0,0,none,0,f16_r,391.991,8.55633,10.7,0.0161795,259235,0,1e-05,1e-05
./build/release/clients/staging/hipblaslt-bench -r f16_r --swizzleA -m 129 -n 129 -k 129 -v --transA T --transB N --initialization rand_int
T,N,0,1,129,129,129,1,129,16641,0,129,16641,129,16641,129,16641,f16_r,f16_r,f16_r,f16_r,f32_r,0,0,0,0,0,none,0,f16_r,41.3222,1.0097,103.9,0.0174304,246316,0,1e-05,1e-05
./build/release/clients/staging/hipblaslt-bench -r f8_fnuz_r --c_type f16_r --d_type f16_r --swizzleA -m 128 -n 128 -k 128 -v --transA T --transB N --initialization rand_int
T,N,0,1,128,128,128,1,128,16384,0,128,16384,128,16384,128,16384,f8_fnuz_r,f8_fnuz_r,f16_r,f16_r,f32_r,0,0,0,0,0,none,0,f16_r,364.722,5.3074,11.5,0.0155591,269572,0,1e-05,1e-05
./build/release/clients/staging/hipblaslt-bench -r f8_fnuz_r --c_type f16_r --d_type f16_r --swizzleA -m 129 -n 129 -k 129 -v --transA T --transB N --initialization rand_int
T,N,0,1,129,129,129,1,129,16641,0,129,16641,129,16641,129,16641,f8_fnuz_r,f8_fnuz_r,f16_r,f16_r,f32_r,0,0,0,0,0,none,0,f16_r,248.172,4.17593,17.3,0.0113768,377380,0,1e-05,1e-05