QS8 gemm use V73 int to float and multiply for quantization #8242

copybara-service · 2025-04-10T07:36:56Z

QS8 gemm use V73 int to float and multiply for quantization

int to IEEE float is exact
hvx mpy float to qfloat
increase tolerance to difference of 1 for qfloat

- int to IEEE float is exact - hvx mpy float to qfloat - increase tolerance to difference of 1 for qfloat PiperOrigin-RevId: 747153592

copybara-service bot force-pushed the test_745880585 branch from f1c812d to 4fc0203 Compare April 13, 2025 20:54

QS8 gemm use V73 int to float and multiply for quantization

989eed0

- int to IEEE float is exact - hvx mpy float to qfloat - increase tolerance to difference of 1 for qfloat PiperOrigin-RevId: 747153592

copybara-service bot force-pushed the test_745880585 branch from 4fc0203 to 989eed0 Compare April 13, 2025 21:03

copybara-service bot merged commit 989eed0 into master Apr 13, 2025

copybara-service bot deleted the test_745880585 branch April 13, 2025 21:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

QS8 gemm use V73 int to float and multiply for quantization #8242

QS8 gemm use V73 int to float and multiply for quantization #8242

copybara-service bot commented Apr 10, 2025

QS8 gemm use V73 int to float and multiply for quantization #8242

QS8 gemm use V73 int to float and multiply for quantization #8242

Conversation

copybara-service bot commented Apr 10, 2025