[BugFix] add int8 cache dtype when using attention quantization #113
Workflow: image.yml
Trigger: on: pull_request
Job: vllm-ascend image
Duration: 41m 57s
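For context, image.yml is the workflow that produced this run: it fires on pull_request and builds the vllm-ascend image. A minimal sketch of what such a workflow could look like is below; the job layout, action versions, Dockerfile context, and tag are illustrative assumptions, not taken from the repository.

```yaml
# Hypothetical sketch of image.yml (names, versions, and tags are assumptions).
name: vllm-ascend image

on:
  pull_request:

jobs:
  build:
    runs-on: ubuntu-latest
    steps:
      - name: Checkout
        uses: actions/checkout@v4

      - name: Set up Docker Buildx
        uses: docker/setup-buildx-action@v3

      # Build the image for the PR; push is disabled since this is a
      # pull_request-triggered validation build, not a release.
      - name: Build image
        uses: docker/build-push-action@v6
        with:
          context: .
          push: false
          tags: vllm-ascend:pr-test
```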
Artifacts
Produced during runtime
Name | Size
---|---
vllm-project~vllm-ascend~9D5GX2.dockerbuild | 175 KB
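The `vllm-project~vllm-ascend~9D5GX2.dockerbuild` artifact is most likely the Buildx build record that recent versions of docker/build-push-action export automatically after a build; such records can typically be imported into Docker Desktop's Builds view to inspect the build's steps and timings.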