[BugFix]Add int8 cache dtype when using ascend attention quantization · vllm-project/vllm-ascend@b20bbd0

Triggered via pull request February 21, 2025 01:44

Angazenn

opened #125

Angazenn:develop

Status Success

Total duration 30m 29s

Artifacts 1

image.yml

on: pull_request

Produced during runtime

Name	Size
vllm-project~vllm-ascend~TA1VPH.dockerbuild	249 KB