[BugFix] add int8 cache dtype && modify initialization of attention #128
image.yml
on: pull_request
vllm-ascend image
27m 38s
Artifacts
Produced during runtime
Name | Size | |
---|---|---|
vllm-project~vllm-ascend~VJ8EWA.dockerbuild
|
202 KB |
|