
[BugFix]add int8 cache dtype when using attention quantization #143

Triggered via pull request on February 21, 2025 02:11
Status: Failure
Total duration: 2m 51s

mypy.yaml

on: pull_request
Matrix: mypy

Annotations

8 errors
mypy (3.10): vllm_ascend/worker.py#L112
Name "cache_config" is not defined [name-defined]
mypy (3.10)
Process completed with exit code 1.
mypy (3.12)
The job was canceled because "_3_10" failed.
mypy (3.12)
The operation was canceled.
mypy (3.11)
The job was canceled because "_3_10" failed.
mypy (3.11)
The operation was canceled.
mypy (3.9)
The job was canceled because "_3_10" failed.
mypy (3.9)
The operation was canceled.
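
The annotation on the mypy (3.10) job is the root failure: mypy's [name-defined] check flags a bare reference to cache_config at vllm_ascend/worker.py line 112, meaning the name is used in a scope where it was never bound; the remaining jobs were simply canceled once that job failed. Below is a minimal sketch of the pattern that typically produces this error and one common way to resolve it; the class, method, and attribute names are assumptions for illustration only and are not taken from the actual worker.py.

# Hypothetical sketch of the [name-defined] error flagged above.
# All names here are illustrative assumptions, not the real vllm_ascend code.

class NPUWorker:
    def __init__(self, cache_config) -> None:
        # Bind the constructor argument so other methods can reach it.
        self.cache_config = cache_config

    def _select_kv_cache_dtype(self) -> str:
        # Before the fix (what mypy rejects): `cache_config` is a bare name
        # that is not defined in this method's scope.
        #     if cache_config.cache_dtype == "int8":
        #         return "int8"
        #
        # After the fix: read the attribute bound in __init__ instead.
        if self.cache_config.cache_dtype == "int8":
            return "int8"
        return "auto"

Running mypy over a module like this, with the bare cache_config reference restored, reproduces the same error class and message shown in the annotation.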