[Model Runner][Performance] Cache the jugement result of is_encoder_decoder to decrease framework overhead #127
image.yml
on: pull_request
vllm-ascend image
29m 46s
Artifacts
Produced during runtime
Name | Size | |
---|---|---|
vllm-project~vllm-ascend~M8PBMH.dockerbuild
|
121 KB |
|