Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
fix(autogptq): do not use_triton with qwen-vl (#1985)
* Enhance autogptq backend to support VL models * update dependencies for autogptq * remove redundant auto-gptq dependency * Convert base64 to image_url for Qwen-VL model * implemented model inference for qwen-vl * remove user prompt from generated answer * fixed write image error * fixed use_triton issue when loading Qwen-VL model --------- Co-authored-by: Binghua Wu <[email protected]>
- Loading branch information