
[Model] Qwen-2-VL Support #3125

Draft · wants to merge 3 commits into main

Conversation

nihalgeorge01

This PR adds support for the Qwen-2-VL (vision-language) model.

@nihalgeorge01 changed the title from [MODEL] Qwen-2-VL Support to [Model] Qwen-2-VL Support on Feb 10, 2025
@buqimaolvshangxue

Does qwen2_vl work with this commit, @nihalgeorge01? I also need Qwen2-VL support.

@nihalgeorge01
Author

Not yet; we are fixing some bugs in the code locally. We are working on pushing this out soon.

@buqimaolvshangxue
Copy link

Thank you very much for your work! While thinking about this problem, I found that when mlc processes the llava model, the text embeddings and the image embeddings are directly concatenated to get the final embedding. In vllm's approach for qwen2_vl, however, the image embeddings replace specific placeholder positions in the already-expanded embedding. I wonder whether llava's direct concatenation method is feasible here? If the concatenation approach is not adopted, it seems the public interface functions would need to be modified. @nihalgeorge01
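For readers comparing the two merge strategies described above, here is a minimal PyTorch sketch. It is not code from this PR; the tensor shapes and the placeholder token id are illustrative assumptions (151655 is the `<|image_pad|>` id in the Qwen2-VL HF config, but treat it as an assumption here).

```python
import torch

# Assumed placeholder id for illustration only; Qwen2-VL expands the prompt
# with one image-pad token per visual patch before the merge happens.
IMAGE_TOKEN_ID = 151655

def merge_by_concat(text_embeds: torch.Tensor,
                    image_embeds: torch.Tensor) -> torch.Tensor:
    """LLaVA-style merge: splice the image embeddings directly in front of
    the text embeddings along the sequence dimension."""
    # text_embeds: [num_text_tokens, hidden]; image_embeds: [num_image_tokens, hidden]
    return torch.cat([image_embeds, text_embeds], dim=0)

def merge_by_replacement(input_ids: torch.Tensor,
                         text_embeds: torch.Tensor,
                         image_embeds: torch.Tensor) -> torch.Tensor:
    """Qwen2-VL-style merge (as in vLLM): scatter the image embeddings into
    the placeholder positions of the already-expanded text embeddings."""
    # input_ids: [seq_len]; text_embeds: [seq_len, hidden];
    # image_embeds: [num_image_tokens, hidden], where num_image_tokens must
    # equal the number of placeholder tokens present in input_ids.
    merged = text_embeds.clone()
    mask = input_ids == IMAGE_TOKEN_ID
    merged[mask] = image_embeds.to(merged.dtype)
    return merged
```

The replacement form keeps the sequence length and token positions fixed after prompt expansion, which matters for position-dependent schemes such as Qwen2-VL's multimodal rotary embeddings; plain concatenation shifts where the text tokens land, which is likely why adopting it would require changes to the shared embedding interface.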
