#

vlm-r1

Here are 2 public repositories matching this topic...

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

reinforcement-learning vlm multimodal llm qwen deepseek-r1 grpo r1-zero vlm-r1 multimodal-r1

Updated Apr 11, 2025
Python

yeyimilk / CrowdVLM-R1

Proposed fuzzy reward model with GRPO to improve VLM's abilities in crowd counting task.

reinforcement-learning vlm crowdcounting llm reward-model r1-zero vlm-r1 multimodal-r1

Updated Apr 11, 2025
Python

Improve this page

Add a description, image, and links to the vlm-r1 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vlm-r1 topic, visit your repo's landing page and select "manage topics."