MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision
MedReasoner has been accepted at AAAI 2026 as a poster!
Abstract: Accurately grounding regions of interest (ROIs) is critical for diagnosis and treatment planning in medical imaging. While multimodal large language models (MLLMs) combine visual perception with natural language, current medical-grounding pipelines still rely on supervised fine-tuning with explicit spatial hints, making them ill-equipped to handle the implicit queries common in clinical practice. This work makes three core contributions. We first define Unified Medical Reasoning Grounding (UMRG), a novel vision–language task that demands clinical reasoning and pixel-level grounding. Second, we release U-MRG-14K, a dataset of 14K samples featuring pixel-level masks alongside implicit clinical queries and reasoning traces, spanning 10 modalities, 15 super-categories, and 108 specific categories. Finally, we introduce MedReasoner, a modular framework that distinctly separates reasoning from segmentation: an MLLM reasoner is optimized with reinforcement learning, while a frozen segmentation expert converts spatial prompts into masks, with alignment achieved through format and accuracy rewards. MedReasoner achieves state-of-the-art performance on U-MRG-14K and demonstrates strong generalization to unseen clinical queries, underscoring the significant promise of reinforcement learning for interpretable medical grounding.
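The reward design described above (a format reward for well-structured reasoning traces plus an accuracy reward for the grounded region) can be sketched as follows. This is a minimal illustrative sketch, not the authors' implementation: the `<think>/<answer>` template and the `(x1, y1, x2, y2)` box format are assumptions for demonstration.

```python
import re

# Assumed output template: reasoning inside <think>, spatial prompt inside <answer>.
THINK_ANSWER = re.compile(r"<think>.*?</think>\s*<answer>(.*?)</answer>", re.DOTALL)

def format_reward(output: str) -> float:
    """Return 1.0 if the reasoner's output matches the expected template, else 0.0."""
    return 1.0 if THINK_ANSWER.fullmatch(output.strip()) else 0.0

def iou(box_a, box_b) -> float:
    """Intersection-over-union of two (x1, y1, x2, y2) boxes, as a simple accuracy reward."""
    x1 = max(box_a[0], box_b[0])
    y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2])
    y2 = min(box_a[3], box_b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union else 0.0
```

In the actual framework, the spatial prompt in `<answer>` would be handed to the frozen segmentation expert, which produces the pixel-level mask; the combined reward then drives the RL optimization of the MLLM reasoner.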
git clone https://github.com/zzzyzh/MedReasoner.git
cd MedReasoner
conda create -n med_reasoner python=3.10 -y
conda activate med_reasoner
pip install torch==2.6.0 torchvision==0.21.0 torchaudio==2.6.0
pip install -r requirements.txt
pip install "numpy<2.0"
pip install transformers==4.52.4
pip install vllm==0.8.5.post1
pip install flash-attn==2.7.4.post1
pip install deepspeed==0.16.9
pip install llamafactory
pip install -e .
bash scripts/run_rl_lingshu_7b_soft.sh
bash scripts/merge_model.sh
bash scripts/infer_grounding.sh
@article{yan2025medreasoner,
title={MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision},
author={Yan, Zhonghao and Diao, Muxi and Yang, Yuxuan and Jing, Ruoyan and Xu, Jiayuan and Zhang, Kaizhou and Yang, Lele and Liu, Yanxi and Liang, Kongming and Ma, Zhanyu},
journal={arXiv preprint arXiv:2508.08177},
year={2025}
}
This code is built on verl and Seg-Zero. We thank the authors for sharing their code.
