-
Notifications
You must be signed in to change notification settings - Fork 35
Open
Description
Implement verifiers env for MedReason
Implement a verifiers environment for the MedReason dataset. Use the authors’ question, system, and LLM-as-judge prompts if provided. Only create a training dataset if a training split exists.
Paper: https://arxiv.org/abs/2504.00993
Dataset / Project: https://huggingface.co/datasets/UCSC-VLAA/MedReason
Format & Env
- Mixed (some multiple-choice, some open-ended):
- MC tasks: accuracy-based evaluation
- Open-ended reasoning:
vf.JudgeRubric()
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels