Skip to content

MedReason #31

@warner-benjamin

Description

@warner-benjamin

Implement verifiers env for MedReason

Implement a verifiers environment for the MedReason dataset. Use the authors’ question, system, and LLM-as-judge prompts if provided. Only create a training dataset if a training split exists.

Paper: https://arxiv.org/abs/2504.00993
Dataset / Project: https://huggingface.co/datasets/UCSC-VLAA/MedReason

Format & Env

  • Mixed (some multiple-choice, some open-ended):
    • MC tasks: accuracy-based evaluation
    • Open-ended reasoning: vf.JudgeRubric()

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions