CareQA env #33 by Arya-Hari · Pull Request #48 · MedARC-AI/med-lm-envs

Arya-Hari · 2025-10-11T07:41:08Z

Added environment for the CareQA dataset (#33).

CLAassistant · 2025-10-11T07:41:14Z

All committers have signed the CLA.

warner-benjamin

Thanks for the PR. A few changes needed before it can be merged.

Assuming the authors don't state what their prompts are (I did a quick search and didn't find anything), we want to default to using verifiers' BOXED_SYSTEM_PROMPT and THINK_BOXED_SYSTEM_PROMPT for reasoning models. verifiers has a boxed format parser extract_boxed_answer in verifiers.utils.data_utils, and verifiers.ThinkParser to extract the answers from a reasoning model. Make sure to add the use_think: bool = False boolean flag so the user can opt into using the thinking prompt and parser. You can see an example of this in #19.

The LLM as a Judge implementation is incomplete and needs to be finished.

environments/careqa_openended/careqa_openended.py

environments/careqa_mcq/careqa_mcq.py

Arya-Hari added 2 commits October 11, 2025 13:08

Add careqa mcq eval environment

8201912

Merge branch 'main' of https://github.com/MedARC-AI/med-lm-envs

9868f31

Arya-Hari added 2 commits October 11, 2025 16:19

add careqa open-ended env

a19ba97

add careqa open-ended env

ec93a0f

Arya-Hari marked this pull request as ready for review October 11, 2025 10:55

Arya-Hari added 2 commits October 11, 2025 16:26

resolving issues

61eed34

removing redundant imports

f311014

warner-benjamin requested changes Oct 15, 2025

View reviewed changes

environments/careqa_openended/careqa_openended.py Outdated Show resolved Hide resolved

environments/careqa_mcq/careqa_mcq.py Outdated Show resolved Hide resolved

Arya-Hari and others added 8 commits October 17, 2025 13:18

resolving comments

f9f321a

resolving commits

8a04ba9

resolving comments

f0018ab

Update careqa_openended.py

b5da66a

Update careqa_openended.py

013b0d6

Update careqa_openended.py

d76f860

Merge branch 'main' into pr/Arya-Hari/48

f412b30

update careqa to use No Free Labels style prompt

3682c60

warner-benjamin merged commit 70f7acd into MedARC-AI:main Dec 12, 2025
1 check passed

warner-benjamin mentioned this pull request Dec 12, 2025

CareQA (Closed and Open-Ended English) #33

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CareQA env #33#48

CareQA env #33#48
warner-benjamin merged 14 commits intoMedARC-AI:mainfrom
Arya-Hari:main

Arya-Hari commented Oct 11, 2025

Uh oh!

CLAassistant commented Oct 11, 2025 •

edited

Loading

Uh oh!

warner-benjamin left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Arya-Hari commented Oct 11, 2025

Uh oh!

CLAassistant commented Oct 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

warner-benjamin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

CLAassistant commented Oct 11, 2025 •

edited

Loading