When will you open-source the evaluation code?

We attempted to reproduce the paper's results on the Video-MME dataset but could not reach comparable performance. As the figure below shows, there is a significant gap in accuracy. Releasing the evaluation code for this dataset would greatly benefit the open-source community.

<img width="1650" height="336" alt="Image" src="https://github.com/user-attachments/assets/2bc3023b-8e21-4217-9414-72ec4939f642" />
