Skip to content

feat(ml,mlebench): switch mlebench to OOF-driven eval and improve score workflow#48

Merged
hundredwz merged 1 commit intobaidu-baige:mainfrom
hundredwz:feature/update_mlagent_evaluate
Mar 2, 2026
Merged

feat(ml,mlebench): switch mlebench to OOF-driven eval and improve score workflow#48
hundredwz merged 1 commit intobaidu-baige:mainfrom
hundredwz:feature/update_mlagent_evaluate

Conversation

@hundredwz
Copy link
Copy Markdown
Collaborator

What problem does this PR solve?

Issue Number: resolve

Problem Summary:

switch mlebench to OOF-driven eval and improve score workflow

What is changed and the side effects?

Changed:

  • Switched MLE-Bench evaluation to OOF-driven scoring
  • Updated OOF/workflow contract and validation

Side effects:

  • Performance effects:

  • Breaking backward compatibility:


Check List:

@hundredwz hundredwz merged commit d2e104c into baidu-baige:main Mar 2, 2026
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants