-
Notifications
You must be signed in to change notification settings - Fork 3
Feature/mcts lhns fixes #90
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
nikivanstein
wants to merge
58
commits into
main
Choose a base branch
from
feature/mcts-lhns-fixes
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Changes from 55 commits
Commits
Show all changes
58 commits
Select commit
Hold shift + click to select a range
a892605
Added Multi LLM and experiment for diversity start
nikivanstein 9fffcb5
Fixing Claude connector
nikivanstein 69d092d
fixing ioh dependency
nikivanstein 7d3b5cb
Fixing sync issues due to pickling the same file in multiple threads
nikivanstein 425940e
improved error messages
nikivanstein bf154c3
Cleaning error messages
nikivanstein cee613b
Nice errors!
nikivanstein 6d154cc
fixing tokencost
nikivanstein 84e36a0
New experimental setups
nikivanstein e6c2a99
setting up full experiment
nikivanstein 1b7727a
fixing issues in exp
nikivanstein 2a9821a
working setup
nikivanstein 728242b
Added math import
nikivanstein a489f90
Added debug mode for easy testing of end to end stuff
nikivanstein eabc95e
ioh
nikivanstein 4dab9a0
Last version
nikivanstein 13360e3
Merge branch 'main' into experiment/diversity
nikivanstein ca817da
Setting up first experiment for guided llm mutation
nikivanstein 721d1c6
feedback and error handling in blade was broken..
nikivanstein 5eb5f14
lock update
nikivanstein 0c3edf8
Merge branch 'main' of https://github.com/XAI-liacs/BLADE into experi…
nikivanstein 4783e7a
black
nikivanstein 9b74fa2
Added additional plotting options for CEG
nikivanstein f11dac0
Merge https://github.com/XAI-liacs/BLADE into experiment/xaidriven
nikivanstein c335dbb
Added additional experiments
nikivanstein 18bc21f
Solving Photonics issues
nikivanstein 104c9ec
Resolving mcts errors
nikivanstein 42f70e5
Resolved issues with MCTS and cleaned up experiments
nikivanstein 4756a9c
Better generalized plotting functions and new ablation studies.
nikivanstein a5f210b
added visualisation mabbob guided
nikivanstein 2fb270b
Added operator colloring.
nikivanstein 63914dc
Updated visualisations
nikivanstein 6c7d336
small changes
nikivanstein 8d9c6d8
Added hlp problem class
nikivanstein 98dc673
hlp problem runner
nikivanstein 5f5e737
implemented run for rule driven
nikivanstein 449027a
added rules
nikivanstein 6fac909
added hlp problem
nikivanstein e8547e7
HLP experiment ready
nikivanstein a98749f
Small changes for visualisations in paper.
nikivanstein f23eafa
Small changes
nikivanstein 878a3e4
removed
nikivanstein 4615b2d
Added statistical test functions.
nikivanstein 2a8e914
cleaning up
nikivanstein ad4fcaf
clearning up notebooks
nikivanstein e1bcd46
clearning up notebooks
nikivanstein 628790f
clearning up notebooks
nikivanstein 27cca46
Cleaning up plotting code
nikivanstein a698c94
Removing specific ref for LLaMEA
nikivanstein e6cfc41
Add HLP tests and document AUC utilities (#91)
nikivanstein 94bcf7c
black
nikivanstein b7c5600
remove circular
nikivanstein e806157
Add AlphaEvolve benchmark documentation (#92)
nikivanstein bd21344
Fixed tests
nikivanstein 5fe8e87
black
nikivanstein 4914389
black
nikivanstein 4787df6
for now remove hlp
nikivanstein 5891d27
remove generated problems
nikivanstein File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,23 @@ | ||
| AlphaEvolve Benchmarks | ||
| ====================== | ||
|
|
||
| BLADE includes benchmark instances inspired by the Google DeepMind | ||
| AlphaEvolve paper. These instances are available in two complementary forms: | ||
|
|
||
| - ``run_benchmarks/`` provides standalone reference scripts for running each | ||
| task directly. | ||
| - ``iohblade/benchmarks`` packages the same tasks for programmatic use in | ||
| experiments and pipelines. | ||
|
|
||
| The packaged benchmarks are grouped by domain: | ||
|
|
||
| - Analysis (auto-correlation inequalities) | ||
| - Combinatorics (Erdos min-overlap) | ||
| - Geometry (Heilbronn problems, kissing number, and distance ratios) | ||
| - Matrix multiplication | ||
| - Number theory (sums vs differences) | ||
| - Packing (rectangle, hexagon, and unit square packing) | ||
| - Fourier (uncertainty inequalities) | ||
|
|
||
| Each domain folder contains a README with task-specific details and citations | ||
| to the original sources. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.