Add capability to perform multiple runs, possibly with a parameter sweep #35

jopetty · 2020-11-24T02:35:42Z

Currently, there is a one-to-one correspondence between issuing the python main.py ... command and producing a single model. But ideally, an experiment should encapsulate both a parameter sweep (i.e., train SRN-SRN, GRU-GRU, Transformer-Transformer, etc. models) and allow for multiple runs of any given parameter combination (i.e., for each set of hyperparameters do 10 runs so we can average performance). This means we need a way to launch multiple jobs and specify parameter sweeps in a config file.

For sweeping: Hydra includes an Ax plugin, which seems to allow for parameter sweeps to be defined in the YAML config.

For multiruns: Maybe look at the JobLib launcher plugin

The text was updated successfully, but these errors were encountered:

jopetty · 2020-11-24T03:38:03Z

Also of note, there is a plugin for the Submitit Launcher which automatically runs sessions with SLURM jobs. It would be useful to see if this could be used to submit jobs on the GRACE cluster.

jopetty · 2021-01-09T20:17:55Z

It looks like multirun support is somewhat provided out-of-the-box; using the -m flag allows one to sweep over parameters like:

python train.py model=model1,model2

but this has some problems with the custom output directory structure we've created.

jopetty · 2021-02-02T17:06:27Z

Multirun directory structure has been fixed in 33f9ce5. Now it looks like:

outputs/
  experiment/
    model/
      DATE_TIME/
        RUN_1/
        RUN_2/
        ....

jopetty added documentation Improvements or additions to documentation enhancement New feature or request v2 Version 2 (with Hydra) labels Nov 24, 2020

jopetty self-assigned this Nov 24, 2020

jopetty added this to the Stable 1.0 milestone Jan 4, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add capability to perform multiple runs, possibly with a parameter sweep #35

Add capability to perform multiple runs, possibly with a parameter sweep #35

jopetty commented Nov 24, 2020

jopetty commented Nov 24, 2020

jopetty commented Jan 9, 2021

jopetty commented Feb 2, 2021

Add capability to perform multiple runs, possibly with a parameter sweep #35

Add capability to perform multiple runs, possibly with a parameter sweep #35

Comments

jopetty commented Nov 24, 2020

jopetty commented Nov 24, 2020

jopetty commented Jan 9, 2021

jopetty commented Feb 2, 2021