This doc includes details on some desired features.
Right now Gromax can create a bash script for benchmarking and analyze results, but can't directly execute the benchmarks. An execution framework ("gromax execute") would remove the need for a separate bash script, and would allow cool features like randomization of execution order to reduce hardware bias. Interfacing with the gmx API may be feasible here, though not compatible with older Gromacs versions.
This will be useful for anyone wanting to do some in-depth analysis of the various factors leading to optimal Gromacs performance
If gromax is installed on a computer that will be benchmarked, it would be cool to have Gromax autodetect the hardware rather than have manual input.
Gromacs can stride CPU cores - such as skipping every second logical core, which can have effects in some hyperthreaded CPU layouts. Gromax has experimental support for this but hasn't been validated properly.