[NO MERGING] code release for NeurIPS 2020#128
[NO MERGING] code release for NeurIPS 2020#128alexholdenmiller wants to merge 6 commits intomainfrom
Conversation
|
@alexholdenmiller Thanks a lot for releasing the code. This is very useful indeed. Quick question: what would be the best way to extend the evaluation with a different model? I currently have a new model for NetHack and I'm wondering what codebase I should be using for having a fair evaluation with your NeurIPS results. In a nutshell: if I want to claim that I'm SOTA on NetHack would it be enough to run (with my custom model) the I believe at this stage would be extremely useful to have something documented so that this is crystal clear. Thanks a lot! |
|
@aleSuglia thanks for reaching out! that's reasonable yes, though if you have the resources you may want to rerun the baseline as well due to a few changed params (or you can change them back) vs the paper. I'm rerunning experiments using these params here and will publish the results in the README in this folder. here are the changed params, which I will also be adding to the README here:
|
82c865e to
0596efd
Compare
This PR will be closed and is only here to highlight the code available within this branch.
Here we release updated code to get results competitive with our NeurIPS 2020 paper.
To be clear, this is not the exact code used for the paper: we made a number of performance improvements to NLE since the original results, dramatically increasing the speed of the environment (which was already one of the fastest-performing environments when the paper was published!).
We also introduced some additional modelling options, including conditioning the model on the in-game messages (i.e. msg.model=lt_cnn) and introducing new ways of observing the environment through different glyph types (i.e. glyph_type=all_cat). These features are enabled by default for the model now, which outperforms the models in the paper.