Releases: EdanToledo/Stoix
Releases · EdanToledo/Stoix
v0.0.3
What's Changed
- chore: refactor types by @EdanToledo in #106
- Fix: Sebulba PPO Metrics by @EdanToledo in #108
- feat: add MLP torso after CNN torso by @EdanToledo in #110
- chore: add some extra alg run checks by @EdanToledo in #111
- fix: make FlattenObservationWrapper also flatten next_obs by @JesseSilverberg in #115
- Edan toledo/issue112 by @EdanToledo in #117
- Feat/add jax env factory by @EdanToledo in #118
- chore: move system config files into separate folders by @EdanToledo in #119
- Fix typos by @BioGeek in #120
- feat: add SPS for trainer by @EdanToledo in #129
- fix: logger accepts arrays as data and use tensorboard_logger Logger by @Your-Cheese in #131
- feat: add slurm launcher by @EdanToledo in #133
- fix: add specific system condition for envpool installation by @EdanToledo in #124
- chore: add uuid to uniqueid by @EdanToledo in #134
- Feat/add r2d2 by @EdanToledo in #137
- Feat/add impala by @EdanToledo in #138
- chore: Redundant duplicate tests and linters workflow in PR's by @mmcaulif in #149
- fix: issue with GAE calc and add test case by @EdanToledo in #148
- feat: add readme for algorithms by @EdanToledo in #150
- fix: change fbx api by @EdanToledo in #151
- feat: pass config env kwargs to gymnax via envparams by @emergenz in #152
- Chore/make wrappers work sebulba by @EdanToledo in #141
- feat: add spo by @EdanToledo in #144
- Improve sebulba by @EdanToledo in #161
- Fix/gae calc by @EdanToledo in #160
- fix: do not remove hydra logging handler by @emergenz in #158
- Feat/add single torso impala by @EdanToledo in #164
- feat: Migration to UV by @mmcaulif in #155
- Feat/pqn by @EdanToledo in #122
- feat: add optional obs norm to ff_ppo.py by @EdanToledo in #167
- [FIX] Fix the logger if there are no streamhandlers by @Michael-Beukman in #173
- [FEATURE] Kinetix by @Michael-Beukman in #172
- Fix the Kinetix pixels config by @Michael-Beukman in #175
- Feat/use stoa wrappers by @EdanToledo in #177
New Contributors
- @JesseSilverberg made their first contribution in #115
- @BioGeek made their first contribution in #120
- @Your-Cheese made their first contribution in #131
- @mmcaulif made their first contribution in #149
- @emergenz made their first contribution in #152
- @Michael-Beukman made their first contribution in #173
Full Changelog: v0.0.2...v0.0.3
v0.0.2
What's Changed
- fix: clip mpo actions used in q function to avoid extrapolation by @EdanToledo in #55
- chore: remove self-implemented code in favour of jumanji wrapper by @EdanToledo in #56
- fix: use of truncation in GAE calc by @EdanToledo in #57
- fix: add option to use GAE as value targets by @EdanToledo in #58
- feat: add running statistics utils modified from acme by @EdanToledo in #60
- feat: add beta distribution policy head by @EdanToledo in #63
- Chore/refactor loss metrics by @EdanToledo in #61
- Feat/add ppo penalty by @EdanToledo in #64
- chore: slight change to configs by @EdanToledo in #65
- chore: Make Update Batch Size not affect num envs, buffer size and batch size by @EdanToledo in #68
- fix: double critic being initialised to same network by @EdanToledo in #73
- Chore/refactor type by @EdanToledo in #74
- Feat/add vmpo by @EdanToledo in #75
- fix: recurrent ppo by @EdanToledo in #76
- Chore/change mpo loss by @EdanToledo in #80
- feat: add notebook to plot stoix algorithms by @EdanToledo in #87
- chore: edit readme by @EdanToledo in #88
- feat: add a weights and biases logger by @EdanToledo in #89
- fix: add nstep transitions to d4pg by @EdanToledo in #92
- Feat/rainbow by @RPegoud in #86
- Chore/change muzero networks by @EdanToledo in #93
- chore: move input of distributional network args into config by @EdanToledo in #94
- chore: edit wrappers to have a separate flatten obs wrapper by @EdanToledo in #95
- feat: generalise win rate to be solve rate by @EdanToledo in #96
- Feat/add popjym by @EdanToledo in #97
- fix: typing issues causing double compilation by @EdanToledo in #100
- Feat/add navix by @EdanToledo in #101
- Feat/Add Sebulba by @EdanToledo in #105
New Contributors
Full Changelog: v0.0.1...v0.0.2
v0.0.1
First release.