DreamerV3

Want to do well on Atari100k (pip install gym[atari] autorom[accept-rom-license]), though BSuite (pip install bsuite) looks interesting too.

This is designed to run on a tinybox, either red or green, with just ./train.py

Process

Run https://github.com/danijar/dreamerv3 to train a model that plays Pong
Get that model loaded into tinygrad and running, both the policy model and decoder
Get fine tuning working
Get full training working

Might be a better choice, the repo is a lot easier to read. https://github.com/vmicheli/delta-iris

Three models:

actor_critic (two copies, model and target_model)
world_model
- transformer takes in (frames_emb x1, act_tokens_emb x1, latents_emb x4) x many
- frame_cnn (FrameEncoder), output 4 channels
tokenizer
- frame_cnn (FrameEncoder), output 16 channels
- encoder is 7 channels, 3 for prev_frame, 1 for action, and 3 for frame (FrameEncoder), output 64 channels for quantizer
- decoder is 84 channels, 16 for prev_frame, 4 for action, and 64 for latents. it outputs an image (FrameDecoder)
- quantizer

Training:

Our training strategy is to reproduce each one in reverse.

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
di		di
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
dreamer_model.py		dreamer_model.py
iris.py		iris.py
model.py		model.py
ruff.toml		ruff.toml
test_compare_delta_iris.py		test_compare_delta_iris.py
train.py		train.py