Learning to Incentivize Others

This is the code for experiments in the paper Learning to Incentivize Other Learning Agents. Baselines are included.

Setup

Python 3.6
Tensorflow >= 1.12
OpenAI Gym == 0.10.9
Clone and pip install Sequential Social Dilemma, which is a fork from the original open-source implementation.
Clone and pip install LOLA if you wish to run this baseline.
Clone this repository and run $ pip install -e . from the root.

Navigation

alg/ - Implementation of LIO and PG/AC baselines
env/ - Implementation of the Escape Room game and wrappers around the SSD environment.
results/ - Results of training will be stored in subfolders here. Each independent training run will create a subfolder that contains the final Tensorflow model, and reward log files. For example, 5 parallel independent training runs would create results/cleanup/10x10_lio_0,...,results/cleanup/10x10_lio_4 (depending on configurable strings in config files).
utils/ - Utility methods

Examples

Train LIO on Escape Room

Set config values in alg/config_room_lio.py
cd into the alg folder
Execute training script $ python train_multiprocess.py lio er. Default settings conduct 5 parallel runs with different seeds.
For a single run, execute $ python train_lio.py er.

Train LIO on Cleanup

Set config values in alg/config_ssd_lio.py
cd into the alg folder
Execute training script $ python train_multiprocess.py lio ssd.
For a single run, execute $ python train_ssd.py.

Citation

@article{yang2020learning,
  title={Learning to incentivize other learning agents},
  author={Yang, Jiachen and Li, Ang and Farajtabar, Mehrdad and Sunehag, Peter and Hughes, Edward and Zha, Hongyuan},
  journal={Advances in Neural Information Processing Systems},
  volume={33},
  pages={15208--15219},
  year={2020}
}

License

See LICENSE.

SPDX-License-Identifier: MIT

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
lio		lio
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Learning to Incentivize Others

Setup

Navigation

Examples

Train LIO on Escape Room

Train LIO on Cleanup

Citation

License

About

Uh oh!

Releases

Packages

Languages

License

011235813/lio

Folders and files

Latest commit

History

Repository files navigation

Learning to Incentivize Others

Setup

Navigation

Examples

Train LIO on Escape Room

Train LIO on Cleanup

Citation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages