Intro

This repo is building off of the code from https://github.com/yawgmoth/pyhanabi for my master's research on Human-AI Collaboration

Our approach

Creating strategies to learn to collaborate with. If our agent can learn how to recognize the other player's strategy and then utilize structured knowledge of how to best coordinate with that strategy, we hypothesize that the Human-AI team can gain high rewards.

Current strategy

Building the CHIEF agent which is intended to use a pool of human-like agents that reflect different types of players (differing in conventions/strategies). By using a probability distribution over each agent in the pool representing its likelihood of best representing our teammate, which we update with bayesian updates (the conditional probability update is the probability of the observed action by the teammate given each agent -- sample possible hands and get values for actions from agents to produce this probability distribution for updates). Finally, using the most up-to-date distribution over the agent pool, we can use that representation of the teammate to tailor our response to them for better total team-reward (for now we will just mirror the agents in the pool and if that is insufficient we will look into more advanced respones models)

For questions

contact: [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 366 Commits
Agents		Agents
TestFiles		TestFiles
game_net		game_net
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
common_game_functions.pyx		common_game_functions.pyx
hanabi.pyx		hanabi.pyx
httpui.py		httpui.py
serverconf.py		serverconf.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Intro

Our approach

Current strategy

For questions

About

Releases

Packages

Contributors 4

Languages

License

ArnavM1499/Hanabi-HumanAI

Folders and files

Latest commit

History

Repository files navigation

Intro

Our approach

Current strategy

For questions

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages