-
Notifications
You must be signed in to change notification settings - Fork 1
Self-learning poker playing agent #111
Copy link
Copy link
Open
Labels
code exchange submissionCode and/or content about Temporal!Code and/or content about Temporal!triageIssues that Temporal folk need to look atIssues that Temporal folk need to look atziggy reviewedPre-screened by ZiggyBotPre-screened by ZiggyBot
Metadata
Metadata
Assignees
Labels
code exchange submissionCode and/or content about Temporal!Code and/or content about Temporal!triageIssues that Temporal folk need to look atIssues that Temporal folk need to look atziggy reviewedPre-screened by ZiggyBotPre-screened by ZiggyBot
Project link
https://github.com/rudy-pants/temporal-agent-poker/tree/main
Language
Python
Short description (max 256 chars)
Suppose you wanted a demonstration of a self-learning agent that could improve it's performance by updating it's own memory. This is what our project showcases, in a direct poker game between a human player, and an agent capable of improving itself to superhuman levels of play.
Long Description
Our project contains 3 major parts: a frontend client, a temporal worker running all backend workflows, and a memory module for agent self-learning. The agent will collect observations about the game each hand, and then compress those into memories after each game, updating it's memories and playstyle at it's own discretion.
https://drive.google.com/file/d/1kfkVUa9E40_iHv0vj2QlXWRP7ZOg2i22/view?usp=sharing
Author(s)
Rudy Banerjee, Geico
Khalid Shaikh, Apple