In this project, we train a snake AI with Q-learning, updating Q-values from predefined rewards and actions. The environment is modeled with two danger states, four possible movement directions, and eight food directions, for a total of 256 states, described here as the minimal number required for this setup. After sufficient training, the snake's performance and decision-making improve.
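To make the state count and the update rule concrete, here is a minimal sketch in Python. It is not the repository's code. Note that 2 × 4 × 8 is 64, so the sketch assumes the danger signal is a binary flag for each of three neighbouring cells (2^3 = 8 combinations), which gives 8 × 4 × 8 = 256. That reading, the names `encode_state`, `choose_action`, and `q_update`, and the values of `alpha`, `gamma`, and the reward are all illustrative assumptions.

```python
import random

# Assumed state layout (not confirmed by the project):
# 3 binary danger flags (2**3 = 8) x 4 headings x 8 food directions = 256.
N_DANGER, N_HEADING, N_FOOD = 8, 4, 8
N_STATES = N_DANGER * N_HEADING * N_FOOD
N_ACTIONS = 4  # one action per movement direction


def encode_state(danger_bits, heading, food_dir):
    """Map (three danger flags, heading, food direction) to an index in [0, 256)."""
    danger = danger_bits[0] * 4 + danger_bits[1] * 2 + danger_bits[2]
    return (danger * N_HEADING + heading) * N_FOOD + food_dir


def choose_action(q_table, state, epsilon, rng=random):
    """Epsilon-greedy: explore with probability epsilon, otherwise act greedily."""
    if rng.random() < epsilon:
        return rng.randrange(N_ACTIONS)
    row = q_table[state]
    return row.index(max(row))


def q_update(q_table, state, action, reward, next_state,
             alpha=0.1, gamma=0.9, done=False):
    """Standard tabular Q-learning update (alpha and gamma are placeholder values)."""
    best_next = 0.0 if done else max(q_table[next_state])
    target = reward + gamma * best_next
    q_table[state][action] += alpha * (target - q_table[state][action])


# One illustrative transition with a hypothetical reward of 10 for eating food.
q_table = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
s = encode_state((0, 1, 0), heading=2, food_dir=5)
s_next = encode_state((0, 0, 0), heading=2, food_dir=5)
q_update(q_table, s, action=1, reward=10.0, next_state=s_next)
```

A plain nested list keeps the sketch dependency-free; with only 256 × 4 entries, a table like this fits comfortably in memory, and each update touches a single entry.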
Saman2C/snake_game_reinforcement_learning
About
In this project we try to find a winning strategy using Q-learning with properly tuned parameters.