I introduced several multi-armed bandit algorithms, including epsilon-greedy, annealing epsilon-greedy, Thompson sampling, and UCB, among others. I also compared their performance and how quickly each one finds the best arm.
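As a rough illustration of the simplest of these algorithms, here is a minimal epsilon-greedy sketch on simulated Bernoulli arms. It is not taken from this repository's code: the function name `run_epsilon_greedy`, the arm probabilities, and the default parameters are all assumptions made for the example.

```python
import random


def run_epsilon_greedy(true_means, epsilon=0.1, steps=5000, seed=0):
    """Simulate epsilon-greedy on Bernoulli arms (hypothetical helper).

    true_means: success probability of each arm (made-up values for the demo).
    Returns the pull count and estimated mean reward per arm.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms    # number of times each arm was pulled
    values = [0.0] * n_arms  # running average reward of each arm

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimated reward.
            arm = max(range(n_arms), key=lambda a: values[a])

        # Draw a 0/1 reward from the chosen arm.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0

        # Incremental update of the running average.
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]

    return counts, values


counts, values = run_epsilon_greedy([0.2, 0.5, 0.8])
most_pulled = max(range(len(counts)), key=lambda a: counts[a])
print("pull counts:", counts)
print("estimated means:", [round(v, 3) for v in values])
print("most pulled arm:", most_pulled)
```

With a fixed `epsilon`, the algorithm keeps exploring at the same rate forever. The annealing variant would instead shrink `epsilon` over time so that exploration fades as the estimates improve.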
babaniyi/bandits