Pegah-Ardehkhani's repositories
01. Epsilon Greedy
02. Optimistic Initial Values
03. UCB1
04. Bayesian Bandit Thompson Sampling
05. Iterative Policy Evaluation
06. Policy Iteration
07. Value Iteration
08. Monte Carlo
08. TD(0)
09. TD(λ)
10. SARSA
11. SARSA(λ)
12. Q-Learning
13. Deep Q-Learning
LICENSE
README.md