Jupyter Notebook Viewer

Name
Pegah-Ardehkhani's repositories
01. Epsilon Greedy
02. Optimistic Initial Values
03. UCB1
04. Bayesian Bandit Thompson Sampling
05. Iterative Policy Evaluation
06. Policy Iteration
07. Value Iteration
08. Monte Carlo
08. TD(0)
09. TD(λ)
10. SARSA
11. SARSA(λ)
12. Q-Learning
13. Deep Q-Learning
LICENSE
README.md