Toggle navigation
JUPYTER
FAQ
View on GitHub
Execute on Binder
master
branch
changes-1
changes-2
changes-3
course-4-1
course-4
first-course
master
third-course
tag
Reinforcement-Learning-Specialization
Prediction and Control with Function Approximation
Week 4
Notebook: Average Reward Softmax Actor-Critic using Tile-coding
results
Name
..
ActorCriticSoftmax_tilings_32_tiledim_8_actor_ss_0.25_critic_ss_2_avg_reward_ss_0.015625_exp_avg_reward.npy
ActorCriticSoftmax_tilings_32_tiledim_8_actor_ss_0.25_critic_ss_2_avg_reward_ss_0.015625_total_return.npy