Reinforce-PongPolicyGradient / hyperparameters.json
jackoyoungblood's picture
Reinforce-PongPolGrad-1k training episodes
ca8ebc3
raw
history blame contribute delete
175 Bytes
{"h_size": 64, "n_training_episodes": 1000, "n_evaluation_episodes": 12, "max_t": 5500, "gamma": 0.98, "lr": 0.1, "env_id": "Pong-PLE-v0", "state_space": 7, "action_space": 3}