ppo-LunarLander-v2 / lunar v1 /_stable_baselines3_version
lunar lander default training, 1e6 timesteps
4985d07
5 Bytes
1.5.0