Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
wwymak
/
ppo-LunarLander-v2
like
0
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
Model card
Files
Files and versions
Community
Use this model
d5487bb
ppo-LunarLander-v2
1 contributor
History:
5 commits
wwymak
lunar lander tuned, 1e6 timesteps, params: {'n_steps': 1024, 'n_epochs': 20, 'discount_factor_gamma': 0.999}
d5487bb
over 2 years ago
lunar v1
lunar lander default training, 1e6 timesteps
over 2 years ago
lunar v2
lunar lander tuned, 1e6 timesteps, params: {'n_steps': 1024, 'n_epochs': 20, 'discount_factor_gamma': 0.999}
over 2 years ago
.gitattributes
Safe
1.22 kB
lunar lander default training, 1e6 timesteps
over 2 years ago
README.md
Safe
677 Bytes
lunar lander tuned, 1e6 timesteps, params: {'n_steps': 1024, 'n_epochs': 20, 'discount_factor_gamma': 0.999}
over 2 years ago
config.json
Safe
14.5 kB
lunar lander tuned, 1e6 timesteps, params: {'n_steps': 1024, 'n_epochs': 20, 'discount_factor_gamma': 0.999}
over 2 years ago
lunar v1.zip
Safe
144 kB
LFS
lunar lander default training, 1e6 timesteps
over 2 years ago
lunar v2.zip
Safe
144 kB
LFS
lunar lander tuned, 1e6 timesteps, params: {'n_steps': 1024, 'n_epochs': 20, 'discount_factor_gamma': 0.999}
over 2 years ago
replay.mp4
Safe
202 kB
LFS
lunar lander tuned, 1e6 timesteps, params: {'n_steps': 1024, 'n_epochs': 20, 'discount_factor_gamma': 0.999}
over 2 years ago
results.json
Safe
165 Bytes
lunar lander tuned, 1e6 timesteps, params: {'n_steps': 1024, 'n_epochs': 20, 'discount_factor_gamma': 0.999}
over 2 years ago