Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Galeros
/
ppo-LunarLander-v2
like
0
Reinforcement Learning
stable-baselines3
TensorBoard
LunarLander-v2
deep-reinforcement-learning
Eval Results
Model card
Files
Files and versions
Metrics
Training metrics
Community
Use this model
c59881d
ppo-LunarLander-v2
/
galeos_model_lander_ppo
2 contributors
History:
3 commits
Galeros
PPO with 3e6 iterations
81e6c97
over 2 years ago
_stable_baselines3_version
Safe
5 Bytes
PPO with 3e6 iterations
over 2 years ago
data
Safe
21.8 kB
PPO with 3e6 iterations
over 2 years ago
policy.optimizer.pth
Safe
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.FloatStorage"
What is a pickle import?
84.9 kB
LFS
PPO with 3e6 iterations
over 2 years ago
policy.pth
Safe
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
What is a pickle import?
43.2 kB
LFS
PPO with 3e6 iterations
over 2 years ago
pytorch_variables.pth
Safe
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
431 Bytes
LFS
Upload model trained with PPO for 5e5 steps
over 2 years ago
system_info.txt
Safe
211 Bytes
PPO with 3e6 iterations
over 2 years ago