Galeros
/

ppo-LunarLander-v2

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Model card Files Files and versions Metrics Training metrics Community

ppo-LunarLander-v2 / galeos_model_lander_ppo

2 contributors

History: 3 commits

Galeros's picture

PPO with 3e6 iterations

81e6c97 over 2 years ago

_stable_baselines3_version

5 Bytes

PPO with 3e6 iterations over 2 years ago
data

21.8 kB

PPO with 3e6 iterations over 2 years ago
policy.optimizer.pth
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "torch.FloatStorage"
What is a pickle import?
84.9 kB
LFS

PPO with 3e6 iterations over 2 years ago
policy.pth
Detected Pickle imports (3)
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict"
What is a pickle import?
43.2 kB
LFS

PPO with 3e6 iterations over 2 years ago
pytorch_variables.pth
Pickle imports
- No problematic imports detected
What is a pickle import?
431 Bytes
LFS

Upload model trained with PPO for 5e5 steps over 2 years ago
system_info.txt

211 Bytes

PPO with 3e6 iterations over 2 years ago