README.md · sun1638650145/Reinforce-Pixelcopter-PLE-v0 at main

metadata

tags:
  - Pixelcopter-PLE-v0
  - reinforce
  - reinforcement-learning
  - custom-implementation
  - deep-rl-class
model-index:
  - name: Reinforce-Pixelcopter-PLE-v0
    results:
      - metrics:
          - type: mean_reward
            value: 13.30 +/- 9.12
            name: mean_reward
        task:
          type: reinforcement-learning
          name: reinforcement-learning
        dataset:
          name: Pixelcopter-PLE-v0
          type: Pixelcopter-PLE-v0

  # 使用**Reinforce**智能体来玩**Pixelcopter-PLE-v0**
  这是一个使用**Reinforce**训练有素的模型玩**Pixelcopter-PLE-v0**.
  要学习使用这个模型并训练你的模型, 请查阅深度强化学习课程第5单元: https://github.com/huggingface/deep-rl-class/tree/main/unit5