arxiv:2405.11143
Jian Hu
chuyi777
AI & ML interests
Reinforcement Learning
Recent Activity
liked a model 24 days ago: O1-OPEN/OpenO1-LLama-8B-v0.1
updated a model 29 days ago: OpenRLHF/Mistral-7b-PRM-Math-Shepherd
New activity 29 days ago: OpenRLHF/Mistral-7b-PRM-Math-Shepherd: How do I download the model?
Organizations
Papers
1
models: None public yet
datasets: None public yet