Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
OpenRLHF
/
Mistral-7b-PRM-Math-Shepherd
like
1
Follow
OpenRLHF
16
Safetensors
mistral
Model card
Files
Files and versions
Community
1
Train
main
Mistral-7b-PRM-Math-Shepherd
/
README.md
chuyi777
Update README.md
41d1ad8
verified
29 days ago
preview
code
|
raw
Copy download link
history
blame
contribute
delete
Safe
173 Bytes
Process Reward Model trained by OpenRLHF
Dataset: Math-Shepherd (
https://huggingface.co/datasets/peiyi9979/Math-Shepherd
)
Learning Rate: 1e-6
Training Accuracy: 0.922