Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
THUDM
/
LongReward-glm4-9b-DPO
like
1
Follow
Knowledge Engineering Group (KEG) & Data Mining at Tsinghua University
1,144
Text Generation
Transformers
Safetensors
THUDM/LongReward-10k
English
Chinese
glm
chatglm
conversational
arxiv:
2410.21252
License:
glm-4
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
3fecc59
LongReward-glm4-9b-DPO
4 contributors
History:
2 commits
zR
test
3fecc59
11 days ago
.gitattributes
Safe
1.52 kB
initial commit
11 days ago
LICENSE
Safe
6.49 kB
test
11 days ago
README.md
Safe
1.76 kB
test
11 days ago
README_zh.md
Safe
2.98 kB
test
11 days ago