THUDM/LongReward-glm4-9b-DPO
Knowledge Engineering Group (KEG) & Data Mining at Tsinghua University
Text Generation
Transformers
Safetensors
THUDM/LongReward-10k
English
Chinese
glm
chatglm
conversational
arxiv:2410.21252
License: glm-4
a5d184e
LongReward-glm4-9b-DPO/README.md
Commit History
Update README.md (56cf38d, verified), NeoZ123 committed 10 days ago
Update README.md (96f64b0, verified), NeoZ123 committed 10 days ago
readme (67ce66f), zR committed 11 days ago
new (ee5f6cf), zR committed 11 days ago
test (3fecc59), zR committed 11 days ago