Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
THUDM
/
LongReward-glm4-9b-DPO
like
1
Follow
Knowledge Engineering Group (KEG) & Data Mining at Tsinghua University
1,145
Text Generation
Transformers
Safetensors
THUDM/LongReward-10k
English
Chinese
glm
chatglm
conversational
arxiv:
2410.21252
License:
glm-4
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
a5d184e
LongReward-glm4-9b-DPO
Commit History
Update README_zh.md
a5d184e
verified
NeoZ123
commited on
10 days ago
Update README.md
56cf38d
verified
NeoZ123
commited on
10 days ago
Update README_zh.md
5cd4f53
verified
NeoZ123
commited on
10 days ago
Update README.md
96f64b0
verified
NeoZ123
commited on
10 days ago
readme
67ce66f
zR
commited on
11 days ago
remove
44fb424
Ubuntu
commited on
11 days ago
upload model for transformers>=4.46
fd1f187
Ubuntu
commited on
11 days ago
new
ee5f6cf
zR
commited on
11 days ago
test
3fecc59
zR
commited on
11 days ago
initial commit
0bf839d
verified
zRzRzRzRzRzRzR
commited on
11 days ago