arxiv:2410.04698
Hanze Dong
hendrydong
AI & ML interests
None yet
Recent Activity
New activity
9 days ago
RLHFlow/LLaMA3.2-1B-SFT:the training data for this model?
New activity
about 2 months ago
sfairXC/FsfairX-LLaMA3-RM-v0.1:Update README.md
updated
a model
about 2 months ago
sfairXC/FsfairX-LLaMA3-RM-v0.1
Organizations
Papers
11
models
5
hendrydong/dpo_offline_700K
Text Generation
•
Updated
•
5
hendrydong/llama3
Updated
hendrydong/dpo_K8_max_max
Text Generation
•
Updated
•
6
hendrydong/Mistral-RM-for-RAFT-GSHF-v0
Text Classification
•
Updated
•
9
•
1
hendrydong/Mistral-RM-baseline-No-Safety-Alignment
Text Classification
•
Updated
•
7