Hanze Dong

hendrydong

https://hendrydong.github.io

AI & ML interests

None yet

Recent Activity

New activity 9 days ago

RLHFlow/LLaMA3.2-1B-SFT:the training data for this model?

New activity about 2 months ago

sfairXC/FsfairX-LLaMA3-RM-v0.1:Update README.md

updated a model about 2 months ago

sfairXC/FsfairX-LLaMA3-RM-v0.1

Papers 11

arxiv:2410.04698

arxiv:2407.21018

arxiv:2405.07863

arxiv:2312.11456

Models 5

hendrydong/dpo_offline_700K

Text Generation • Updated Aug 3 • 5

hendrydong/llama3

hendrydong/dpo_K8_max_max

Text Generation • Updated Apr 2 • 6

hendrydong/Mistral-RM-for-RAFT-GSHF-v0

Text Classification • Updated Mar 23 • 9 • 1

hendrydong/Mistral-RM-baseline-No-Safety-Alignment

Text Classification • Updated Mar 23 • 7

Datasets 4

hendrydong/preference_700K

Viewer • Updated Sep 28 • 700k • 906 • 7

hendrydong/prompt-0814

Viewer • Updated Aug 14 • 176k • 33

hendrydong/hendrycks_math_prompt

Viewer • Updated Aug 8 • 12.5k • 32

hendrydong/rlhf_helpful_eval

Viewer • Updated Dec 18, 2023 • 5.74k • 84