See our paper at https://huggingface.co/papers/2405.19332.
Shenao Zhang
ZhangShenao
AI & ML interests
None yet
Organizations
Collections
3
-
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
Text Generation • Updated • 98 • 4 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-2
Text Generation • Updated • 14 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-1
Text Generation • Updated • 15 -
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
Paper • 2405.19332 • Published • 15
Papers
1
models
10
ZhangShenao/SELM-Phi-3-mini-4k-instruct-iter-1
Text Generation
•
Updated
•
37
ZhangShenao/SELM-Phi-3-mini-4k-instruct-iter-2
Text Generation
•
Updated
•
34
ZhangShenao/SELM-Phi-3-mini-4k-instruct-iter-3
Text Generation
•
Updated
•
33
•
1
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-1
Text Generation
•
Updated
•
15
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-2
Text Generation
•
Updated
•
14
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
Text Generation
•
Updated
•
98
•
4
ZhangShenao/DPO-Zephyr-7B
Text Generation
•
Updated
•
5
ZhangShenao/SELM-Zephyr-7B-iter-1
Text Generation
•
Updated
•
8
ZhangShenao/SELM-Zephyr-7B-iter-2
Text Generation
•
Updated
•
7
ZhangShenao/SELM-Zephyr-7B-iter-3
Text Generation
•
Updated
•
12
•
3
datasets
34
ZhangShenao/Gemma-relabel-dpo
Viewer
•
Updated
•
122k
•
210
ZhangShenao/Gemma-relabel
Viewer
•
Updated
•
122k
•
5
ZhangShenao/Qwen-relabel-dpo
Viewer
•
Updated
•
122k
•
249
ZhangShenao/gcbinarized_posonly_ultrafeedback
Viewer
•
Updated
•
49.6k
•
2
ZhangShenao/gcbinarized_ultrafeedback_nosys
Viewer
•
Updated
•
97.1k
•
850
ZhangShenao/gcmode_fine_ultrafeedback
Viewer
•
Updated
•
97.1k
•
2
ZhangShenao/gcbinarized_fine_ultrafeedback
Viewer
•
Updated
•
97.1k
•
1.58k
ZhangShenao/newbin_ultrafeedback
Viewer
•
Updated
•
124k
•
2
ZhangShenao/gc_fine_ultrafeedback_nosys_noinst
Viewer
•
Updated
•
97.1k
•
2
ZhangShenao/newpgc_fine_ultrafeedback_implicit
Viewer
•
Updated
•
121k
•
2