See our paper at https://huggingface.co/papers/2405.19332.
Shenao Zhang
ZhangShenao
AI & ML interests
None yet
Organizations
Collections
3
-
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
Text Generation • Updated • 241 • 5 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-2
Text Generation • Updated • 18 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-1
Text Generation • Updated • 24 -
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
Paper • 2405.19332 • Published • 15
Papers
1
models
24
ZhangShenao/gemma-1.1-7b-it_MetaMathQA_ent0.05_beam1_dosampleFalse_temp1.0_estep_
Updated
•
4
ZhangShenao/baseline-gemma-2-2b-it-sft
Text Generation
•
Updated
•
917
ZhangShenao/baseline-gemma-2-9b-it-sft
Text Generation
•
Updated
•
165
ZhangShenao/baseline-gemma-1.1-7b-it-sft
Text Generation
•
Updated
•
127
ZhangShenao/baseline-Mistral-7B-Instruct-v0.2-sft
Text Generation
•
Updated
•
139
ZhangShenao/baseline-Llama-3-8B-Instruct-sft
Text Generation
•
Updated
•
91
ZhangShenao/newgemma-2-2b-it-sft-m
Updated
•
5
ZhangShenao/gemma-2-2b-it-sft-m
Text Generation
•
Updated
•
48
ZhangShenao/gemma9b-sft-m
Text Generation
•
Updated
•
45
ZhangShenao/gemma9b-sft-m-pack
Updated