ZhangShenao
commited on
Commit
•
141a346
1
Parent(s):
862e9b8
Update README.md
Browse files
README.md
CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
18 |
|
19 |
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment.
|
20 |
|
21 |
-
# SELM-Zephyr-7B-iter-
|
22 |
|
23 |
This model is a fine-tuned version of [ZhangShenao/SELM-Zephyr-7B-iter-1](https://huggingface.co/ZhangShenao/SELM-Zephyr-7B-iter-1) using synthetic data based on on the HuggingFaceH4/ultrafeedback_binarized dataset.
|
24 |
|
|
|
18 |
|
19 |
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment.
|
20 |
|
21 |
+
# SELM-Zephyr-7B-iter-2
|
22 |
|
23 |
This model is a fine-tuned version of [ZhangShenao/SELM-Zephyr-7B-iter-1](https://huggingface.co/ZhangShenao/SELM-Zephyr-7B-iter-1) using synthetic data based on on the HuggingFaceH4/ultrafeedback_binarized dataset.
|
24 |
|