Update README.md
Browse files
README.md
CHANGED
@@ -24,12 +24,7 @@ We train [OpenRLHF/Llama-3-8b-sft-mixture](https://huggingface.co/OpenRLHF/Llama
|
|
24 |
with our calibrated reward model [HINT-lab/llama3-8b-crm-final-v0.1](https://huggingface.co/HINT-lab/llama3-8b-crm-final-v0.1).
|
25 |
|
26 |
- **Developed by:** Jixuan Leng, Chengsong Huang, Banghua Zhu, Jiaxin Huang
|
27 |
-
|
28 |
-
- **Shared by [optional]:** [More Information Needed] -->
|
29 |
-
<!-- - **Model type:** [More Information Needed]
|
30 |
-
- **Language(s) (NLP):** [More Information Needed]
|
31 |
-
- **License:** [More Information Needed] -->
|
32 |
-
- **Finetuned from model [optional]:** [OpenRLHF/Llama-3-8b-sft-mixture](https://huggingface.co/OpenRLHF/Llama-3-8b-sft-mixture)
|
33 |
|
34 |
### Model Sources [optional]
|
35 |
|
|
|
24 |
with our calibrated reward model [HINT-lab/llama3-8b-crm-final-v0.1](https://huggingface.co/HINT-lab/llama3-8b-crm-final-v0.1).
|
25 |
|
26 |
- **Developed by:** Jixuan Leng, Chengsong Huang, Banghua Zhu, Jiaxin Huang
|
27 |
+
- **Finetuned from model:** [OpenRLHF/Llama-3-8b-sft-mixture](https://huggingface.co/OpenRLHF/Llama-3-8b-sft-mixture)
|
|
|
|
|
|
|
|
|
|
|
28 |
|
29 |
### Model Sources [optional]
|
30 |
|