shareAI
/

llama3.1-8b-instruct-dpo-zh

Question Answering

Inference Endpoints

Model card Files Files and versions Community

Baicai003 commited on Jul 25

Commit

63250c7

•

1 Parent(s): f95efe0

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -19,7 +19,7 @@ Github：https://github.com/CrazyBoyM/llama3-Chinese-chat
 放出训练配方细节供网友参考分享：
 DPO(beta 0.5) + lora rank128, alpha256 + 打开"lm_head", "input_layernorm", "post_attention_layernorm", "norm"层训练.
 特点：偏好中文和emoji表情，且不损伤原instruct版模型能力。实测中文DPO版问答性能体验超过现在市面上任何llama3中文微调版 （微调会破坏llama3原版能力，导致遗忘）
-![Alt text](image.png)
 ### 模型部署
 网页脚本文件：https://github.com/CrazyBoyM/llama3-Chinese-chat/blob/main/deploy/web_streamlit_for_instruct_v2.py

 放出训练配方细节供网友参考分享：
 DPO(beta 0.5) + lora rank128, alpha256 + 打开"lm_head", "input_layernorm", "post_attention_layernorm", "norm"层训练.
 特点：偏好中文和emoji表情，且不损伤原instruct版模型能力。实测中文DPO版问答性能体验超过现在市面上任何llama3中文微调版 （微调会破坏llama3原版能力，导致遗忘）
+![Alt text](https://modelscope.cn/api/v1/models/baicai003/Llama3-Chinese-instruct-DPO-beta0.5/repo?Revision=master&FilePath=image.png&View=true)
 ### 模型部署
 网页脚本文件：https://github.com/CrazyBoyM/llama3-Chinese-chat/blob/main/deploy/web_streamlit_for_instruct_v2.py