fblgit
/

una-llama-7b

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions

fblgit commited on Dec 19, 2023

Commit

6321d1b

•

1 Parent(s): d5bbd75

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -10,9 +10,9 @@ model-index:
 ---
 # una-llama-7b
-This model is a fine-tuned version of [huggyllama/llama-7b](https://huggingface.co/huggyllama/llama-7b) on the allenai/ultrafeedback_binarized_cleaned dataset.
-It achieves the following results on the evaluation set:
 - Loss: 0.5529
 - Rewards/chosen: 0.3633
 - Rewards/rejected: -0.1873

 ---
 # una-llama-7b
+**UNA: Uniform Neural Alignment** It increases 6.75% the performance of the pre-trained base LLaMA (1) 7B.
+This model is a fine-tuned version of [huggyllama/llama-7b](https://huggingface.co/huggyllama/llama-7b):
 - Loss: 0.5529
 - Rewards/chosen: 0.3633
 - Rewards/rejected: -0.1873