End of training

Files changed (4)

README.md +16 -16
adapter_config.json +2 -2
adapter_model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -14,18 +14,18 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/pauld/huggingface/runs/hzsslk0p)
 # null
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5456
-- Eval/rewards/chosen: 3.9143
-- Eval/logps/chosen: -136.1776
-- Eval/rewards/rejected: 3.5677
-- Eval/logps/rejected: -182.5892
-- Eval/rewards/margins: 0.3466
-- Eval/kl: 35.7288
 ## Model description
@@ -44,7 +44,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0001
 - train_batch_size: 1
 - eval_batch_size: 2
 - seed: 42
@@ -57,13 +57,13 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss | Eval/kl |
-|:-------------:|:------:|:----:|:---------------:|:-------:|
-| 0.6027        | 0.9677 | 15   | 0.5987          | 1.5787  |
-| 0.2879        | 2.0    | 31   | 0.6263          | 29.2556 |
-| 0.2962        | 2.9677 | 46   | 0.5909          | 33.4994 |
-| 0.132         | 4.0    | 62   | 0.5446          | 35.5494 |
-| 0.2602        | 4.8387 | 75   | 0.5456          | 35.7288 |
 ### Framework versions

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/pauld/huggingface/runs/5ep5fter)
 # null
 This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6004
+- Eval/rewards/chosen: 0.0713
+- Eval/logps/chosen: -174.6075
+- Eval/rewards/rejected: 0.0986
+- Eval/logps/rejected: -217.2799
+- Eval/rewards/margins: -0.0273
+- Eval/kl: 0.7783
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-05
 - train_batch_size: 1
 - eval_batch_size: 2
 - seed: 42
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Eval/kl |
+|:-------------:|:------:|:----:|:---------------:|:-------:|
+| 0.5651        | 0.9677 | 15   | 0.6026          | 0.1513  |
+| 0.5618        | 2.0    | 31   | 0.5999          | 0.3742  |
+| 0.5484        | 2.9677 | 46   | 0.6006          | 0.6711  |
+| 0.5466        | 4.0    | 62   | 0.6003          | 0.8158  |
+| 0.6017        | 4.8387 | 75   | 0.6004          | 0.7783  |
 ### Framework versions
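
The evaluation numbers in this diff can be cross-checked with a little arithmetic. The sketch below assumes that the reported margin is the chosen reward minus the rejected reward, and that one epoch corresponds to 15.5 optimizer steps; both are inferred from the figures above rather than stated in the model card, and the variable names are illustrative.

```python
# Metrics copied from the README.md diff; the dictionary layout is illustrative.
runs = {
    "before (learning_rate=0.0001)": {"chosen": 3.9143, "rejected": 3.5677, "margin": 0.3466},
    "after (learning_rate=1e-05)": {"chosen": 0.0713, "rejected": 0.0986, "margin": -0.0273},
}

TOL = 5e-5  # half a unit in the fourth decimal place, the precision of the card


def reward_margin(chosen: float, rejected: float) -> float:
    """Reward margin, ASSUMED here to be chosen minus rejected."""
    return chosen - rejected


for name, m in runs.items():
    # Each reported margin should match chosen - rejected to 4 decimals.
    assert abs(reward_margin(m["chosen"], m["rejected"]) - m["margin"]) < TOL, name

# ASSUMPTION: steps per epoch inferred from the row "step 31 <-> epoch 2.0".
STEPS_PER_EPOCH = 31 / 2.0

# (step, epoch) pairs copied from the training results table.
logged = [(15, 0.9677), (31, 2.0), (46, 2.9677), (62, 4.0), (75, 4.8387)]
for step, epoch in logged:
    # Every logged epoch should equal step / STEPS_PER_EPOCH to 4 decimals.
    assert abs(step / STEPS_PER_EPOCH - epoch) < TOL, (step, epoch)
```

If the assumptions hold, the run with the lower learning rate ends with a slightly negative margin (rejected completions scored marginally above chosen ones) and a much smaller KL value than the earlier run.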

adapter_config.json CHANGED

@@ -20,9 +20,9 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
-    "q_proj",
     "o_proj",
     "k_proj"
   ],
   "task_type": "CAUSAL_LM",

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "o_proj",
+    "q_proj",
+    "v_proj",
     "k_proj"
   ],
   "task_type": "CAUSAL_LM",

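The adapter_config.json hunk only reorders the entries of `target_modules`; the same four projection modules appear on both sides. A minimal sketch that compares the two lists as sets is shown here. The JSON fragments are trimmed to the one key in question, and treating order as irrelevant is an assumption about how the adapter loader reads this field.

```python
import json

# Only the target_modules key is reproduced; the other keys are unchanged in the diff.
old_cfg = json.loads('{"target_modules": ["v_proj", "q_proj", "o_proj", "k_proj"]}')
new_cfg = json.loads('{"target_modules": ["o_proj", "q_proj", "v_proj", "k_proj"]}')


def same_targets(a: dict, b: dict) -> bool:
    """True when both configs name the same modules, ignoring list order."""
    return set(a["target_modules"]) == set(b["target_modules"])


print(same_targets(old_cfg, new_cfg))  # prints True
```

Under that assumption, the changes to the adapter weights come from the retraining itself, not from a different choice of adapted layers.
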
adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f7e591e83080650357ebdd51a92f634835ee2446cd551b76a94e0addcac5b3f6
 size 27297544

 version https://git-lfs.github.com/spec/v1
+oid sha256:4edc5b055047a4581200f67f01507ba846466a2b7282095139147dcd1ad9c1bd
 size 27297544

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:63e2effd728528405fb345aefe98f2d8b1313b9674f6f0761c236d1952a804c8
 size 5496

 version https://git-lfs.github.com/spec/v1
+oid sha256:e7fc5292ef250c9e91e56e402dfafbfefc1f1f354278c948929e86f2b4fca2b2
 size 5496
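
The two binary files are stored as Git LFS pointers, so the diff shows only a new `oid` with an unchanged `size`. A hedged sketch of how such a pointer could be parsed and checked against a downloaded blob follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the code handles only the three keys that appear in this diff.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(blob: bytes, pointer: dict) -> bool:
    """Check that a blob has the size and SHA-256 digest the pointer records."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(blob) == pointer["size"]
        and hashlib.sha256(blob).hexdigest() == pointer["digest"]
    )


# New pointer for training_args.bin, copied from the diff without the +/- markers.
new_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:e7fc5292ef250c9e91e56e402dfafbfefc1f1f354278c948929e86f2b4fca2b2\n"
    "size 5496\n"
)
```

With the file downloaded, `matches_pointer(open("training_args.bin", "rb").read(), new_pointer)` would confirm that the local copy corresponds to this commit.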