PaulD commited on
Commit
3b6cf08
1 Parent(s): 85ec019

End of training

Browse files
README.md CHANGED
@@ -14,18 +14,18 @@ model-index:
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
17
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/pauld/huggingface/runs/hzsslk0p)
18
  # null
19
 
20
  This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on the None dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 0.5456
23
- - Eval/rewards/chosen: 3.9143
24
- - Eval/logps/chosen: -136.1776
25
- - Eval/rewards/rejected: 3.5677
26
- - Eval/logps/rejected: -182.5892
27
- - Eval/rewards/margins: 0.3466
28
- - Eval/kl: 35.7288
29
 
30
  ## Model description
31
 
@@ -44,7 +44,7 @@ More information needed
44
  ### Training hyperparameters
45
 
46
  The following hyperparameters were used during training:
47
- - learning_rate: 0.0001
48
  - train_batch_size: 1
49
  - eval_batch_size: 2
50
  - seed: 42
@@ -57,13 +57,13 @@ The following hyperparameters were used during training:
57
 
58
  ### Training results
59
 
60
- | Training Loss | Epoch | Step | Validation Loss | |
61
- |:-------------:|:------:|:----:|:---------------:|:-------:|
62
- | 0.6027 | 0.9677 | 15 | 0.5987 | 1.5787 |
63
- | 0.2879 | 2.0 | 31 | 0.6263 | 29.2556 |
64
- | 0.2962 | 2.9677 | 46 | 0.5909 | 33.4994 |
65
- | 0.132 | 4.0 | 62 | 0.5446 | 35.5494 |
66
- | 0.2602 | 4.8387 | 75 | 0.5456 | 35.7288 |
67
 
68
 
69
  ### Framework versions
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
17
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/pauld/huggingface/runs/5ep5fter)
18
  # null
19
 
20
  This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) on the None dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.6004
23
+ - Eval/rewards/chosen: 0.0713
24
+ - Eval/logps/chosen: -174.6075
25
+ - Eval/rewards/rejected: 0.0986
26
+ - Eval/logps/rejected: -217.2799
27
+ - Eval/rewards/margins: -0.0273
28
+ - Eval/kl: 0.7783
29
 
30
  ## Model description
31
 
 
44
  ### Training hyperparameters
45
 
46
  The following hyperparameters were used during training:
47
+ - learning_rate: 1e-05
48
  - train_batch_size: 1
49
  - eval_batch_size: 2
50
  - seed: 42
 
57
 
58
  ### Training results
59
 
60
+ | Training Loss | Epoch | Step | Validation Loss | |
61
+ |:-------------:|:------:|:----:|:---------------:|:------:|
62
+ | 0.5651 | 0.9677 | 15 | 0.6026 | 0.1513 |
63
+ | 0.5618 | 2.0 | 31 | 0.5999 | 0.3742 |
64
+ | 0.5484 | 2.9677 | 46 | 0.6006 | 0.6711 |
65
+ | 0.5466 | 4.0 | 62 | 0.6003 | 0.8158 |
66
+ | 0.6017 | 4.8387 | 75 | 0.6004 | 0.7783 |
67
 
68
 
69
  ### Framework versions
adapter_config.json CHANGED
@@ -20,9 +20,9 @@
20
  "rank_pattern": {},
21
  "revision": null,
22
  "target_modules": [
23
- "v_proj",
24
- "q_proj",
25
  "o_proj",
 
 
26
  "k_proj"
27
  ],
28
  "task_type": "CAUSAL_LM",
 
20
  "rank_pattern": {},
21
  "revision": null,
22
  "target_modules": [
 
 
23
  "o_proj",
24
+ "q_proj",
25
+ "v_proj",
26
  "k_proj"
27
  ],
28
  "task_type": "CAUSAL_LM",
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f7e591e83080650357ebdd51a92f634835ee2446cd551b76a94e0addcac5b3f6
3
  size 27297544
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4edc5b055047a4581200f67f01507ba846466a2b7282095139147dcd1ad9c1bd
3
  size 27297544
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:63e2effd728528405fb345aefe98f2d8b1313b9674f6f0761c236d1952a804c8
3
  size 5496
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e7fc5292ef250c9e91e56e402dfafbfefc1f1f354278c948929e86f2b4fca2b2
3
  size 5496