SleepyGorilla committed on
Commit 0e90f21
1 Parent(s): bababa8

SleepyGorilla/Mistral_7B

README.md CHANGED
@@ -18,15 +18,15 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [TheBloke/OpenHermes-2-Mistral-7B-GPTQ](https://huggingface.co/TheBloke/OpenHermes-2-Mistral-7B-GPTQ) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.0082
- - Rewards/chosen: -0.7999
- - Rewards/rejected: -11.8804
+ - Loss: 0.0132
+ - Rewards/chosen: -1.4792
+ - Rewards/rejected: -8.5855
  - Rewards/accuracies: 1.0
- - Rewards/margins: 11.0804
- - Logps/rejected: -383.9451
- - Logps/chosen: -160.8140
- - Logits/rejected: -2.4692
- - Logits/chosen: -2.6059
+ - Rewards/margins: 7.1064
+ - Logps/rejected: -319.5252
+ - Logps/chosen: -138.4254
+ - Logits/rejected: -2.3872
+ - Logits/chosen: -2.5369
 
  ## Model description
 
@@ -59,11 +59,11 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Rewards/chosen | Rewards/rejected | Rewards/accuracies | Rewards/margins | Logps/rejected | Logps/chosen | Logits/rejected | Logits/chosen |
  |:-------------:|:-----:|:----:|:---------------:|:--------------:|:----------------:|:------------------:|:---------------:|:--------------:|:------------:|:---------------:|:-------------:|
- | 0.5652 | 0.0 | 10 | 0.4581 | 0.1358 | -0.6615 | 1.0 | 0.7973 | -271.7569 | -151.4569 | -2.4902 | -2.7008 |
- | 0.3737 | 0.0 | 20 | 0.1877 | 0.0778 | -2.8831 | 1.0 | 2.9609 | -293.9724 | -152.0366 | -2.4897 | -2.6893 |
- | 0.2022 | 0.0 | 30 | 0.0621 | -0.1154 | -6.0503 | 1.0 | 5.9349 | -325.6448 | -153.9687 | -2.4890 | -2.6603 |
- | 0.0284 | 0.0 | 40 | 0.0155 | -0.5833 | -10.1231 | 1.0 | 9.5397 | -366.3722 | -158.6483 | -2.4792 | -2.6266 |
- | 0.0593 | 0.0 | 50 | 0.0082 | -0.7999 | -11.8804 | 1.0 | 11.0804 | -383.9451 | -160.8140 | -2.4692 | -2.6059 |
+ | 0.5575 | 0.0 | 10 | 0.4017 | 0.0150 | -0.6143 | 1.0 | 0.6293 | -239.8125 | -123.4837 | -2.4102 | -2.6084 |
+ | 0.3781 | 0.0 | 20 | 0.1298 | -0.2390 | -2.2414 | 1.0 | 2.0025 | -256.0842 | -126.0231 | -2.3786 | -2.6120 |
+ | 0.219 | 0.0 | 30 | 0.0410 | -0.5640 | -4.3638 | 1.0 | 3.7998 | -277.3080 | -129.2739 | -2.3879 | -2.5872 |
+ | 0.038 | 0.0 | 40 | 0.0168 | -1.2083 | -7.3369 | 1.0 | 6.1286 | -307.0389 | -135.7168 | -2.3962 | -2.5566 |
+ | 0.0669 | 0.0 | 50 | 0.0132 | -1.4792 | -8.5855 | 1.0 | 7.1064 | -319.5252 | -138.4254 | -2.3872 | -2.5369 |
 
 
  ### Framework versions
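
Reviewer note: the reported `Rewards/margins` values can be sanity-checked against the other two reward columns. The sketch below assumes the common preference-tuning convention that the margin is the chosen reward minus the rejected reward; the README does not state this, so treat it as an assumption. The numbers are the final evaluation metrics copied from the diff above, and `margin_gap` is a hypothetical helper name.

```python
# Final evaluation metrics copied from the README diff
# ("before" = removed lines, "after" = added lines).
runs = {
    "before": {"chosen": -0.7999, "rejected": -11.8804, "margin": 11.0804},
    "after": {"chosen": -1.4792, "rejected": -8.5855, "margin": 7.1064},
}


def margin_gap(metrics):
    """Absolute difference between (chosen - rejected) and the reported margin.

    Assumes margin = chosen reward - rejected reward (not stated in the source).
    """
    recomputed = metrics["chosen"] - metrics["rejected"]
    return abs(recomputed - metrics["margin"])


for name, metrics in runs.items():
    # The README rounds every metric to four decimals, so allow a small tolerance.
    status = "consistent" if margin_gap(metrics) < 1e-3 else "inconsistent"
    print(f"{name}: reported margin {metrics['margin']} is {status}")
```

Under that assumption both sets of metrics agree to within rounding, and the new run ends with a smaller margin (7.1064 versus 11.0804) and a slightly higher validation loss (0.0132 versus 0.0082) at step 50.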
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:8aad9c8eb98d77811ee0c01df9374f8797047a2d66645d3baab7eb30b46d3872
+ oid sha256:58a9849f918c2040073e354ef8b6860deae289185db34920861266a53e2e876e
  size 13648432
runs/Mar18_11-00-46_0d00f13c1b40/events.out.tfevents.1710759826.0d00f13c1b40.247.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:500e94250634a61337d19cb66fdd2913cbcfdb21a2b0c39133dd60061c3f8ca4
+ size 13334
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:25b852a42ed5f46c5c36a13523b618fee1a1f050d50b0eba580c2957cfc51a1e
+ oid sha256:4087bb6f460ec3cc9afe55fb6b825229b72adfd18a1c406f7ebba757842a8f84
  size 4475
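
Reviewer note: the binary files in this commit appear only as Git LFS pointer files, so the diff shows a changed `oid` and an unchanged `size` rather than the contents. As a rough illustration, the sketch below parses two pointers and reports what changed. `parse_lfs_pointer` is a hypothetical helper written for this note, not part of any Git LFS tooling, and it handles only the three keys visible in this diff.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a dict (hypothetical helper).

    Each line is "<key> <value>"; only version, oid and size are handled here.
    """
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Old and new pointers for adapter_model.safetensors, copied from the diff.
old = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:8aad9c8eb98d77811ee0c01df9374f8797047a2d66645d3baab7eb30b46d3872
size 13648432
""")
new = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:58a9849f918c2040073e354ef8b6860deae289185db34920861266a53e2e876e
size 13648432
""")

content_changed = old["digest"] != new["digest"]
size_changed = old["size"] != new["size"]
print(f"content changed: {content_changed}, size changed: {size_changed}")
```

A new digest with an identical size is what one would expect if the adapter weights were retrained with the same architecture, though the diff alone does not confirm that.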