SleepyGorilla/Mistral_7B

Browse files

Files changed (4) hide show

README.md +14 -14
adapter_model.safetensors +1 -1
runs/Mar16_18-59-36_f0843e752b6c/events.out.tfevents.1710615755.f0843e752b6c.971.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -18,15 +18,15 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/OpenHermes-2-Mistral-7B-GPTQ](https://huggingface.co/TheBloke/OpenHermes-2-Mistral-7B-GPTQ) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6066
-- Rewards/chosen: 0.0894
-- Rewards/rejected: -0.0023
-- Rewards/accuracies: 0.5
-- Rewards/margins: 0.0917
-- Logps/rejected: -158.0340
-- Logps/chosen: -152.2324
-- Logits/rejected: -2.3521
-- Logits/chosen: -2.4734
 ## Model description
@@ -59,11 +59,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Rewards/chosen | Rewards/rejected | Rewards/accuracies | Rewards/margins | Logps/rejected | Logps/chosen | Logits/rejected | Logits/chosen |
 |:-------------:|:-----:|:----:|:---------------:|:--------------:|:----------------:|:------------------:|:---------------:|:--------------:|:------------:|:---------------:|:-------------:|
-| 0.6778        | 0.01  | 10   | 0.6796          | -0.0179        | -0.0353          | 0.625              | 0.0174          | -158.3634      | -153.3049    | -2.3602         | -2.4849       |
-| 0.7178        | 0.01  | 20   | 0.6321          | 0.0123         | -0.0701          | 0.5625             | 0.0824          | -158.7113      | -153.0028    | -2.3602         | -2.4815       |
-| 0.7263        | 0.01  | 30   | 0.6050          | 0.0538         | -0.0401          | 0.5                | 0.0939          | -158.4116      | -152.5882    | -2.3567         | -2.4779       |
-| 0.6941        | 0.02  | 40   | 0.6047          | 0.0834         | -0.0114          | 0.5                | 0.0948          | -158.1251      | -152.2924    | -2.3545         | -2.4749       |
-| 0.6889        | 0.03  | 50   | 0.6066          | 0.0894         | -0.0023          | 0.5                | 0.0917          | -158.0340      | -152.2324    | -2.3521         | -2.4734       |
 ### Framework versions

 This model is a fine-tuned version of [TheBloke/OpenHermes-2-Mistral-7B-GPTQ](https://huggingface.co/TheBloke/OpenHermes-2-Mistral-7B-GPTQ) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4618
+- Rewards/chosen: -0.7871
+- Rewards/rejected: -6.6095
+- Rewards/accuracies: 0.9375
+- Rewards/margins: 5.8223
+- Logps/rejected: -220.7533
+- Logps/chosen: -104.2417
+- Logits/rejected: -1.9929
+- Logits/chosen: -2.4654
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Rewards/chosen | Rewards/rejected | Rewards/accuracies | Rewards/margins | Logps/rejected | Logps/chosen | Logits/rejected | Logits/chosen |
 |:-------------:|:-----:|:----:|:---------------:|:--------------:|:----------------:|:------------------:|:---------------:|:--------------:|:------------:|:---------------:|:-------------:|
+| 0.5633        | 0.0   | 10   | 0.5184          | 0.0815         | -0.3179          | 1.0                | 0.3995          | -157.8381      | -95.5550     | -2.1002         | -2.5591       |
+| 0.3936        | 0.0   | 20   | 0.2875          | 0.2689         | -1.1716          | 1.0                | 1.4404          | -166.3746      | -93.6817     | -2.0675         | -2.5604       |
+| 0.2132        | 0.0   | 30   | 0.2000          | 0.0966         | -2.8012          | 0.9375             | 2.8977          | -182.6702      | -95.4047     | -2.0286         | -2.5520       |
+| 0.034         | 0.0   | 40   | 0.4078          | -0.5324        | -5.2327          | 0.9375             | 4.7003          | -206.9856      | -101.6947    | -2.0143         | -2.4945       |
+| 0.0665        | 0.0   | 50   | 0.4618          | -0.7871        | -6.6095          | 0.9375             | 5.8223          | -220.7533      | -104.2417    | -1.9929         | -2.4654       |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6d02a50a48b799381d2fda7c83289013677bec6f6cf3bd48d90a469de0e5200d
 size 13648432

 version https://git-lfs.github.com/spec/v1
+oid sha256:8ed8c23e952bf13275720438d5d8d1c6a13442e0255a60ba3d8d314d209927e0
 size 13648432

runs/Mar16_18-59-36_f0843e752b6c/events.out.tfevents.1710615755.f0843e752b6c.971.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a0d94056905effb0affb7732cd532de2a1f24b99e4217269ca04772ba8aad093
+size 13334

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9aa71f747c0e84e909ff14e782bb48b4c444b31fcb9b3b30054f2d6014c4595c
 size 4475

 version https://git-lfs.github.com/spec/v1
+oid sha256:6b990e32d43a9f3ad091100b93ca5b758b676985de0fb98bd08c011f72e533bc
 size 4475