commit

Files changed (4)

README.md CHANGED

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5231
 ## Model description
@@ -52,19 +52,19 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 0.8788        | 0.2256 | 50   | 0.8798          |
-| 0.6183        | 0.4512 | 100  | 0.6252          |
-| 0.5702        | 0.6768 | 150  | 0.5789          |
-| 0.6166        | 0.9024 | 200  | 0.5593          |
-| 0.5435        | 1.1280 | 250  | 0.5481          |
-| 0.5317        | 1.3536 | 300  | 0.5401          |
-| 0.5013        | 1.5792 | 350  | 0.5345          |
-| 0.5488        | 1.8049 | 400  | 0.5307          |
-| 0.5368        | 2.0305 | 450  | 0.5280          |
-| 0.4823        | 2.2561 | 500  | 0.5259          |
-| 0.5289        | 2.4817 | 550  | 0.5245          |
-| 0.5358        | 2.7073 | 600  | 0.5237          |
-| 0.5462        | 2.9329 | 650  | 0.5231          |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5221
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 0.8799        | 0.2256 | 50   | 0.8806          |
+| 0.6125        | 0.4512 | 100  | 0.6177          |
+| 0.5659        | 0.6768 | 150  | 0.5733          |
+| 0.6125        | 0.9024 | 200  | 0.5550          |
+| 0.5402        | 1.1280 | 250  | 0.5448          |
+| 0.5299        | 1.3536 | 300  | 0.5378          |
+| 0.4997        | 1.5792 | 350  | 0.5331          |
+| 0.549         | 1.8049 | 400  | 0.5296          |
+| 0.5361        | 2.0305 | 450  | 0.5268          |
+| 0.4821        | 2.2561 | 500  | 0.5249          |
+| 0.5274        | 2.4817 | 550  | 0.5234          |
+| 0.5344        | 2.7073 | 600  | 0.5227          |
+| 0.546         | 2.9329 | 650  | 0.5221          |
 ### Framework versions
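
To make the README change easier to read, the two validation-loss columns can be compared step by step. The sketch below copies the values from the removed and added table rows above; the variable names (`old_val`, `new_val`, `deltas`) are illustrative and not part of the repository.

```python
# Validation losses copied from the removed (-) and added (+) table rows.
steps = [50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650]
old_val = [0.8798, 0.6252, 0.5789, 0.5593, 0.5481, 0.5401, 0.5345,
           0.5307, 0.5280, 0.5259, 0.5245, 0.5237, 0.5231]
new_val = [0.8806, 0.6177, 0.5733, 0.5550, 0.5448, 0.5378, 0.5331,
           0.5296, 0.5268, 0.5249, 0.5234, 0.5227, 0.5221]

# Negative delta means the new run has a lower validation loss at that step.
deltas = [round(new - old, 4) for old, new in zip(old_val, new_val)]
improved_steps = [s for s, d in zip(steps, deltas) if d < 0]

for step, old, new, delta in zip(steps, old_val, new_val, deltas):
    print(f"step {step:>3}: {old:.4f} -> {new:.4f} ({delta:+.4f})")
```

This only restates the numbers in the diff; it says nothing about whether the difference between the two runs is meaningful.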

adapter_config.json CHANGED

@@ -20,13 +20,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "q_proj",
-    "up_proj",
-    "k_proj",
     "o_proj",
-    "v_proj",
-    "gate_proj",
-    "down_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "v_proj",
     "q_proj",
     "o_proj",
+    "down_proj",
+    "k_proj",
+    "up_proj",
+    "gate_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,
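
The `target_modules` hunk looks large, but it may only be a reordering. A quick check, using the two lists exactly as they appear in the diff, is to compare them as sets. The variable names are illustrative, and the idea that the order change comes from serializing an unordered collection is an assumption, not something the diff confirms.

```python
# target_modules as listed on the old (-) and new (+) sides of the diff.
old_targets = ["q_proj", "up_proj", "k_proj", "o_proj",
               "v_proj", "gate_proj", "down_proj"]
new_targets = ["v_proj", "q_proj", "o_proj", "down_proj",
               "k_proj", "up_proj", "gate_proj"]

# Order-insensitive comparison: which modules were actually added or removed?
added = sorted(set(new_targets) - set(old_targets))
removed = sorted(set(old_targets) - set(new_targets))
same_modules = set(old_targets) == set(new_targets)

print("same modules:", same_modules)
print("added:", added, "removed:", removed)
```

If `same_modules` is true, the adapter targets the same projection layers in both revisions and only the serialized order differs.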

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6c413dc3dcf0baedba774975f93b08ab86adb4cf1d644d82b5088ea50c1bd1b1
 size 35668592

 version https://git-lfs.github.com/spec/v1
+oid sha256:b3e19071bfc5c8fcc5c64ae77a9c64c5ca1474c94b89671c75cc0e2f2f803b22
 size 35668592
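
The hunk above changes a Git LFS pointer rather than the weights file itself. As a minimal sketch, the two pointer versions can be parsed into key/value fields to confirm what changed; `parse_lfs_pointer` is a hypothetical helper written for this illustration, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of an LFS pointer into a dict entry."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields

# Pointer contents reconstructed from the old (-) and new (+) sides above.
old_ptr = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:6c413dc3dcf0baedba774975f93b08ab86adb4cf1d644d82b5088ea50c1bd1b1
size 35668592
""")
new_ptr = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:b3e19071bfc5c8fcc5c64ae77a9c64c5ca1474c94b89671c75cc0e2f2f803b22
size 35668592
""")

# Fields whose values differ between the two pointers.
changed = sorted(k for k in old_ptr if old_ptr[k] != new_ptr.get(k))
print("changed fields:", changed)
```

An identical `size` with a different `oid` is consistent with the adapter keeping the same tensor shapes while the weight values changed, though the pointer alone cannot prove that.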

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a7ea80feb8a2ff2b992fb9e3013b5abe59ebcf83a82ee40afbae3b78e5b99a29
 size 5496

 version https://git-lfs.github.com/spec/v1
+oid sha256:2d2d90cf6c5ffa30dd14523c8aefb12bb129b42ab45f589808fac387e89e28e6
 size 5496