Update README.md
Browse files
README.md
CHANGED
@@ -16,30 +16,29 @@ model-index:
|
|
16 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
17 |
should probably proofread and complete it, then remove this comment. -->
|
18 |
|
19 |
-
|
20 |
-
|
|
|
|
|
|
|
21 |
|
22 |
-
|
23 |
-
It achieves the following results on the evaluation set:
|
24 |
-
- Loss: 0.1615
|
25 |
-
- Accuracy: 0.9399
|
26 |
-
- Precision: 0.9346
|
27 |
-
- Recall: 0.9460
|
28 |
-
- F1: 0.9403
|
29 |
|
30 |
-
|
31 |
|
32 |
-
|
|
|
|
|
33 |
|
34 |
-
|
35 |
|
36 |
-
|
|
|
|
|
37 |
|
38 |
-
|
39 |
|
40 |
-
|
41 |
-
|
42 |
-
## Training procedure
|
43 |
|
44 |
### Training hyperparameters
|
45 |
|
|
|
16 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
17 |
should probably proofread and complete it, then remove this comment. -->
|
18 |
|
19 |
+
## Fine-tuning
|
20 |
+
- this model was trained to classify whether input text comes from "chosen sentence" or "rejected sentence"
|
21 |
+
- the probability (logits after passing softmax function) in last layer of this model can be used to quantify the preference from user input
|
22 |
+
- fine-tuned [studio-ousia/mluke-large-lite](https://huggingface.co/studio-ousia/mluke-large-lite) via full parameter tuning using [open-preference-v0.3](https://huggingface.co/datasets/ryota39/open_preference-v0.3)
|
23 |
+
- trained on bf16 format
|
24 |
|
25 |
+
## Metric
|
|
|
|
|
|
|
|
|
|
|
|
|
26 |
|
27 |
+
- train and validation split
|
28 |
|
29 |
+
|train loss|eval loss|accuracy|recall|precision|f1-score|
|
30 |
+
|:---|:---|:---|:---|:---|:---|
|
31 |
+
|0.114|0.1615|0.9399|0.9459|0.9346|0.9402|
|
32 |
|
33 |
+
- test split
|
34 |
|
35 |
+
|accuracy|recall|precision|f1-score|
|
36 |
+
|:---|:---|:---|:---|
|
37 |
+
|0.9416|0.9319|0.9504|0.9411|
|
38 |
|
39 |
+
- confusion matrix when test split
|
40 |
|
41 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/651e3f30ca333f3c8df692b8/00ONMe0qlqv7XB14ttrPY.png)
|
|
|
|
|
42 |
|
43 |
### Training hyperparameters
|
44 |
|