ryota39 commited on
Commit
bd7427e
1 Parent(s): 90e516a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -17
README.md CHANGED
@@ -16,30 +16,29 @@ model-index:
16
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
17
  should probably proofread and complete it, then remove this comment. -->
18
 
19
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/rspeech3399/huggingface/runs/uv90lda8)
20
- # out
 
 
 
21
 
22
- This model is a fine-tuned version of [studio-ousia/mluke-large-lite](https://huggingface.co/studio-ousia/mluke-large-lite) on an unknown dataset.
23
- It achieves the following results on the evaluation set:
24
- - Loss: 0.1615
25
- - Accuracy: 0.9399
26
- - Precision: 0.9346
27
- - Recall: 0.9460
28
- - F1: 0.9403
29
 
30
- ## Model description
31
 
32
- More information needed
 
 
33
 
34
- ## Intended uses & limitations
35
 
36
- More information needed
 
 
37
 
38
- ## Training and evaluation data
39
 
40
- More information needed
41
-
42
- ## Training procedure
43
 
44
  ### Training hyperparameters
45
 
 
16
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
17
  should probably proofread and complete it, then remove this comment. -->
18
 
19
+ ## Fine-tuning
20
+ - this model was trained to classify whether input text comes from "chosen sentence" or "rejected sentence"
21
+ - the probability (logits after passing softmax function) in last layer of this model can be used to quantify the preference from user input
22
+ - fine-tuned [studio-ousia/mluke-large-lite](https://huggingface.co/studio-ousia/mluke-large-lite) via full parameter tuning using [open-preference-v0.3](https://huggingface.co/datasets/ryota39/open_preference-v0.3)
23
+ - trained on bf16 format
24
 
25
+ ## Metric
 
 
 
 
 
 
26
 
27
+ - train and validation split
28
 
29
+ |train loss|eval loss|accuracy|recall|precision|f1-score|
30
+ |:---|:---|:---|:---|:---|:---|
31
+ |0.114|0.1615|0.9399|0.9459|0.9346|0.9402|
32
 
33
+ - test split
34
 
35
+ |accuracy|recall|precision|f1-score|
36
+ |:---|:---|:---|:---|
37
+ |0.9416|0.9319|0.9504|0.9411|
38
 
39
+ - confusion matrix when test split
40
 
41
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/651e3f30ca333f3c8df692b8/00ONMe0qlqv7XB14ttrPY.png)
 
 
42
 
43
  ### Training hyperparameters
44