ryota39 commited on
Commit
a6eb815
1 Parent(s): ece0a61

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -18,6 +18,8 @@ model-index:
18
  - the probability (logits after passing softmax function) in last layer of this model can be used to quantify the preference from user input
19
  - fine-tuned [studio-ousia/mluke-large-lite](https://huggingface.co/studio-ousia/mluke-large-lite) via full parameter tuning using [open-preference-v0.3](https://huggingface.co/datasets/ryota39/open_preference-v0.3)
20
  - trained on bf16 format
 
 
21
 
22
  ## Metric
23
 
 
18
  - the probability (logits after passing softmax function) in last layer of this model can be used to quantify the preference from user input
19
  - fine-tuned [studio-ousia/mluke-large-lite](https://huggingface.co/studio-ousia/mluke-large-lite) via full parameter tuning using [open-preference-v0.3](https://huggingface.co/datasets/ryota39/open_preference-v0.3)
20
  - trained on bf16 format
21
+ - Label 0 stands for rejected sentence
22
+ - Label 1 stands for chosen sentence
23
 
24
  ## Metric
25