ryota39
/

luke-japanese-base-lite-reward

Text Classification

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

ryota39 commited on Jul 5

Commit

a6eb815

•

1 Parent(s): ece0a61

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -18,6 +18,8 @@ model-index:
 - the probability (logits after passing softmax function) in last layer of this model can be used to quantify the preference from user input
 - fine-tuned [studio-ousia/mluke-large-lite](https://huggingface.co/studio-ousia/mluke-large-lite) via full parameter tuning using [open-preference-v0.3](https://huggingface.co/datasets/ryota39/open_preference-v0.3)
 - trained on bf16 format
 ## Metric

 - the probability (logits after passing softmax function) in last layer of this model can be used to quantify the preference from user input
 - fine-tuned [studio-ousia/mluke-large-lite](https://huggingface.co/studio-ousia/mluke-large-lite) via full parameter tuning using [open-preference-v0.3](https://huggingface.co/datasets/ryota39/open_preference-v0.3)
 - trained on bf16 format
+- Label 0 stands for rejected sentence
+- Label 1 stands for chosen sentence
 ## Metric