Files changed (1) hide show
  1. README.md +14 -1
README.md CHANGED
@@ -1 +1,14 @@
1
- This is a model released from the preprint: *[SimPO: Simple Preference Optimization with a Reference-Free Reward](https://arxiv.org/abs/2405.14734)* Please refer to our [repository](https://github.com/princeton-nlp/SimPO) for more details.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: llama3
3
+ pipeline_tag: text-generation
4
+ library_name: transformers
5
+ tags:
6
+ - KTO
7
+ datasets:
8
+ - princeton-nlp/llama3-ultrafeedback
9
+ language:
10
+ - en
11
+ base_model:
12
+ - meta-llama/Meta-Llama-3-8B-Instruct
13
+ ---
14
+ This is a model released from the preprint: *[SimPO: Simple Preference Optimization with a Reference-Free Reward](https://arxiv.org/abs/2405.14734)* Please refer to our [repository](https://github.com/princeton-nlp/SimPO) for more details.