colin-abacus
commited on
Commit
•
34d54c6
1
Parent(s):
d136d4b
Update README.md
Browse files
README.md
CHANGED
@@ -43,7 +43,7 @@ With reference model jondurbin/bagel-34b-v0.2:
|
|
43 |
Please cite the paper if you use data, model, or method in this repo.
|
44 |
|
45 |
```
|
46 |
-
@article{
|
47 |
title={Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive},
|
48 |
author={Pal, Arka and Karkhanis, Deep and Dooley, Samuel and Roberts, Manley and Naidu, Siddartha and White, Colin},
|
49 |
journal={arXiv preprint arXiv:2402.13228},
|
|
|
43 |
Please cite the paper if you use data, model, or method in this repo.
|
44 |
|
45 |
```
|
46 |
+
@article{pal2024smaug,
|
47 |
title={Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive},
|
48 |
author={Pal, Arka and Karkhanis, Deep and Dooley, Samuel and Roberts, Manley and Naidu, Siddartha and White, Colin},
|
49 |
journal={arXiv preprint arXiv:2402.13228},
|