vicgalle's picture
Update README.md
b4a01c1 verified
---
license: apache-2.0
datasets:
- jondurbin/truthy-dpo-v0.1
---
## Solarized-18B-truthy
Solarized-18B-dpo fine-tuned to improve truthfulness.
It is a frankenmerge model created using mergekit, alternating layers of Nous-Hermes-2-SOLAR-10.7B and SOLAR-10.7B-Instruct. Then, we applied DPO over a high-quality preference dataset.
![image/png](https://cdn-uploads.huggingface.co/production/uploads/5fad8602b8423e1d80b8a965/rNtaTqTKrAoN5-C5DuPgu.png)