license: apache-2.0
datasets:
- jondurbin/truthy-dpo-v0.1
## Solarized-18B-truthy
Solarized-18B-truthy is Solarized-18B-dpo fine-tuned to improve truthfulness.
It is a frankenmerge model created using mergekit by alternating layers of Nous-Hermes-2-SOLAR-10.7B and SOLAR-10.7B-Instruct. We then applied DPO over a high-quality preference dataset.
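
To make the alternating-layer idea above concrete, here is a minimal sketch of how such a merge plan could be laid out. It is not the recipe used for this model: the layer count (48), block size, and overlap are assumptions chosen for illustration, the helper `alternating_slices` is hypothetical, and the model names are the short names from this card rather than verified repository ids. The resulting dictionary follows the general shape of a mergekit passthrough config (`slices`, `sources`, `layer_range`, `merge_method`), which mergekit reads as YAML.

```python
import json
from typing import Dict, List


def alternating_slices(model_a: str, model_b: str, num_layers: int,
                       block: int, overlap: int) -> List[Dict]:
    """Plan overlapping layer blocks, taken alternately from two models.

    Hypothetical helper for illustration; block and overlap are assumptions.
    """
    if not 0 <= overlap < block:
        raise ValueError("overlap must satisfy 0 <= overlap < block")
    models = (model_a, model_b)
    slices: List[Dict] = []
    start, i = 0, 0
    while True:
        end = min(start + block, num_layers)
        # Each slice copies layers [start, end) from a single source model.
        slices.append(
            {"sources": [{"model": models[i % 2], "layer_range": [start, end]}]}
        )
        if end == num_layers:
            break
        start = end - overlap  # step back so consecutive blocks overlap
        i += 1
    return slices


def total_layers(slices: List[Dict]) -> int:
    """Count the layers in the stacked (merged) model."""
    return sum(
        s["sources"][0]["layer_range"][1] - s["sources"][0]["layer_range"][0]
        for s in slices
    )


# Assumed values: 48 layers per source model, blocks of 16, overlap of 8.
config = {
    "slices": alternating_slices(
        "Nous-Hermes-2-SOLAR-10.7B",  # short name from this card
        "SOLAR-10.7B-Instruct",       # short name from this card
        num_layers=48, block=16, overlap=8,
    ),
    "merge_method": "passthrough",  # stack layers without averaging weights
    "dtype": "float16",
}

if __name__ == "__main__":
    print(json.dumps(config, indent=2))
    print("layers in merged model:", total_layers(config["slices"]))
```

Under these assumed settings the plan stacks more layers than either source model has, which is how a frankenmerge grows the parameter count; the actual slice boundaries for this model are not stated in this card.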
![image/png](https://cdn-uploads.huggingface.co/production/uploads/5fad8602b8423e1d80b8a965/rNtaTqTKrAoN5-C5DuPgu.png)