vicgalle
/

solarized-18B-truthy

Text Generation

text-generation-inference

Inference Endpoints

8-bit precision

Model card Files Files and versions Community

solarized-18B-truthy / README.md

vicgalle's picture

Update README.md

b4a01c1 verified 10 months ago

|

history blame contribute delete

452 Bytes

	---
	license: apache-2.0
	datasets:
	- jondurbin/truthy-dpo-v0.1
	---

	## Solarized-18B-truthy

	Solarized-18B-dpo fine-tuned to improve truthfulness.

	It is a frankenmerge model created using mergekit, alternating layers of Nous-Hermes-2-SOLAR-10.7B and SOLAR-10.7B-Instruct. Then, we applied DPO over a high-quality preference dataset.

	![image/png](https://cdn-uploads.huggingface.co/production/uploads/5fad8602b8423e1d80b8a965/rNtaTqTKrAoN5-C5DuPgu.png)