StableLM-WI-DPO / checkpoint-45 /adapter_model.safetensors

Commit History

DPO on llm-feedback v1 dataset
3d648bb

JayanthB commited on