ORPO-Finetunes
Collection
https://gist.github.com/xzuyn/87097972ab2323ced81e4d7b41c47a45
•
7 items
•
Updated
Trained on NobodyExistsOnTheInternet/ToxicQAFinal. I converted the set to a preference dataset using refusals generated from LLaMa-3-Instruct-8B.