Trained on NobodyExistsOnTheInternet/ToxicQAFinal. I converted the set to a preference dataset using refusals generated from LLaMa-3-Instruct-8B.
Trained on NobodyExistsOnTheInternet/ToxicQAFinal. I converted the set to a preference dataset using refusals generated from LLaMa-3-Instruct-8B.