Paper: Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization
Link: https://arxiv.org/abs/2404.00530
GitHub: https://github.com/Hritikbansal/dove