Aligning Large Language Models with Counterfactual DPO • Paper • 2401.09566 • Published Jan 17