Aligning Large Language Models via Self-Steering Optimization Paper • 2410.17131 • Published 29 days ago • 21
Modulated Intervention Preference Optimization (MIPO): Keep the Easy, Refine the Difficult Paper • 2409.17545 • Published Sep 26 • 18