Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding • Paper • 2411.04282 • Published 24 days ago • 30
Large Language Models Can Self-Improve in Long-context Reasoning • Paper • 2411.08147 • Published 18 days ago • 59