-
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers
Paper • 2408.06195 • Published • 62 -
Training Language Models to Self-Correct via Reinforcement Learning
Paper • 2409.12917 • Published • 135 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 33 -
Self-Reflection in LLM Agents: Effects on Problem-Solving Performance
Paper • 2405.06682 • Published • 3
Christophe Protat
Chris126
AI & ML interests
None yet
Organizations
None yet