MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning Paper • 2409.20566 • Published 1 day ago • 29
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 180 items • Updated about 11 hours ago • 24
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published 13 days ago • 127