DynaSaur: Large Language Agents Beyond Predefined Actions Paper • 2411.01747 • Published 6 days ago • 13
LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding Paper • 2306.17107 • Published Jun 29, 2023 • 11