The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio • Paper • 2410.12787 • Published 22 days ago • 30
MobA: A Two-Level Agent System for Efficient Mobile Task Automation • Paper • 2410.13757 • Published 21 days ago • 30
MULTI: Multimodal Understanding Leaderboard with Text and Images • Paper • 2402.03173 • Published Feb 5 • 3