WorldSimBench: Towards Video Generation Models as World Simulators Paper • 2410.18072 • Published Oct 23 • 18
LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark Paper • 2306.06687 • Published Jun 11, 2023 • 1
ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models Paper • 2311.02692 • Published Nov 5, 2023 • 1
Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models Paper • 2312.08962 • Published Dec 14, 2023
From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities Paper • 2401.15071 • Published Jan 26 • 35