Unraveling Cross-Modality Knowledge Conflict in Large Vision-Language Models Paper • 2410.03659 • Published Oct 4 • 6
mDPO: Conditional Preference Optimization for Multimodal Large Language Models Paper • 2406.11839 • Published Jun 17 • 37
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding Paper • 2406.09411 • Published Jun 13 • 18