- HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
  Paper • 2310.14566 • Published • 25
- TouchStone: Evaluating Vision-Language Models by Language Models
  Paper • 2308.16890 • Published • 1
donghunlee
hundong2
AI & ML interests
None yet
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet