euclid-multimodal/Euclid-convnext-xxlarge-120524 Question Answering • Updated about 8 hours ago • 35 • 2
euclid-multimodal/Euclid-convnext-large-120524 Question Answering • Updated about 8 hours ago • 41 • 2
ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer Paper • 2412.07720 • Published 3 days ago • 28
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions Paper • 2412.08737 • Published 1 day ago • 19
euclid-multimodal/Euclid-convnext-xxlarge-120524 Question Answering • Updated about 8 hours ago • 35 • 2
euclid-multimodal/Euclid-convnext-large-120524 Question Answering • Updated about 8 hours ago • 41 • 2
euclid-multimodal/Euclid-convnext-large-120524 Question Answering • Updated about 8 hours ago • 41 • 2
MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning Paper • 2404.13591 • Published Apr 21 • 2
ViCrop: Perceiving Small Visual Details in Zero-shot Visual Question Answering with Multimodal Large Language Models Paper • 2310.16033 • Published Oct 24, 2023
The Curious Case of Nonverbal Abstract Reasoning with Multi-Modal Large Language Models Paper • 2401.12117 • Published Jan 22 • 1
MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning Paper • 2404.13591 • Published Apr 21 • 2