ViDoRe Benchmark Collection Benchmark for document retrieval using visual features, introduced in the ColPali paper. Datasets are using the QA format. โข 10 items โข Updated 10 days ago โข 11
view article Article ColPali: Efficient Document Retrieval with Vision Language Models ๐ By manu โข Jul 5 โข 165
view article Article Introduction to Quantization cooked in ๐ค with ๐๐งโ๐ณ By merve โข Aug 25, 2023 โข 19
view article Article A Dive into Pretraining Strategies for Vision-Language Models Feb 3, 2023 โข 49
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? Paper โข 2403.14624 โข Published Mar 21 โข 51