Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality Paper • 2410.05210 • Published Oct 7 • 10
Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition Paper • 2406.09388 • Published Jun 13