Molmo Collection Artifacts for open multimodal language models. โข 5 items โข Updated 3 days ago โข 276
LLaVa-1.5 Collection LLaVa-1.5 is a series of vision-language models (VLMs) trained on a variety of visual instruction datasets. โข 3 items โข Updated Mar 18 โข 7