BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions • Paper • 2411.07461 • Published 17 days ago • 21
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models • Paper • 2408.08872 • Published Aug 16 • 97
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations • Paper • 2408.12590 • Published Aug 22 • 34
Salesforce/xgen-mm-phi3-mini-instruct-singleimg-r-v1.5 • Image-Text-to-Text • Updated Sep 12 • 208 • 15
Salesforce/xgen-mm-phi3-mini-instruct-dpo-r-v1.5 • Image-Text-to-Text • Updated Sep 16 • 1.66k • 16
Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5 • Image-Text-to-Text • Updated Sep 20 • 4.28k • 44
XGen-MM-1 models and datasets • Collection • A collection of all XGen-MM (Foundation LMM) models! • 15 items • Updated 24 days ago • 34