-
CompCap: Improving Multimodal Large Language Models with Composite Captions
Paper • 2412.05243 • Published • 18 -
LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment
Paper • 2412.04814 • Published • 45 -
MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
Paper • 2412.05237 • Published • 46 -
Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models
Paper • 2412.05939 • Published • 12
Exclibur
Exclibur
AI & ML interests
None yet
Recent Activity
updated
a collection
about 14 hours ago
Interest
updated
a collection
10 days ago
Interest
updated
a dataset
11 days ago
Exclibur/dibs-feature
Organizations
None yet
Collections
1
models
None public yet