-
VisionLLaMA: A Unified LLaMA Interface for Vision Tasks
Paper • 2403.00522 • Published • 44 -
MobileVLM V2: Faster and Stronger Baseline for Vision Language Model
Paper • 2402.03766 • Published • 12 -
MobileVLM : A Fast, Reproducible and Strong Vision Language Assistant for Mobile Devices
Paper • 2312.16886 • Published • 19 -
Lenna: Language Enhanced Reasoning Detection Assistant
Paper • 2312.02433 • Published • 2
team of GV, Meituan
mtgv
AI & ML interests
None yet
Organizations
Collections
1
models
23
mtgv/SiTLLaMA-B-2
Updated
mtgv/SiTLLaMA-XL-2
Updated
mtgv/SiTLLaMA-L-2
Updated
mtgv/SiTLLaMA-S-2
Updated
mtgv/VisionLLaMA-Large-MAE
Image Classification
•
Updated
•
1
mtgv/VisionLLaMA-Base-MAE
Image Classification
•
Updated
•
1
mtgv/DiTLLaMA-L-4
Updated
•
1
mtgv/DiTLLaMA-B-4
Updated
mtgv/DiTLLaMA-XL-4
Updated
mtgv/DiTLLaMA-XL-2
Updated
•
1