TinyLLaVA: A Framework of Small-scale Large Multimodal Models
Baichuan Zhou
bczhou
AI & ML interests
Computer Vision
Recent Activity
New activity
23 days ago
bczhou/LOKI:Add dataset card, link to paper
upvoted
a
paper
about 1 month ago
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large
Vision-Language Models
upvoted
a
paper
about 1 month ago
AutoTrain: No-code training for state-of-the-art models
Organizations
Collections
1
spaces
1
models
8
bczhou/tiny-llava-v1-hf
Image-Text-to-Text
•
Updated
•
3.04k
•
53
bczhou/TinyLLaVA-2.0B
Image-Text-to-Text
•
Updated
•
63
•
5
bczhou/TinyLLaVA-1.5B
Image-Text-to-Text
•
Updated
•
244
•
16
bczhou/TinyLLaVA-3.1B-Pretrain
Text Generation
•
Updated
•
8
bczhou/TinyLLaVA-3.1B
Text Generation
•
Updated
•
268
•
25
bczhou/TinyLLaVA-2.0B-SigLIP
Updated
•
114
•
1
bczhou/TinyLLaVA-1.5B-SigLIP
Updated
•
94
•
1
bczhou/TinyLLaVA-3.1B-SigLIP
Updated
•
294
•
4
datasets
7
bczhou/LOKI
Preview
•
Updated
•
52
bczhou/UrBench
Viewer
•
Updated
•
11.6k
•
34
bczhou/CityBench-SubTasks
Viewer
•
Updated
•
12.8k
•
32
bczhou/SyntheticBench-Videos
Viewer
•
Updated
•
264
•
33
bczhou/CityBench-v0.3
Viewer
•
Updated
•
9.71k
•
29
bczhou/CityBench-v0.2
Viewer
•
Updated
•
9.71k
•
30
bczhou/CityVQA-v0.2
Viewer
•
Updated
•
2.5k
•
32
•
1