Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
1
Libraries
Datasets
Languages
Licenses
Other
Reset Tasks
Multimodal
Image-Text-to-Text
Visual Question Answering
Document Question Answering
Video-Text-to-Text
Any-to-Any
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Keypoint Detection
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Models
5,113
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
google/paligemma-3b-pt-896-keras
Image-Text-to-Text
•
Updated
16 days ago
•
35
•
2
google/paligemma-3b-mix-448-keras
Image-Text-to-Text
•
Updated
16 days ago
•
26
•
2
OpenGVLab/InternVL2-2B
Image-Text-to-Text
•
Updated
Sep 24
•
127k
•
56
OpenGVLab/InternVL2-4B
Image-Text-to-Text
•
Updated
Sep 24
•
87.1k
•
42
OpenGVLab/InternVL2-40B
Image-Text-to-Text
•
Updated
Sep 24
•
6.19k
•
93
qnguyen3/nanoLLaVA-1.5
Image-Text-to-Text
•
Updated
Sep 21
•
687
•
99
m-aliabbas1/Florence-2-FT-path-vqa
Image-Text-to-Text
•
Updated
Jun 30
•
20
•
1
OpenGVLab/InternVL2-2B-AWQ
Image-Text-to-Text
•
Updated
Sep 24
•
7.95k
•
14
mlx-community/dolphin-vision-72b-4bit
Image-Text-to-Text
•
Updated
Jul 4
•
89
•
6
openvla/openvla-7b-prismatic
Image-Text-to-Text
•
Updated
Jul 9
•
452
•
4
facebook/chameleon-30b
Image-Text-to-Text
•
Updated
Jul 30
•
921
•
82
llava-hf/llava-interleave-qwen-0.5b-hf
Image-Text-to-Text
•
Updated
Aug 20
•
5.96k
•
28
llava-hf/llava-interleave-qwen-7b-hf
Image-Text-to-Text
•
Updated
Aug 20
•
13.3k
•
23
yifeihu/TF-ID-base
Image-Text-to-Text
•
Updated
Jul 11
•
748
•
35
deepvk/llava-gemma-2b-lora
Image-Text-to-Text
•
Updated
Aug 13
•
149
•
8
royokong/e5-v
Image-Text-to-Text
•
Updated
14 days ago
•
11.9k
•
18
OpenGVLab/InternVL2-Llama3-76B
Image-Text-to-Text
•
Updated
Sep 24
•
211k
•
204
OpenGVLab/InternVL2-40B-AWQ
Image-Text-to-Text
•
Updated
Sep 24
•
732
•
17
OpenGVLab/InternVL2-26B-AWQ
Image-Text-to-Text
•
Updated
Sep 24
•
419
•
19
OpenGVLab/InternVL2-Llama3-76B-AWQ
Image-Text-to-Text
•
Updated
Sep 24
•
1.14k
•
24
llava-hf/llama3-llava-next-8b-hf
Image-Text-to-Text
•
Updated
Aug 16
•
75.1k
•
27
REILX/llava-Qwen2-7B-Instruct-Chinese-CLIP-v2
Image-Text-to-Text
•
Updated
21 days ago
•
74
•
4
markury/AndroGemma-alpha
Image-Text-to-Text
•
Updated
Jul 22
•
9
•
3
qresearch/llama-3.1-8B-vision-378
Image-Text-to-Text
•
Updated
Aug 6
•
689
•
33
deepvk/llava-saiga-8b
Image-Text-to-Text
•
Updated
Aug 13
•
211
•
15
yifeihu/TFT-ID-1.0
Image-Text-to-Text
•
Updated
Sep 29
•
529
•
102
natong19/InternVL2-8B-abliterated
Image-Text-to-Text
•
Updated
Jul 27
•
140
•
2
REILX/llava-Qwen2-7B-Instruct-Chinese-CLIP-v3
Image-Text-to-Text
•
Updated
21 days ago
•
52
•
3
osunlp/UGround
Image-Text-to-Text
•
Updated
27 days ago
•
22.6k
•
17
ucsahin/TraVisionLM-base
Image-Text-to-Text
•
Updated
Aug 9
•
83
•
22
Previous
1
...
4
5
6
7
8
...
100
Next