Hugging Face
Models: 361
Sort: Trending
Active filters: visual-question-answering, transformers
google/deplot • Visual Question Answering • Updated Sep 6, 2023 • 8.8k • 262
openbmb/MiniCPM-V-2 • Visual Question Answering • Updated Aug 6 • 7.56k • 428
OpenFace-CQUPT/Human_LLaVA • Visual Question Answering • Updated 27 days ago • 1.37k • 35
IDEA-FinAI/chartmoe • Visual Question Answering • Updated Sep 10 • 102 • 8
BAAI/Aquila-VL-2B-llava-qwen • Visual Question Answering • Updated 8 days ago • 3.08k • 49
dandelin/vilt-b32-finetuned-vqa • Visual Question Answering • Updated Aug 2, 2022 • 72.6k • 391
microsoft/git-base-vqav2 • Visual Question Answering • Updated Mar 9 • 593 • 16
Salesforce/blip-vqa-base • Visual Question Answering • Updated Dec 7, 2023 • 256k • 132
google/pix2struct-widget-captioning-large • Visual Question Answering • Updated Apr 10 • 40 • 15
google/matcha-chart2text-pew • Visual Question Answering • Updated Jul 22, 2023 • 371 • 28
google/matcha-chartqa • Visual Question Answering • Updated Jul 22, 2023 • 1k • 38
MBZUAI/Video-ChatGPT-7B • Visual Question Answering • Updated Jun 8, 2023 • 38
mlpc-lab/BLIVA_Vicuna • Visual Question Answering • Updated Aug 23, 2023 • 5
jalbrechts/vilt-finetuned-fashion-vqa • Visual Question Answering • Updated Oct 26, 2023 • 57 • 1
internlm/internlm-xcomposer2-vl-7b • Visual Question Answering • Updated Apr 12 • 3.83k • 79
openbmb/MiniCPM-V • Visual Question Answering • Updated Aug 6 • 2.99k • 135
surya47/medclip-roco • Visual Question Answering • Updated Feb 7 • 8 • 2
yanka9/vilt_finetuned_deepfashionVQA_v2 • Visual Question Answering • Updated Jul 1 • 187 • 3
openbmb/MiniCPM-Llama3-V-2_5-int4 • Visual Question Answering • Updated Jul 22 • 22.8k • 70
Lin-Chen/sharegpt4video-8b • Visual Question Answering • Updated Jul 1 • 1.24k • 42
jihadzakki/idefics2-8b-vqarad-delta • Visual Question Answering • Updated Jun 13 • 36 • 3
DAMO-NLP-SG/VideoLLaMA2-7B-16F • Visual Question Answering • Updated Aug 13 • 977 • 14
internlm/internlm-xcomposer2d5-7b • Visual Question Answering • Updated Jul 22 • 8.59k • 183
radna/Triton-InternVL2-2B • Visual Question Answering • Updated Jul 4 • 69 • 3
RussRobin/SpatialBot-3B-LoRA • Visual Question Answering • Updated Sep 5 • 6 • 3
RussRobin/SpatialBot-3B • Visual Question Answering • Updated Sep 10 • 252 • 9
DAMO-NLP-SG/VideoLLaMA2-72B • Visual Question Answering • Updated Aug 14 • 545 • 10
Aliayub1995/VideoLLaMA2-7B • Visual Question Answering • Updated Sep 4 • 19 • 1
erax-ai/EraX-VL-7B-V1.0 • Visual Question Answering • Updated Oct 22 • 8.42k • 27
DAMO-NLP-SG/VideoLLaMA2.1-7B-16F • Visual Question Answering • Updated Oct 21 • 3.56k • 7