Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
carlizor
's Collections
3D Generation
LLM
Embedding
LLM - Small
Video vision
To Read
Video
Image Segmentation
Image Generation (Fast)
Image Depth
Image caption
Audio
Image Generation
Image that talks
Image Enhance
Image Vision
Image editing
Image upscaling
Face Recognition
Multimodal
LLM - Medium
Image Vision
updated
14 days ago
Upvote
-
Salesforce/xgen-mm-phi3-mini-instruct-r-v1
Image-Text-to-Text
•
Updated
Sep 18
•
60.9k
•
184
AIDC-AI/Ovis1.6-Gemma2-9B
Image-Text-to-Text
•
Updated
17 days ago
•
7.58k
•
239
nvidia/NVLM-D-72B
Image-Text-to-Text
•
Updated
21 days ago
•
31.5k
•
730
microsoft/OmniParser
Image-Text-to-Text
•
Updated
6 days ago
•
5.5k
•
1.08k
Upvote
-
Share collection
View history
Collection guide
Browse collections