merve's Collections
October 25 Releases
Updated Oct 25 • 7 upvotes
ibm-granite/granite-3.0-8b-instruct • Text Generation • Updated Oct 23 • 50.8k • 180
ibm-granite/granite-3.0-2b-instruct • Text Generation • Updated Oct 23 • 22.1k • 41
CohereForAI/aya-expanse-8b • Text Generation • Updated 29 days ago • 48.3k • 288
CohereForAI/aya-expanse-32b • Text Generation • Updated 27 days ago • 32.3k • 172
genmo/mochi-1-preview • Text-to-Video • Updated 6 days ago • 45.4k • 1.03k
rhymes-ai/Allegro • Text-to-Video • Updated 27 days ago • 2.06k • 238
LanguageBind/Open-Sora-Plan-v1.3.0 • Updated about 1 month ago • 16 • 50
jadechoghari/Ferret-UI-Llama8b • Image-Text-to-Text • Updated Oct 18 • 1.46k • 51
jadechoghari/Ferret-UI-Gemma2b • Image-Text-to-Text • Updated Oct 18 • 1.89k • 47
microsoft/OmniParser • Image-Text-to-Text • Updated 1 day ago • 12k • 1.37k
neuralwork/arxiver • Viewer • Updated 26 days ago • 63.4k • 2.94k • 350
neulab/Pangea-7B • Updated Oct 24 • 5.26k • 120
neulab/Pangea-7B-hf • Updated about 1 month ago • 2.77k • 7
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages • Space • Running • 48
stabilityai/stable-diffusion-3.5-large • Text-to-Image • Updated Oct 22 • 189k • 1.38k
stabilityai/stable-diffusion-3.5-large-turbo • Text-to-Image • Updated Oct 22 • 107k • 326
Marqo/marqo-GS-10M • Viewer • Updated Oct 23 • 9.81M • 2.54k • 45
vikhyatk/lofi • Viewer • Updated Oct 26 • 857k • 17.7k • 71
neulab/PangeaInstruct • Updated Oct 25 • 1.09k • 78