
Collections


Collections including paper arxiv:2401.17093

SVGDreamer: Text Guided SVG Generation with Diffusion Model

Paper • 2312.16476 • Published Dec 27, 2023
DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models

Paper • 2306.14685 • Published Jun 26, 2023 • 1
Beyond Pixels: Exploring Human-Readable SVG Generation for Simple Images with Vision Language Models

Paper • 2311.15543 • Published Nov 27, 2023
StarVector: Generating Scalable Vector Graphics Code from Images

Paper • 2312.11556 • Published Dec 17, 2023 • 27

Learning Universal Predictors

Paper • 2401.14953 • Published Jan 26 • 19
Anything in Any Scene: Photorealistic Video Object Insertion

Paper • 2401.17509 • Published Jan 30 • 16
SymbolicAI: A framework for logic-based approaches combining generative models and solvers

Paper • 2402.00854 • Published Feb 1 • 19
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis

Paper • 2401.17093 • Published Jan 30 • 19

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Paper • 2312.16862 • Published Dec 28, 2023 • 30
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Paper • 2312.17172 • Published Dec 28, 2023 • 26
Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers

Paper • 2401.01974 • Published Jan 3 • 5
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations

Paper • 2401.01885 • Published Jan 3 • 27

Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding

Paper • 2306.06094 • Published Jun 9, 2023 • 1
IconShop: Text-Guided Vector Icon Synthesis with Autoregressive Transformers

Paper • 2304.14400 • Published Apr 27, 2023 • 4
VecFusion: Vector Font Generation with Diffusion

Paper • 2312.10540 • Published Dec 16, 2023 • 21
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis

Paper • 2401.17093 • Published Jan 30 • 19

image_ref_controller

blink7630/storyboard-sketch

Text-to-Image • Updated Nov 14, 2023 • 2.91k • 61
zoheb/sketch-scene

Viewer • Updated Oct 30, 2022 • 10k • 11.5k • 17
TencentARC/t2i-adapter-lineart-sdxl-1.0

Image-to-Image • Updated Sep 7, 2023 • 7.94k • 70
HD-Painter

Running on A10G • 124

Kosmos-2.5: A Multimodal Literate Model

Paper • 2309.11419 • Published Sep 20, 2023 • 50
Mirasol3B: A Multimodal Autoregressive model for time-aligned and contextual modalities

Paper • 2311.05698 • Published Nov 9, 2023 • 9
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 84
PolyMaX: General Dense Prediction with Mask Transformer

Paper • 2311.05770 • Published Nov 9, 2023 • 6
