GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation Paper • 2312.04557 • Published Dec 7, 2023 • 12
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models Paper • 2312.04410 • Published Dec 7, 2023 • 14
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding Paper • 2312.04461 • Published Dec 7, 2023 • 57
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively Paper • 2401.02955 • Published Jan 5 • 20
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation Paper • 2312.16272 • Published Dec 26, 2023 • 6
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion Paper • 2312.16486 • Published Dec 27, 2023 • 6
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models Paper • 2411.07126 • Published about 1 month ago • 28
Motion Control for Enhanced Complex Action Video Generation Paper • 2411.08328 • Published 29 days ago • 4
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation Paper • 2411.07975 • Published 30 days ago • 26
Pyramidal Flow Matching for Efficient Video Generative Modeling Paper • 2410.05954 • Published Oct 8 • 38
Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation Paper • 2412.04432 • Published 7 days ago • 11
LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment Paper • 2412.04814 • Published 6 days ago • 38
Mind the Time: Temporally-Controlled Multi-Event Video Generation Paper • 2412.05263 • Published 6 days ago • 8
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows Paper • 2412.01169 • Published 10 days ago • 10