Optimizing diffusion models
A list of papers focused on optimizing text-to-image (T2I) diffusion models: fewer sampling timesteps, architectural compression, and more.
Progressive Distillation for Fast Sampling of Diffusion Models
Paper • 2202.00512 • Published • 1
Note: Introduces the idea of progressively distilling a diffusion model into a student that requires fewer timesteps to sample from.
On Distillation of Guided Diffusion Models
Paper • 2210.03142 • Published
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation
Paper • 2309.06380 • Published • 32
Note: Combines the benefits of timestep distillation and faster ODE-solver-based sampling. Custom diffusers pipeline: https://github.com/huggingface/diffusers/blob/main/examples/community/instaflow_one_step.py.
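A minimal usage sketch of that community pipeline. The checkpoint id below is an assumption for illustration; check the Hub for the exact InstaFlow repo name.

```python
import torch
from diffusers import DiffusionPipeline

# Load the InstaFlow community pipeline; the repo id is assumed, verify on the Hub.
pipe = DiffusionPipeline.from_pretrained(
    "XCLiu/instaflow_0_9B_from_sd_1_5",
    torch_dtype=torch.float16,
    custom_pipeline="instaflow_one_step",
).to("cuda")

# One-step sampling: a single forward pass through the distilled model.
image = pipe("a photo of a corgi on a beach", num_inference_steps=1, guidance_scale=0.0).images[0]
image.save("instaflow.png")
```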
Consistency Models
Paper • 2303.01469 • Published • 8
Note: Introduces a new distillation framework that learns to map any point on a probability flow ordinary differential equation (ODE) trajectory back to its origin. CMs also allow 1-4 step sampling. Play with them: https://huggingface.co/docs/diffusers/api/pipelines/consistency_models. Unconditional consistency model training: https://github.com/huggingface/diffusers/tree/main/examples/research_projects/consistency_training.
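A minimal sketch of one-step sampling with `ConsistencyModelPipeline`, following the pattern in the diffusers docs (the ImageNet-64 checkpoint id comes from those docs):

```python
import torch
from diffusers import ConsistencyModelPipeline

# Consistency-distilled ImageNet-64 checkpoint referenced in the diffusers docs.
pipe = ConsistencyModelPipeline.from_pretrained(
    "openai/diffusers-cd_imagenet64_l2", torch_dtype=torch.float16
).to("cuda")

# One-step sampling; class_labels selects the ImageNet class to generate.
image = pipe(num_inference_steps=1, class_labels=145).images[0]
image.save("cm_sample.png")
```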
Improved Techniques for Training Consistency Models
Paper • 2310.14189 • Published
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Paper • 2310.04378 • Published • 19
Note: Extends Consistency Models by operating in the latent space. Also introduces Latent Consistency Fine-tuning for training on custom datasets. Play with them: https://huggingface.co/docs/diffusers/main/en/api/pipelines/latent_consistency_models. Train your own: https://github.com/huggingface/diffusers/tree/main/examples/consistency_distillation.
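A minimal sketch of few-step inference with an LCM checkpoint (repo id taken from the diffusers docs):

```python
import torch
from diffusers import DiffusionPipeline

# LCM-distilled Dreamshaper v7; the repo configures the LCM scheduler automatically.
pipe = DiffusionPipeline.from_pretrained(
    "SimianLuo/LCM_Dreamshaper_v7", torch_dtype=torch.float16
).to("cuda")

# LCMs only need ~2-8 inference steps.
image = pipe(
    "Self-portrait oil painting, a beautiful cyborg with golden hair, 8k",
    num_inference_steps=4,
    guidance_scale=8.0,
).images[0]
image.save("lcm.png")
```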
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
Paper • 2311.05556 • Published • 81
Note: Builds on top of LCM and acts as a plugin for pre-trained Stable Diffusion models to enable faster few-step inference. Play with them: https://huggingface.co/docs/diffusers/main/en/using-diffusers/inference_with_lcm_lora. Train your own: https://github.com/huggingface/diffusers/tree/main/examples/consistency_distillation.
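A minimal sketch of plugging the LCM-LoRA into SDXL, following the diffusers docs:

```python
import torch
from diffusers import DiffusionPipeline, LCMScheduler

pipe = DiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    torch_dtype=torch.float16,
    variant="fp16",
).to("cuda")

# Swap in the LCM scheduler and load the LCM-LoRA weights on top of the base model.
pipe.scheduler = LCMScheduler.from_config(pipe.scheduler.config)
pipe.load_lora_weights("latent-consistency/lcm-lora-sdxl")

# 4-step inference with a low guidance scale, as recommended for LCM-LoRA.
image = pipe(
    "close-up photography of an old man standing in the rain at night",
    num_inference_steps=4,
    guidance_scale=1.0,
).images[0]
image.save("lcm_lora.png")
```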
UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
Paper • 2311.09257 • Published • 45
Note: Enables one-step text-to-image sampling with a single GAN-style forward pass instead of an iterative reverse process, by hybridizing diffusion training with a GAN objective. Community implementation: https://github.com/huggingface/diffusers/pull/6133.
Adversarial Diffusion Distillation
Paper • 2311.17042 • Published • 2
Note: Introduces a way to do GAN-style training to enable few-step inference. Combining GANs and diffusion isn't new; refer to this paper for references. This paper made SD-Turbo and SDXL-Turbo possible. Play with them: https://huggingface.co/docs/diffusers/using-diffusers/sdxl_turbo. A training script is also being added here: https://github.com/huggingface/diffusers/pull/6303.
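A minimal sketch of single-step inference with SDXL-Turbo, following the diffusers docs:

```python
import torch
from diffusers import AutoPipelineForText2Image

pipe = AutoPipelineForText2Image.from_pretrained(
    "stabilityai/sdxl-turbo", torch_dtype=torch.float16, variant="fp16"
).to("cuda")

# SDXL-Turbo is trained for 1-4 step sampling without classifier-free guidance.
image = pipe(
    "A cinematic shot of a baby racoon wearing an intricate italian priest robe",
    num_inference_steps=1,
    guidance_scale=0.0,
).images[0]
image.save("sdxl_turbo.png")
```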
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation
Paper • 2403.12015 • Published • 64
Note: Successor of ADD (Adversarial Diffusion Distillation).
On Architectural Compression of Text-to-Image Diffusion Models
Paper • 2305.15798 • Published • 4
Note: Popularly known as "BK-SDM". Up to this point, the list has mainly focused on distillation for few-step inference; those techniques don't necessarily address architectural compression. BK-SDM models (compatible with diffusers): https://huggingface.co/nota-ai
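Because the compressed checkpoints keep the SD v1.x interfaces, they load with the standard pipeline; a sketch assuming the `nota-ai/bk-sdm-small` repo id:

```python
import torch
from diffusers import StableDiffusionPipeline

# BK-SDM removes blocks from the U-Net but keeps the SD architecture and interfaces,
# so the standard pipeline works unchanged. Repo id assumed from the nota-ai org.
pipe = StableDiffusionPipeline.from_pretrained(
    "nota-ai/bk-sdm-small", torch_dtype=torch.float16
).to("cuda")

image = pipe("a golden retriever puppy playing in a sunny meadow").images[0]
image.save("bk_sdm.png")
```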
Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss
Paper • 2401.02677 • Published • 22
Note: Builds on top of BK-SDM. This report gives an overview of what made the SSD-1B and Vega families of architecturally compressed models work so well. Find the models: https://huggingface.co/Segmind
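A sketch of loading one of these compressed SDXL variants, assuming the `segmind/SSD-1B` repo id:

```python
import torch
from diffusers import StableDiffusionXLPipeline

# SSD-1B is an architecturally compressed SDXL, so the SDXL pipeline loads it directly.
pipe = StableDiffusionXLPipeline.from_pretrained(
    "segmind/SSD-1B", torch_dtype=torch.float16, variant="fp16"
).to("cuda")

image = pipe(
    "an astronaut riding a green horse",
    negative_prompt="ugly, blurry, poor quality",
).images[0]
image.save("ssd_1b.png")
```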
SDXL-Lightning: Progressive Adversarial Diffusion Distillation
Paper • 2402.13929 • Published • 28
Note: Combines progressive and adversarial distillation to achieve a balance between quality and mode coverage. Available checkpoints: https://huggingface.co/ByteDance/SDXL-Lightning (compatible with `diffusers`).
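A sketch adapted from the pattern on the SDXL-Lightning model card; the checkpoint filename is assumed from that card, so verify it on the Hub:

```python
import torch
from diffusers import StableDiffusionXLPipeline, UNet2DConditionModel, EulerDiscreteScheduler
from huggingface_hub import hf_hub_download
from safetensors.torch import load_file

base = "stabilityai/stable-diffusion-xl-base-1.0"
repo = "ByteDance/SDXL-Lightning"
ckpt = "sdxl_lightning_4step_unet.safetensors"  # filename assumed from the model card

# Load the distilled 4-step UNet weights into a fresh SDXL UNet, then build the pipeline.
unet = UNet2DConditionModel.from_config(base, subfolder="unet").to("cuda", torch.float16)
unet.load_state_dict(load_file(hf_hub_download(repo, ckpt), device="cuda"))
pipe = StableDiffusionXLPipeline.from_pretrained(
    base, unet=unet, torch_dtype=torch.float16, variant="fp16"
).to("cuda")

# Trailing timestep spacing is recommended for the Lightning checkpoints.
pipe.scheduler = EulerDiscreteScheduler.from_config(
    pipe.scheduler.config, timestep_spacing="trailing"
)

image = pipe("A girl smiling", num_inference_steps=4, guidance_scale=0).images[0]
image.save("sdxl_lightning.png")
```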
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis
Paper • 2404.13686 • Published • 27
Note: `diffusers`-compatible implementation: https://hyper-sd.github.io/.
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
Paper • 2404.14507 • Published • 21
Note: `diffusers`-integrated implementation: https://research.nvidia.com/labs/toronto-ai/AlignYourSteps/.
SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions
Paper • 2403.16627 • Published • 20
Token Merging for Fast Stable Diffusion
Paper • 2303.17604 • Published
Note: Gradually merges redundant tokens, thereby accelerating inference. Supported in diffusers: https://huggingface.co/docs/diffusers/main/en/optimization/tome.
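A minimal sketch of patching a pipeline with Token Merging via the `tomesd` library, as shown in the diffusers docs:

```python
import torch
import tomesd  # pip install tomesd
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# Patch the pipeline so redundant tokens are merged inside the attention blocks;
# ratio controls how aggressively tokens are merged (speed vs. fidelity trade-off).
tomesd.apply_patch(pipe, ratio=0.5)

image = pipe("a photo of an astronaut riding a horse on mars").images[0]
image.save("tome.png")
```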
DeepCache: Accelerating Diffusion Models for Free
Paper • 2312.00858 • Published • 21
Note: Shows how UNet features computed in the earlier reverse-diffusion steps can be largely reused during the later steps, saving computation and thereby accelerating inference. Supported in diffusers: https://huggingface.co/docs/diffusers/main/en/optimization/deepcache.
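A minimal sketch of enabling DeepCache on a pipeline via the `DeepCache` helper, as shown in the diffusers docs:

```python
import torch
from diffusers import StableDiffusionPipeline
from DeepCache import DeepCacheSDHelper  # pip install DeepCache

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# Cache high-level UNet features and reuse them across nearby denoising steps.
helper = DeepCacheSDHelper(pipe=pipe)
helper.set_params(cache_interval=3, cache_branch_id=0)
helper.enable()

image = pipe("a photo of an astronaut riding a horse on mars").images[0]
image.save("deepcache.png")
```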
thibaud/sdxl_dpo_turbo
Text-to-Image • Updated • 522 • 83
Note: What if you could combine two models from the same family? Here, the SDXL-Turbo and SDXL-DPO parameters are averaged to obtain a new model that inherits alignment benefits from DPO and few-step sampling from Turbo.
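A rough sketch of how such a parameter average can be computed with diffusers. The DPO repo id below is an assumption for illustration, and the actual merge recipe behind this checkpoint may differ:

```python
import torch
from diffusers import UNet2DConditionModel

# Load the two UNets to merge; the DPO repo id is assumed for illustration.
unet_turbo = UNet2DConditionModel.from_pretrained(
    "stabilityai/sdxl-turbo", subfolder="unet", torch_dtype=torch.float16
)
unet_dpo = UNet2DConditionModel.from_pretrained(
    "mhdang/dpo-sdxl-text2image-v1", subfolder="unet", torch_dtype=torch.float16
)

# Simple 50/50 parameter average, written back into the Turbo UNet.
merged_state = unet_turbo.state_dict()
for name, dpo_param in unet_dpo.state_dict().items():
    merged_state[name] = 0.5 * merged_state[name] + 0.5 * dpo_param
unet_turbo.load_state_dict(merged_state)

# unet_turbo can now be passed to StableDiffusionXLPipeline via its `unet` argument.
```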
Deci-early-access/DeciDiffusion-v2-0
Text-to-Image • Updated • 11 • 3
Note: DeciDiffusion-v2 is faster than Stable Diffusion v1.5 while producing image quality of a similar caliber, thanks to an improved training recipe. Refer to the model card to learn more.
jasperai/flash-sd3
Text-to-Image • Updated • 2.11k • 107