metadata

tags:
  - text-to-image
  - layout-to-image
  - stable-diffusion
  - controlnet
license: agpl-3.0
language:
  - en

Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive (ICLR 2024)

Project Page | ArXiv | Code

This model repo contains checkpoints trained on Cityscapes and ADE20K datasets using methods proposed in ALDM. For usage instructions, please refer to our Github.

Model information

ade20k_step9.ckpt and cityscapes_step9.ckpt are pretrained diffusion model weights for inference.

encoder_epoch_50.pth, decoder_epoch_50_20cls.pth and decoder_epoch_50_151cls.pth are segmentation models used for discriminator intialization in training, which are adopted from pretrained UperNet101 here.