metadata

license: other
tags:
  - stable-diffusion
  - text-to-image
inference: false

Untitled Model Card

Japanese version is here.

Introduction

Untitled is the latent diffusion model made for AI art.

Usage

I recommend to use the model by Web UI. You can download the model here. Then, install Web UI by AUTIMATIC1111. I recommend to use the embeddings.

Examples

1girl, aqua eyes, baseball cap, blonde hair, closed mouth, earrings, green background, hat, hoop earrings, jewelry, looking at viewer, shirt, short hair, simple background, solo, upper body, yellow shirt,
(waifu, anime, exceptional, best aesthetic, new, newest, best quality, masterpiece, extremely detailed:1.2)
Negative prompt: nfixer, nfixernext, wdbadprompt, lowres, ((((bad anatomy)))), ((bad hands)), text, missing finger, extra digits, fewer digits, blurry, ((mutated hands and fingers)), (poorly drawn face), ((mutation)), ((deformed face)), (ugly), ((bad proportions)), ((extra limbs)), extra face, (double head), (extra head), ((extra feet)), monster, logo, cropped, worst quality, jpeg, humpbacked, long body, long neck, ((jpeg artifacts)), deleted, old, oldest, ((censored)), ((bad aesthetic)), (mosaic censoring, bar censor, blur censor)
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 3135505562, Size: 1024x1024, Model hash: e6eb25128a, Model: untitled

goddess, 1girl, upper body, ((symmetric)),
(waifu, anime, exceptional, best aesthetic, new, newest, best quality, masterpiece, extremely detailed:1.2)
Negative prompt: nfixer, nfixernext, wdbadprompt, lowres, ((((bad anatomy)))), ((bad hands)), text, missing finger, extra digits, fewer digits, blurry, ((mutated hands and fingers)), (poorly drawn face), ((mutation)), ((deformed face)), (ugly), ((bad proportions)), ((extra limbs)), extra face, (double head), (extra head), ((extra feet)), monster, logo, cropped, worst quality, jpeg, humpbacked, long body, long neck, ((jpeg artifacts)), deleted, old, oldest, ((censored)), ((bad aesthetic)), (mosaic censoring, bar censor, blur censor)
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 3703399597, Size: 1024x1024, Model hash: e6eb25128a, Model: untitled

goddess, 1girl, upper body, ((symmetric)),
(waifu, anime, exceptional, best aesthetic, new, newest, best quality, masterpiece, extremely detailed:1.2), embellish1
Negative prompt: nfixer, nfixernext, wdbadprompt, lowres, ((((bad anatomy)))), ((bad hands)), text, missing finger, extra digits, fewer digits, blurry, ((mutated hands and fingers)), (poorly drawn face), ((mutation)), ((deformed face)), (ugly), ((bad proportions)), ((extra limbs)), extra face, (double head), (extra head), ((extra feet)), monster, logo, cropped, worst quality, jpeg, humpbacked, long body, long neck, ((jpeg artifacts)), deleted, old, oldest, ((censored)), ((bad aesthetic)), (mosaic censoring, bar censor, blur censor)
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 3703399597, Size: 1024x1024, Model hash: e6eb25128a, Model: untitled

1girl in the ruin, full body,
(waifu, anime, exceptional, best aesthetic, new, newest, best quality, masterpiece, extremely detailed:1.2)
Negative prompt: nfixer, nfixernext, wdbadprompt, lowres, ((((bad anatomy)))), ((bad hands)), text, missing finger, extra digits, fewer digits, blurry, ((mutated hands and fingers)), (poorly drawn face), ((mutation)), ((deformed face)), (ugly), ((bad proportions)), ((extra limbs)), extra face, (double head), (extra head), ((extra feet)), monster, logo, cropped, worst quality, jpeg, humpbacked, long body, long neck, ((jpeg artifacts)), deleted, old, oldest, ((censored)), ((bad aesthetic)), (mosaic censoring, bar censor, blur censor)
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 3759230888, Size: 1024x1024, Model hash: e6eb25128a, Model: untitled

Model Details

Developed by: Robin Rombach, Patrick Esser, Alfred Increment
Model type: Diffusion-based text-to-image generation model
Language(s): English
License: CreativeML Open RAIL++-M-NC License, AGPL-3.0
Model Description: This is a model that can be used to generate and modify images based on text prompts. It is a Latent Diffusion Model that uses a fixed, pretrained text encoder (OpenCLIP-ViT/H).
Resources for more information: GitHub Repository.

Cite as:

@InProceedings{Rombach_2022_CVPR,
    author    = {Rombach, Robin and Blattmann, Andreas and Lorenz, Dominik and Esser, Patrick and Ommer, Bj\"orn},
    title     = {High-Resolution Image Synthesis With Latent Diffusion Models},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2022},
    pages     = {10684-10695}
}

*This model card was written by: Alfred Increment and is based on the Stable Diffusion v2