license: other
tags:
- stable-diffusion
- text-to-image
inference: false
Untitled Model Card
Japanese version is here.
Introduction
Untitled is the latent diffusion model made for AI art.
Usage
I recommend to use the model by Web UI. You can download the model here. Then, install Web UI by AUTIMATIC1111. I recommend to use the embeddings.
See also: WD 1.5 Beta - Release Notes
Examples
1girl, aqua eyes, baseball cap, blonde hair, closed mouth, earrings, green background, hat, hoop earrings, jewelry, looking at viewer, shirt, short hair, simple background, solo, upper body, yellow shirt,
(waifu, anime, exceptional, best aesthetic, new, newest, best quality, masterpiece, extremely detailed:1.2)
Negative prompt: nfixer, nfixernext, wdbadprompt, lowres, ((((bad anatomy)))), ((bad hands)), text, missing finger, extra digits, fewer digits, blurry, ((mutated hands and fingers)), (poorly drawn face), ((mutation)), ((deformed face)), (ugly), ((bad proportions)), ((extra limbs)), extra face, (double head), (extra head), ((extra feet)), monster, logo, cropped, worst quality, jpeg, humpbacked, long body, long neck, ((jpeg artifacts)), deleted, old, oldest, ((censored)), ((bad aesthetic)), (mosaic censoring, bar censor, blur censor)
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 3135505562, Size: 1024x1024, Model hash: e6eb25128a, Model: untitled
goddess, 1girl, upper body, ((symmetric)),
(waifu, anime, exceptional, best aesthetic, new, newest, best quality, masterpiece, extremely detailed:1.2)
Negative prompt: nfixer, nfixernext, wdbadprompt, lowres, ((((bad anatomy)))), ((bad hands)), text, missing finger, extra digits, fewer digits, blurry, ((mutated hands and fingers)), (poorly drawn face), ((mutation)), ((deformed face)), (ugly), ((bad proportions)), ((extra limbs)), extra face, (double head), (extra head), ((extra feet)), monster, logo, cropped, worst quality, jpeg, humpbacked, long body, long neck, ((jpeg artifacts)), deleted, old, oldest, ((censored)), ((bad aesthetic)), (mosaic censoring, bar censor, blur censor)
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 3703399597, Size: 1024x1024, Model hash: e6eb25128a, Model: untitled
goddess, 1girl, upper body, ((symmetric)),
(waifu, anime, exceptional, best aesthetic, new, newest, best quality, masterpiece, extremely detailed:1.2), embellish1
Negative prompt: nfixer, nfixernext, wdbadprompt, lowres, ((((bad anatomy)))), ((bad hands)), text, missing finger, extra digits, fewer digits, blurry, ((mutated hands and fingers)), (poorly drawn face), ((mutation)), ((deformed face)), (ugly), ((bad proportions)), ((extra limbs)), extra face, (double head), (extra head), ((extra feet)), monster, logo, cropped, worst quality, jpeg, humpbacked, long body, long neck, ((jpeg artifacts)), deleted, old, oldest, ((censored)), ((bad aesthetic)), (mosaic censoring, bar censor, blur censor)
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 3703399597, Size: 1024x1024, Model hash: e6eb25128a, Model: untitled
1girl in the ruin, full body,
(waifu, anime, exceptional, best aesthetic, new, newest, best quality, masterpiece, extremely detailed:1.2)
Negative prompt: nfixer, nfixernext, wdbadprompt, lowres, ((((bad anatomy)))), ((bad hands)), text, missing finger, extra digits, fewer digits, blurry, ((mutated hands and fingers)), (poorly drawn face), ((mutation)), ((deformed face)), (ugly), ((bad proportions)), ((extra limbs)), extra face, (double head), (extra head), ((extra feet)), monster, logo, cropped, worst quality, jpeg, humpbacked, long body, long neck, ((jpeg artifacts)), deleted, old, oldest, ((censored)), ((bad aesthetic)), (mosaic censoring, bar censor, blur censor)
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 3759230888, Size: 1024x1024, Model hash: e6eb25128a, Model: untitled
Model Details
Developed by: Robin Rombach, Patrick Esser, Alfred Increment
Model type: Diffusion-based text-to-image generation model
Language(s): English
License: CreativeML Open RAIL++-M-NC License, Fair AI Public License 1.0-SD
Model Description: This is a model that can be used to generate and modify images based on text prompts. It is a Latent Diffusion Model that uses a fixed, pretrained text encoder (OpenCLIP-ViT/H).
Resources for more information: GitHub Repository.
Cite as:
@InProceedings{Rombach_2022_CVPR, author = {Rombach, Robin and Blattmann, Andreas and Lorenz, Dominik and Esser, Patrick and Ommer, Bj\"orn}, title = {High-Resolution Image Synthesis With Latent Diffusion Models}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2022}, pages = {10684-10695} }
*This model card was written by: Alfred Increment and is based on the Stable Diffusion v2