hollowstrawberry committed ec2c447 (parent: 82cc7ad): Update README.md
- stable-diffusion-webui
---

# Index <a name="sdguide-index"></a>

* [Introduction](#sdguide-intro)
* [Installation](#sdguide-install)
* [Getting Started](#sdguide-start)
   1. [Edit your starting parameters](#sdguide-params)
   1. [Getting a model](#sdguide-model)
   1. [Getting a VAE](#sdguide-vae)
   1. [Launching and settings](#sdguide-launch)
   1. [Prompts](#sdguide-prompt)
   1. [Generation parameters](#sdguide-gen)
* [Extensions](#sdguide-extensions)
* [Loras](#sdguide-lora)
* [Upscalers](#sdguide-upscale)
* ControlNet
* Tips for training character Loras

# Introduction <a name="sdguide-intro"></a>[▲](#sdguide-index)

Stable Diffusion is a very powerful AI image generation program you can run on your own home computer. It uses "models", which function like the brain of the AI and can make almost anything, provided someone has trained one to do it. The biggest uses are anime art, photorealism, and NSFW content.

# Installation <a name="sdguide-install"></a>[▲](#sdguide-index)

* __**Requirements:**__ To run Stable Diffusion on your own computer you'll need at least 16 GB of RAM and 4 GB of VRAM. I will only cover the case where you're running Windows 10/11 with an NVIDIA graphics card of the 16XX, 20XX or 30XX series (though 10XX cards also work). AMD users are out of luck, as getting it to work on those cards is very inconsistent. The same goes for Linux and Mac, though it is possible.

# Getting Started <a name="sdguide-start"></a>[▲](#sdguide-index)

Before generating any images, here are some useful steps you can follow to improve your experience.

1. **Edit your starting parameters** <a name="sdguide-params"></a>[▲](#sdguide-index)

   If you're using the colab, skip this step.

   * If you're using the original stable-diffusion-webui, you can add these parameters by editing your webui-user.bat, right next to `set COMMANDLINE_ARGS=`
   * Full list of possible parameters [here](https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Command-Line-Arguments-and-Settings)
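As a sketch only (check the full list linked above before copying), a webui-user.bat line that enables a few common options might look like this. `--xformers` speeds up generation on NVIDIA cards, `--medvram` helps 4 GB cards, and `--autolaunch` opens the browser for you:

```bat
set COMMANDLINE_ARGS=--xformers --medvram --autolaunch
```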

1. **Getting a model** <a name="sdguide-model"></a>[▲](#sdguide-index)

   The model is the brain of your AI, designed to produce certain types of images. There are many options, most of which are on [civitai](https://civitai.com). But which should you choose? These are my recommendations:
   * For anime, [7th Heaven Mix](https://civitai.com/models/4669/corneos-7th-heaven-mix) has a nice aesthetic similar to anime movies, while [Abyss Orange Mix 3](https://civitai.com/models/9942/abyssorangemix3-aom3) *(__Note:__ scroll down and choose the AOM3 option)* offers more realism in the form of advanced lighting and softer shading, as well as more lewdness. I remixed the two options above into [Heaven Orange Mix](https://civitai.com/models/14305/heavenorangemix). While AOM3 is extremely capable for NSFW, the popular [Grapefruit](https://civitai.com/models/2583/grapefruit-hentai-model) hentai model may also fit your needs.

   Please note that models in the `.safetensors` format are safe to use, while `.ckpt` files **may** contain viruses. Be careful.

1. **Getting a VAE** <a name="sdguide-vae"></a>[▲](#sdguide-index)

   Most models don't come with a VAE built in. The VAE is a small separate model, which "converts your image from AI format into human format". Without it, you'll get faded colors and ugly eyes, among other things.

   *Colab:* You will have to place it in your Google Drive, in `MyDrive/sd/stable-diffusion-webui/models/VAE`.

1. **Launching and settings** <a name="sdguide-launch"></a>[▲](#sdguide-index)

   It is finally time to launch the WebUI.
   *Launcher:* Press the button on the launcher and wait patiently for it to start. It will then open the interface in your browser. It's like a website, but running on your computer.

   * In the *User Interface* section, scroll down to **Quicksettings list** and change it to `sd_model_checkpoint, sd_vae`.
   * Scroll back up, click the big orange **Apply settings** button, then **Reload UI** next to it. You can now change your model as well as your VAE from the top of the page at any time.

1. **Prompts** <a name="sdguide-prompt"></a>[▲](#sdguide-index)

   On the first tab, **txt2img**, you'll be making most of your images. This is where you'll find your *prompt* and *negative prompt*.
   Stable Diffusion is not like Midjourney or other popular image generation software: you can't just ask it for what you want and get a good image. You have to be specific. *Very* specific.

   Note that when you surround something in `(parentheses)`, it will have more emphasis or **weight** in your resulting image, equal to `1.1`. The normal weight is 1, and each additional pair of parentheses multiplies it by another 1.1. You can also specify the weight yourself, like this: `(full body:1.4)`. You can go below 1 to de-emphasize a word as well: `[brackets]` multiply by 0.9, but you still use normal parentheses to go lower, like `(this:0.5)`.
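The multiplication rule above can be sketched in a few lines (a toy calculation of my own, not the WebUI's actual prompt parser):

```python
from typing import Optional

def emphasis_weight(parens: int = 0, brackets: int = 0,
                    explicit: Optional[float] = None) -> float:
    """Effective weight of a word: each pair of (parentheses) multiplies
    by 1.1, each pair of [brackets] by 0.9, and an explicit
    (word:1.4)-style value overrides both."""
    if explicit is not None:
        return explicit
    return round(1.0 * 1.1 ** parens * 0.9 ** brackets, 4)
```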

1. **Generation parameters** <a name="sdguide-gen"></a>[▲](#sdguide-index)

   * *Sampling method:* This dictates how your image is formulated, and each option produces different results. The default of `Euler a` is almost always the best, but `DPM++ 2M Karras` and `DPM++ SDE Karras` also give very good results.
   * *Sampling steps:* These are "calculated" beforehand, so more steps don't always mean more detail. I always go with 30; anywhere from 20 to 50 can give good results.

   * *Seed:* A number that guides the creation of your image. The same seed with the same prompt and parameters produces almost exactly the same image every time.

   *Hires. fix:* Lets you create larger images without distortion. Usually used at 2x scale. When selected, more options appear:
   * *Upscaler:* The algorithm to upscale with. `Latent` and its variations produce creative results, and you may also like `R-ESRGAN 4x+` and its anime version. Also see [Upscalers](#sdguide-upscale).
   * *Hires steps:* I recommend at least half as many as your sampling steps. Higher values aren't always better.
   * *Denoising strength:* The most important parameter. Near 0, no detail will be added to the image. Near 1, the image will be changed completely. I recommend something between 0.2 and 0.6 depending on the image.
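If you ever want to script these same parameters instead of clicking through the UI, the WebUI also ships an optional API (started with the `--api` parameter). A sketch of a txt2img payload using the settings above; the endpoint and field names are my understanding of the project's API and should be double-checked against your version:

```python
import json
# from urllib.request import Request, urlopen  # uncomment to actually send it

payload = {
    "prompt": "masterpiece, best quality, 1girl, standing in a field of flowers",
    "negative_prompt": "lowres, bad anatomy, worst quality",
    "sampler_name": "DPM++ 2M Karras",
    "steps": 30,
    "cfg_scale": 7,
    "width": 512,
    "height": 512,
    "seed": 1234567890,          # a fixed seed gives near-identical reruns
    "enable_hr": True,           # Hires. fix
    "hr_scale": 2,
    "hr_upscaler": "Latent",
    "hr_second_pass_steps": 15,  # half of the sampling steps
    "denoising_strength": 0.4,
}
body = json.dumps(payload).encode()
# req = Request("http://127.0.0.1:7860/sdapi/v1/txt2img", data=body,
#               headers={"Content-Type": "application/json"})
# image_b64 = json.loads(urlopen(req).read())["images"][0]
```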

# Extensions <a name="sdguide-extensions"></a>[▲](#sdguide-index)

*Stable Diffusion WebUI* supports extensions that add extra functionality and quality of life. They can be added by going into the **Extensions** tab, then **Install from URL**, and pasting the links found here or elsewhere. Click *Install* and wait for it to finish, then go to **Installed** and click *Apply and restart UI*.

# Loras <a name="sdguide-lora"></a>[▲](#sdguide-index)

LoRA, or *Low-Rank Adaptation*, is a form of **Extra Network** and the latest technology that lets you append a smaller model to any of your full models. They are similar to embeddings, one of which you might've seen [earlier](#sdguide-prompt), but Loras are larger and often more capable. Technical details omitted.

Loras can represent a character, an artstyle, poses, clothes, or even a human face (though I do not endorse this). Models are usually capable enough for general work, but when it comes to specific details with few existing examples, they fall short. That's where Loras come in. They can be downloaded from [civitai](https://civitai.com) and are 144 MB by default, though they can go as low as 1 MB or as high as several hundred MB. Bigger Loras are not necessarily better. They come in `.safetensors` format, same as models.

Place your lora files in the `stable-diffusion-webui/models/Lora` folder. Then, look for the 🎴 *Show extra networks* button below the big orange Generate button. It will open a new section. Click on the Lora tab and press the **Refresh** button, and your loras should appear. When you click a Lora in that menu it will get added to your prompt, looking like this: `<lora:filename:1>`. The start is always the same. The filename will be the exact filename in your system, without the `.safetensors` extension. Finally, the number is the weight, like we saw in [Prompts](#sdguide-prompt). Most Loras work between 0.5 and 1 weight, and too high values might "fry" your image.
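As a toy illustration of that tag format (a helper of my own, not part of the WebUI, and the filename is hypothetical):

```python
def lora_tag(filename: str, weight: float = 1.0) -> str:
    """Build the prompt tag for a Lora: <lora:filename:weight>."""
    return f"<lora:{filename}:{weight}>"

# Appending a hypothetical Lora at 0.8 weight to a prompt:
prompt = "masterpiece, 1girl, " + lora_tag("thickerLinesAnimeStyle", 0.8)
```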

An example of a Lora is [Thicker Lines Anime Style](https://civitai.com/models/13910/thicker-lines-anime-style-lora-mix), which is perfect if you want your images to look more like traditional anime.

# Upscalers <a name="sdguide-upscale"></a>[▲](#sdguide-index)

You can download additional upscalers and put them in your `stable-diffusion-webui/models/ESRGAN` folder.