metadata
license: mit
datasets:
  - hearmeneigh/e621-rising-v3-curated
  - hearmeneigh/e621-rising-v3-finetuner
library_name: diffusers
pipeline_tag: text-to-image
tags:
  - anthro
  - furry
  - e621
  - nsfw
  - booru
  - imagebooru
  - imageboard
  - gelbooru
  - danbooru
  - rule34
  - not-for-all-audiences

NSFW

This model is not suitable for use by minors. The model can and will produce X-rated/NSFW content.

E621 Rising V3 (SDXL)

  • Furry / anthro base model trained mainly on images from E621
  • Guaranteed NSFW or your money back
  • Stable Diffusion XL 1.0 model:
    • 1024x1024px
    • Trained with 11 epochs of 280,000 images each
    • Finetuned with 16 epochs of 40,000 images each
  • Compatible with 🤗 diffusers
  • Compatible with stable-diffusion-webui
  • Fully open source crawl, dataset, curation, and training process
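
Since the model is listed as compatible with 🤗 diffusers, loading it might look roughly like the sketch below. The repository id `hearmeneigh/e621-rising-v3`, the comma-separated tag prompt format, the example tags, and the step count are assumptions for illustration, not values confirmed by this card. The heavy imports are deferred into `generate` so the prompt helper can be tried without a GPU.

```python
def build_prompt(tags):
    """Join booru-style tags into a comma-separated prompt (assumed format)."""
    return ", ".join(t.strip() for t in tags if t.strip())


def generate(prompt, model_id="hearmeneigh/e621-rising-v3", steps=30):
    """Generate one 1024x1024 image; model_id is an assumed repository id."""
    # Imported lazily: torch and diffusers are only needed for real generation.
    import torch
    from diffusers import StableDiffusionXLPipeline

    pipe = StableDiffusionXLPipeline.from_pretrained(
        model_id, torch_dtype=torch.float16
    )
    pipe = pipe.to("cuda")
    result = pipe(
        prompt=prompt, width=1024, height=1024, num_inference_steps=steps
    )
    return result.images[0]  # a PIL image


# Example (requires a CUDA GPU and downloads the model weights):
# image = generate(build_prompt(["anthro", "solo", "forest"]))
# image.save("sample.png")
```

The 1024x1024 output size matches the resolution stated above; other aspect ratios may work but are not covered here.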

Training Procedure

Training legend

  • 160 images per batch (epoch variant)
  • 1024x1024px image size
  • Adam optimizer
    • Beta1 = 0.9
    • Beta2 = 0.999
    • Weight decay = 1e-2
    • Epsilon = 1e-08
  • Constant learning rate 4e-6
  • fp16 mixed precision
  • SNR gamma set to 5.0
  • Noise offset set to 0.07
  • cosine_with_restarts scheduler
  • 11 epochs of V3 curated dataset samples, resized to fit within 1024x1024px (aspect ratio maintained)
  • 16 epochs of V3 finetuner dataset samples, resized to fit within 1024x1024px (aspect ratio maintained)
  • Tags for each sample are shuffled for each epoch
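
A few items in the legend above can be sketched in plain Python: fitting an image within 1024x1024 while keeping its aspect ratio, shuffling tags per epoch, the Min-SNR style loss weight that an SNR gamma of 5.0 commonly implies, and the step counts implied by the batch size. All function names and the seeding scheme are hypothetical, the weighting formula assumes epsilon prediction as in common diffusers training scripts, and none of this is taken from the actual training code.

```python
import random

MAX_SIDE = 1024    # from the legend: 1024x1024px image size
SNR_GAMMA = 5.0    # from the legend
BATCH_SIZE = 160   # from the legend


def fit_within(width, height, max_side=MAX_SIDE):
    """Scale (width, height) to fit inside max_side x max_side, keeping aspect ratio."""
    # Assumption: smaller images are left as-is rather than upscaled.
    scale = min(max_side / width, max_side / height, 1.0)
    return max(1, round(width * scale)), max(1, round(height * scale))


def shuffle_tags(tags, epoch, sample_id=0):
    """Return a tag order that differs per epoch but is reproducible."""
    rng = random.Random(f"{sample_id}:{epoch}")  # hypothetical seeding scheme
    shuffled = list(tags)
    rng.shuffle(shuffled)
    return shuffled


def min_snr_weight(snr, gamma=SNR_GAMMA):
    """Min-SNR loss weight for epsilon prediction: min(snr, gamma) / snr."""
    return min(snr, gamma) / snr


def steps_per_epoch(num_images, batch_size=BATCH_SIZE):
    """Optimizer steps per epoch, assuming no gradient accumulation."""
    return -(-num_images // batch_size)  # ceiling division
```

Under these assumptions, 280,000 images at 160 per batch works out to 1,750 steps per epoch (19,250 over 11 epochs), and 40,000 images to 250 steps per epoch (4,000 over 16 epochs).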