e621-rising-v3 / README.md
hearmeneigh's picture
Update README.md
65c31ae
metadata
license: mit
datasets:
  - hearmeneigh/e621-rising-v3-curated
  - hearmeneigh/e621-rising-v3-finetuner
library_name: diffusers
pipeline_tag: text-to-image
tags:
  - anthro
  - furry
  - e621
  - nsfw
  - booru
  - imagebooru
  - imageboard
  - gelbooru
  - danbooru
  - rule34
  - not-for-all-audiences

NSFW

This model is not suitable for use by minors. The model can and will produce X-rated/NFSW content.

E621 Rising V3 (SDXL)

  • Furry / anthro base model trained with images (mainly) from E621
  • Guaranteed NSFW or your money back
  • Stable Diffusion XL 1.0 base model:
    • 1024x1024px
    • Trained with 11 epochs of 280,000 images each
    • Finetuned with 23 epochs of 40,000 images each
  • Compatible with:
  • Fully open source crawl, dataset, curation, and training process:

Examples

For more examples, continue here.

Birb Royal Stallion Assassin Neeked Neko

For more examples, continue here.

Training Procedure

Training legend

  • 160 images per batch (epoch variant)
  • 1024x1024px image size
  • Adam optimizer
    • Beta1 = 0.9
    • Beta2 = 0.999
    • Weight decay = 1e-2
    • Epsilon = 1e-08
  • Constant learning rate 4e-6
  • fp16 mixed precision
  • SNR gamma set to 5.0
  • Noise offset set to 0.07
  • cosine_with_restarts scheduler
  • 11 epochs of V3 curated dataset samples resized to < 1024x1024px (maintain aspect ratio)
  • 16 epochs of V3 finetuner dataset samples resized to < 1024x1024px (maintain aspect ratio)
  • 6 epochs of V3 finetuner dataset samples resized to < 1024x1024px (maintain aspect ratio, randomly drop 70% of tags)
  • 1 epoch of V3 finetuner dataset samples resized to < 1024x1024px (maintain aspect ratio, randomly drop 50% of tags) and learning rate set to 4e-5
  • Tags for each sample are shuffled for each epoch