jimmycarter's picture
Update README.md (#2)
220a92c verified
metadata
license: creativeml-openrail-m
base_model: black-forest-labs/FLUX.1-dev
tags:
  - stable-diffusion
  - stable-diffusion-diffusers
  - text-to-image
  - diffusers
  - simpletuner
  - lora
  - template:sd-lora
inference: true
widget:
  - text: unconditional (blank prompt)
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_0_0.png
  - text: loona from helluva boss is eating a donut
    parameters:
      negative_prompt: blurry, cropped, ugly
    output:
      url: ./assets/image_1_0.png

flux-training-losercity-next

This is a LoRA derived from black-forest-labs/FLUX.1-dev.

Various Loonas

Example prompts:

prompts = [
    'In this scene from the animated series "Helluva Boss," Loona, the wolf-like receptionist of the Immediate Murder Professionals (I.M.P), is depicted leaning against a wall outside the office. She is casually engrossed in her phone, displaying her typical aloof and detached demeanor. Loona\'s appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes, complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts.',
    'Loona shrugs with an exasperated expression, her red eyes wide and frustrated, as she seemingly questions or challenges something said in the I.M.P office. Still from Helluva boss. Loona\'s appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes, complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts.',
    "A scene from the animated series \"Helluva Boss,\" set in the office. Loona, the wolf-like receptionist with white fur, black-tipped ears, and red eyes, is seated on a couch, facing towards the viewer. Loona\'s appearance is complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts. She holds a piece of paper that says,\"Welcome to Losercity, jerks\". In the background, the office has a striped wall pattern and visible damage on the ceiling, indicating a chaotic or rough environment. On the right side of the image, two imp characters appear to be engaged in conversation.",
    "Loona from Helluva Boss is dressed in an oversized taco costume, looking visibly irritated and embarrassed. Her red eyes convey her annoyance as she crosses her arms and glares to the side. Loona\'s appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes",
]

To use Loona in classic style, just add the following trigger sentence to your prompt:

Loona's appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes, complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts.

The main validation prompt used during training was:

loona from helluva boss is eating a donut

Validation settings

  • CFG: 3.5
  • CFG Rescale: 0.0
  • Steps: 15
  • Sampler: None
  • Seed: 42
  • Resolution: 1024

Note: The validation settings are not necessarily the same as the training settings.

You can find some example images in the following gallery:

Prompt
unconditional (blank prompt)
Negative Prompt
blurry, cropped, ugly
Prompt
loona from helluva boss is eating a donut
Negative Prompt
blurry, cropped, ugly

The text encoder was not trained. You may reuse the base model text encoder for inference.

Training settings

  • Training epochs: 428
  • Training steps: 3000
  • Learning rate: 0.0001
  • Effective batch size: 6
    • Micro-batch size: 6
    • Gradient accumulation steps: 1
    • Number of GPUs: 1
  • Prediction type: flow-matching
  • Rescaled betas zero SNR: False
  • Optimizer: AdamW, stochastic bf16
  • Precision: Pure BF16
  • Xformers: Enabled
  • LoRA Rank: 64
  • LoRA Alpha: None
  • LoRA Dropout: 0.1
  • LoRA initialisation style: default

Datasets

losercity

  • Repeats: 0
  • Total number of images: 42
  • Total number of aspect buckets: 1
  • Resolution: 1.0 megapixels
  • Cropped: True
  • Crop style: center
  • Crop aspect: square

Inference

import torch
from diffusers import DiffusionPipeline

model_id = 'black-forest-labs/FLUX.1-dev'
adapter_id = 'jimmycarter/flux-training-losercity-next'
pipeline = DiffusionPipeline.from_pretrained(model_id)
pipeline.load_lora_weights(adapter_id)

prompt = "loona from helluva boss is eating a donut"


pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu')
image = pipeline(
    prompt=prompt,
    num_inference_steps=15,
    generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(1641421826),
    width=1024,
    height=1024,
    guidance_scale=3.5,
).images[0]
image.save("output.png", format="PNG")