hahminlew's picture
Update README.md
11bb5ae
|
raw
history blame
2.97 kB
metadata
license: creativeml-openrail-m
base_model: stabilityai/stable-diffusion-xl-base-1.0
dataset: hahminlew/kream-product-blip-captions
tags:
  - stable-diffusion-xl
  - stable-diffusion-xl-diffusers
  - text-to-image
  - diffusers
  - lora
inference: true

KREAM-Product-Generator

Latest version of the model has been released! Please try it: hahminlew/sdxl-kream-model-lora-2.0

KREAM-Product-Generator is a finetuned text-to-image generative model with a custom dataset collected from KREAM, one of the best online-resell market in Korea. Have fun creating realistic, high-quality fashion items!

You can see detailed instructions to finetune and inference the model in my github repository: fashion-product-generator.

Results

img

Prompts

  • outer, The Nike x Balenciaga down jacket black, a photography of a black down jacket with a logo on the chest.
  • top, (W) Balenciaga x Nike Slip Hoodie Dress Cream, a photography of a cream dress and a hood on.
  • bottom, Supreme Animal Print Baggy Jean Washed Indigo - 23FW, a photography of a dark blue jean with an animal printing on.
  • outer, The North Face x Supreme White Label Nuptse Down Jacket Cream Beige, a photography of a white puffer jacket with a red box logo on the front.
  • top, The Supreme x Stussy Oversized Cotton Black Hoodie, a photography of a black shirt with a hood on and a logo on the chest.
  • bottom, The IAB Studio x Stussy Dye Sweat Wooven Shorts, a photography of a short pants with a logo.

*A more precise model is now available. Please try to generate products through prompt engineering!

Inference

from diffusers import DiffusionPipeline
import torch

pipe = DiffusionPipeline.from_pretrained("stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16)
pipe.to("cuda")
pipe.load_lora_weights("hahminlew/sdxl-kream-model-lora")

prompt = "outer, The Nike x Balenciaga down jacket black, a photography of a black down jacket with a logo on the chest."

image = pipe(prompt, num_inference_steps=30, guidance_scale=7.5).images[0]
image.save("example.png")

LoRA text2image fine-tuning Info.

These are LoRA adaption weights for stabilityai/stable-diffusion-xl-base-1.0. The weights were fine-tuned on the hahminlew/kream-product-blip-captions dataset.

LoRA for the text encoder was enabled: False.

Special VAE used for training: madebyollin/sdxl-vae-fp16-fix.

Citation

If you use KREAM Product Dataset and the model in your research or projects, please cite it as:

@misc{lew2023kream,
      author = {Lew, Hah Min},
      title = {KREAM Product BLIP Captions},
      year={2023},
      howpublished= {\url{https://huggingface.co/datasets/hahminlew/kream-product-blip-captions/}}
}