metadata

tags:
  - text-to-image
  - stable-diffusion
  - lora
  - diffusers
  - template:sd-lora
  - flux
  - flux dev
  - realism
widget:
  - text: >-
      phone photo five men playing a Medieval diplomacy game around a table on a
      couch in a living room at night in 2014
    output:
      url: images/ComfyUI_00855_.png
  - text: >-
      phone photo of two women in roman cosplay outfits holding a sign reading
      'Boreal-FD' on top of a dining room table in front of a crowd in New York
      at night
    output:
      url: images/ComfyUI_00822_.png
  - text: >-
      phone photos of three people performing a ritualistic sacrifice in a busy
      hotel lobby with a demon
    output:
      url: images/ComfyUI_00944_.png
  - text: >-
      closeup phone photo of a 25 year old women wearing a yoshi cosplay outfit
      while riding a zebra near a crowd while showing a piece of paper
      with'Boreal-FD' written on it at noon in the summer in a alley in new york
      city
    output:
      url: images/ComfyUI_00845_.png
  - text: >-
      phone photo of two men eating a full sad potato at a at a restaurant in
      2017 posted to reddit
    output:
      url: images/ComfyUI_01026_.png
base_model:
  - black-forest-labs/FLUX.1-dev
instance_prompt: photo

Boreal-FD

Model description

Work in Progress: This is a very early experimental LoRA for Flux-Dev. It is trained on the Boring Reality image datasets to shift Flux towards more realistic images.

Like most other generative image models, Flux-Dev is biased towards certain photographic aesthetics, such as shallow depth of field and centered posing, and it also carries a strong artwork influence. As a result, the model produces a very limited range of photos, which tends to mask how much knowledge it actually has.

The goal of these Boring Reality-trained LoRAs is not only to bring out better photorealistic images but also to push the model to show how much knowledge and information it can place in a single generated image.

Update 08/21: I am still exploring new ways to train this model/dataset. For the time being, the faded dot issue remains in these LoRAs, as these older models have done a better job of learning the concept than any of my subsequent runs. The new Schnell version may be released before this Dev model is updated. To simplify use of the model, I removed the 400-step weights to help resolve issues. If necessary, I will add the 400-step weights as a separate model, though you can probably get a roughly similar result by reducing the strength of the 1000-step version. I will try to separate models going forward, though it is not the best strategy when you need to manually choose between undertrained and overtrained LoRAs, on top of their strength, for each image.

Primary Goals for Boreal-FD

Reduce how often shallow depths of field appear in images
More dynamic poses
More realistic skin texture
More interesting backgrounds
Increase overall scene complexity

Additional Notes: These two Flux LoRAs are not expected to create very good images. Many results may be overfitted or distorted, and may have a slightly faded, dotted look for lesser-known concepts.

The 1000-step LoRA is more overfitted, with distortion and poor prompt understanding more likely to occur, but it may perform better on things like dynamic posing and skin texture.

You will want to experiment with both LoRAs, varying the LoRA strength between 0.5 and 2.0 and the guidance between 3.0 and 5.0, and testing many different seeds.
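
The experimentation described above can be organized as a small grid over LoRA strength, guidance, and seed. The sketch below only enumerates the combinations and builds a file name for each one; it does not run the model. The step sizes, the seed list, and the helper names (`sweep_settings`, `output_name`) are assumptions made for illustration, not part of the original workflow.

```python
from itertools import product

# Ranges taken from the notes above; the step sizes are an arbitrary choice.
LORA_STRENGTHS = [0.5, 1.0, 1.5, 2.0]
GUIDANCE_SCALES = [3.0, 3.5, 4.0, 4.5, 5.0]
SEEDS = [0, 1, 2, 3]  # hypothetical seeds; any integers work


def sweep_settings(strengths=LORA_STRENGTHS, guidances=GUIDANCE_SCALES, seeds=SEEDS):
    """Yield one settings dict per (strength, guidance, seed) combination."""
    for strength, guidance, seed in product(strengths, guidances, seeds):
        yield {"lora_strength": strength, "guidance_scale": guidance, "seed": seed}


def output_name(settings):
    """Build a file name that records the settings used for an image."""
    return "boreal_s{lora_strength:.1f}_g{guidance_scale:.1f}_seed{seed}.png".format(**settings)
```

Each settings dict can then be passed to whatever generation workflow you use, saving the result under `output_name(settings)` so that images stay comparable across strengths, guidance values, and seeds.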

As understanding of Flux develops, better workflows for these current models will come along, as well as newer Boreal-FD versions as the training improves.

Trigger words

Use the word "photo" in your prompt to trigger the image generation.
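
A small helper can make sure a prompt contains the trigger word before it is used. This is only a sketch: the function name `ensure_trigger` is hypothetical, and the case-insensitive substring check (which also accepts "photos") is an assumption about what counts as triggering.

```python
TRIGGER = "photo"  # trigger word from this model card


def ensure_trigger(prompt, trigger=TRIGGER):
    """Return the prompt unchanged if it already contains the trigger word
    (case-insensitive substring match); otherwise prepend '<trigger> of '."""
    if trigger.lower() in prompt.lower():
        return prompt
    return f"{trigger} of {prompt}"
```

For example, `ensure_trigger("two men eating at a restaurant")` adds the trigger word at the front, while a prompt that already starts with "phone photo" is returned as-is.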

Download model

Weights for this model are available in Safetensors format.

Download them in the Files & versions tab.
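
If you prefer diffusers to a ComfyUI workflow, loading the weights could look roughly like the sketch below. It is an illustration under stated assumptions, not a tested recipe for this model: `LORA_REPO` and `WEIGHT_NAME` are placeholders (check the Files & versions tab for the real names), the adapter name "boreal" and the 28 inference steps are arbitrary choices, and it assumes `torch`, `diffusers`, and `peft` are installed and that you have access to the FLUX.1-dev weights.

```python
# Placeholders, not real names: replace them with the repository id and the
# .safetensors file name shown in the Files & versions tab.
LORA_REPO = "your-username/your-boreal-repo"
WEIGHT_NAME = "your-boreal-weights.safetensors"


def load_boreal_pipeline(lora_repo, weight_name, lora_strength=1.0):
    """Load FLUX.1-dev and attach the LoRA at the given strength."""
    import torch
    from diffusers import FluxPipeline

    pipe = FluxPipeline.from_pretrained(
        "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
    )
    pipe.load_lora_weights(lora_repo, weight_name=weight_name, adapter_name="boreal")
    # The notes above suggest trying strengths between 0.5 and 2.0.
    pipe.set_adapters(["boreal"], adapter_weights=[lora_strength])
    pipe.enable_model_cpu_offload()  # reduces GPU memory use at some speed cost
    return pipe


def generate(pipe, prompt, guidance_scale=3.5, seed=0, steps=28):
    """Generate one image with a fixed seed; guidance of 3.0 to 5.0 is suggested above."""
    import torch

    generator = torch.Generator("cpu").manual_seed(seed)
    result = pipe(
        prompt,
        guidance_scale=guidance_scale,
        num_inference_steps=steps,  # assumed value, not from this model card
        generator=generator,
    )
    return result.images[0]


# Example usage (downloads the base model, so it is left commented out):
# pipe = load_boreal_pipeline(LORA_REPO, WEIGHT_NAME, lora_strength=1.0)
# generate(pipe, "phone photo of two men eating at a restaurant").save("boreal_test.png")
```

Keeping the seed explicit makes it easier to compare the same prompt across different strengths and guidance values.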