Ai-tensa
/

FlexWaifu

stable-diffusion

Model card Files Files and versions Community

FlexWaifu / README.md

Ai-tensa's picture

upload images

ddbd7b5 over 1 year ago

|

3.17 kB

	---
	language:
	- en
	tags:
	- stable-diffusion
	- text-to-image
	license: creativeml-openrail-m
	inference: false

	---

	# Flex Waifu

	FlexWaifu is a fine-tuned model from Waifu Diffusion 1.3 for producing high resolution or different aspect ratio images.

	## Model Description

	This model was created by merging two original LoRAs of [testLoRAs](https://huggingface.co/Ai-tensa/testLoRAs) into WD1.3.

	\| Model Name \| Recipe \|
	\| ---------- \| ------------------------------------------ \|
	\| FlexWaifu \| WD1.3 + 2.0 * hires_test_a + smooth_test_a \|

	It is just a merged model.
	While this model is likely to produce good generation at medium resolution, consider using LoRAs of [testLoRAs](https://huggingface.co/Ai-tensa/testLoRAs) if it does not produce well.

	## Flex Waifu Rainbow

	This model is further fine-tuned from FlexWaifu with ~17k nijijourneyv5 tagged images of various authors published on the Internet.
	It is merged from six dim 8 LoRAs made in various settings, and FWRLoRA is the merged LoRA (dim 48).
	Most LoRAs were fine-tuned with Aspect Ratio Backetting with a maximum resolution of 1152x768 images, but some are up to 768x768 or 512x768.
	Image captions are made by BLIP and ~12k images also used WD1.4-tagger.

	\| Model Name \| Recipe \|
	\| ---------------- \| ------------------- \|
	\| FlexWaifuRainbow \| FlexWaifu + FWRLoRA \|

	### Usage

	The format of the caption suggests that a short natural language sentence followed by a comma-separated tags is the most natural way to describe the prompt.
	Using more tags that are well-estimated by the tagger in the trained images may lead to more preferable generation.
	"tags.json" lists the tags estimated for over 200 of the 12k images.
	Tag semantics may be inappropriate for automatic tagging, so please emphasize appropriately.

	## License

	This model is open access and available to all, with a CreativeML OpenRAIL-M license further specifying rights and usage.
	The CreativeML OpenRAIL License specifies:

	1. You can't use the model to deliberately produce nor share illegal or harmful outputs or content
	2. The authors claims no rights on the outputs you generate, you are free to use them and are accountable for their use which must not go against the provisions set in the license
	3. You may re-distribute the weights and use the model commercially and/or as a service. If you do, please be aware you have to include the same use restrictions as the ones in the license and share a copy of the CreativeML OpenRAIL-M to all your users (please read the license entirely and carefully)
	[Please read the full license here](https://huggingface.co/spaces/CompVis/stable-diffusion-license)

	## Acknowledgements

	These Models build on the two excellent works: SD1.4, developed by [CompVis Researchers](https://ommer-lab.com/), and WD1.3, developed by [Anthony Mercurio](https://github.com/harubaru), [Salt](https://github.com/sALTaccount/), and [Cafe](https://twitter.com/cafeai_labs).

	## Examples

	![](images/01013-3282638012.png)
	![](images/grid-0170.png)
	![](images/grid-0174.png)
	![](images/grid-0176.png)
	![](images/grid-0182.png)
	![](images/grid-0183.png)