KBlueLeaf commited on Mar 3

Commit

86d0880

•

1 Parent(s): a3a596a

Upload main model

Browse files

Files changed (20) hide show

LICENSE +152 -0
kohaku-xl-delta-rev1.safetensors +3 -0
model_index.json +41 -0
scheduler/scheduler_config.json +22 -0
text_encoder/config.json +25 -0
text_encoder/pytorch_model.bin +3 -0
text_encoder_2/config.json +25 -0
text_encoder_2/pytorch_model.bin +3 -0
tokenizer/merges.txt +0 -0
tokenizer/special_tokens_map.json +24 -0
tokenizer/tokenizer_config.json +30 -0
tokenizer/vocab.json +0 -0
tokenizer_2/merges.txt +0 -0
tokenizer_2/special_tokens_map.json +24 -0
tokenizer_2/tokenizer_config.json +38 -0
tokenizer_2/vocab.json +0 -0
unet/config.json +72 -0
unet/diffusion_pytorch_model.bin +3 -0
vae/config.json +31 -0
vae/diffusion_pytorch_model.bin +3 -0

LICENSE ADDED Viewed

	@@ -0,0 +1,152 @@

+# Fair AI Public License 1.0-SD
+Published by the [Freedom of Development Project](https://freedevproject.org).
+*This "SD variant" license contains a [Prohibited Uses](#prohibited-uses)
+section designed to be compatible with Stable Diffusion's license. Because of
+that section, this is not a free software license. Unless you are releasing a
+derivative of a Stable Diffusion model, it is not recommended that you use this
+license.*
+*This license comes with special requirements if you intend to allow users
+to access this software over a network. See [Notices](#notices) for more
+information.*
+## Purpose
+This license gives everyone as much permission to work with this software as
+possible, while protecting contributors from liability, protecting the
+freedom of end users, and reducing harm.
+## Definitions
+In this license, "model" refers to machine learning model weights, biases,
+parameters, optimizer states, and any byproducts of a training or pretraining
+process, whether in the form of checkpoints or any other form.
+The term "derived model" refers to any model based on this model.
+The term "software" also refers to any model along with documentation or
+other resources provided with the software.
+The term "source code" refers to the preferred form of making modifications
+to software. It also includes any models, if applicable, but it does not
+include any datasets used to train a model.
+To "modify" also means to perform any training on a model or to combine a
+model with another model.
+## Acceptance
+In order to receive this license, you must agree to its rules. The rules of
+this license are both obligations under that agreement and conditions to your
+license. You must not do anything with this software that triggers a rule that
+you cannot or will not follow. If you do not agree, then you cannot use this
+software in any way.
+## Copyright
+Each contributor licenses you to do everything with this software that would
+otherwise infringe that contributor's copyright in it.
+## Freedom
+Neither this software nor any work that is combined with this software will be
+considered a technological protection measure under the WIPO Copyright Treaty
+or any similar law. Reverse engineering of this software and of any work that
+is combined with this software is always allowed.
+## Notices
+You must ensure that everyone who gets a copy of any part of this software from
+you, with or without changes, also gets the text of this license along with
+the corresponding source code.
+If you modify this software and allow users to interact with it through a
+computer network, you must ensure they have a reasonable way to receive the
+corresponding source code from you, whether that is via a download link or a
+prominent written offer. As a special case, if you are only allowing users to
+interact with a derived model, then you may choose to provide a download link
+or written offer only for the derived model.
+This software, all source code, and all modifications must be provided under
+this license or another license that allows everything this license allows.
+Note that this does not give you permission to change the license for this
+software.
+## Excuse
+If anyone notifies you in writing that you have not complied with
+[Notices](#notices), you can keep your license by taking all practical steps
+to comply within 30 days after the notice. If you do not do so, your license
+ends immediately.
+## Output
+The output of this software is not covered by this license, and no contributor
+claims any rights to it.
+## Patent
+Each contributor licenses you to do everything with this software that would
+otherwise infringe any patent claims they can license or become able to license.
+## Reliability
+No contributor can revoke this license.
+## Alternatives
+You can also use any non-model parts of this software under the terms of the
+GNU AGPL 3.0, or any later version of that license. If you do,
+[No Harm](#no-harm) and [No Liability](#no-liability) still apply.
+## Revisions
+The Freedom of Development Project may publish revised or new versions of the
+Fair AI Public License. Those new versions will be similar in spirit to this
+license.
+Unless a contributor specifies otherwise, you have the option of following the
+terms of any later version of this license. Your choice to follow a later
+version of the license will not impose additional obligations on any
+contributor. Even if you do choose to follow a later version, the restrictions
+of [Prohibited Uses](#prohibited-uses) will still apply.
+## Survival
+The provisions of [No Harm](#no-harm) and [No Liability](#no-liability) survive
+the end of your license.
+## No Harm
+You agree that no contributor's conduct in the creation of this software has
+caused you any harm. As far as the law allows, you give up your right to pursue
+any kind of legal claim against any contributor for actions related the
+creation of this software, even if those actions broke a previous agreement.
+Additionally, you agree not to use this model for harmful purposes, as listed
+in [Prohibited Uses](#prohibited-uses). These restrictions do not apply to
+non-model parts of this software.
+## No Liability
+***As far as the law allows, this software comes as is, without any warranty or
+condition, and no contributor will be liable to anyone for any damages related
+to this software or this license, under any kind of legal claim.***
+## Prohibited Uses
+You may not use this model or any derived model for the following:
+- In any way that violates any applicable national, federal, state, local or
+international law or regulation;
+- For the purpose of exploiting, harming or attempting to exploit or harm
+minors in any way;
+- To generate or disseminate verifiably false information and/or content with
+the purpose of harming others;
+- To generate or disseminate personal identifiable information that can be used
+to harm an individual;
+- To defame, disparage or otherwise harass others;
+- For fully automated decision making that adversely impacts an individual’s
+legal rights or otherwise creates or modifies a binding, enforceable obligation;
+- For any use intended to or which has the effect of discriminating against or
+harming individuals or groups based on online or offline social behavior or
+known or predicted personal or personality characteristics;
+- To exploit any of the vulnerabilities of a specific group of persons based on
+their age, social, physical or mental characteristics, in order to materially
+distort the behavior of a person pertaining to that group in a manner that
+causes or is likely to cause that person or another person physical or
+psychological harm;
+- For any use intended to or which has the effect of discriminating against
+individuals or groups based on legally protected characteristics or categories;
+- To provide medical advice and medical results interpretation;
+- To generate or disseminate information for the purpose to be used for
+administration of justice, law enforcement, immigration or asylum processes,
+such as predicting an individual will commit fraud/crime commitment (e.g. by
+text profiling, drawing causal relationships between assertions made in
+documents, indiscriminate and arbitrarily-targeted use).

kohaku-xl-delta-rev1.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:45c05e1d940f3e4cda231cafc953dfd8644b8e024db5c316bf17429acb834bf5
+size 6938040286

model_index.json ADDED Viewed

	@@ -0,0 +1,41 @@

+{
+  "_class_name": "StableDiffusionXLPipeline",
+  "_diffusers_version": "0.25.1",
+  "feature_extractor": [
+    null,
+    null
+  ],
+  "force_zeros_for_empty_prompt": true,
+  "image_encoder": [
+    null,
+    null
+  ],
+  "scheduler": [
+    "diffusers",
+    "EulerDiscreteScheduler"
+  ],
+  "text_encoder": [
+    "transformers",
+    "CLIPTextModel"
+  ],
+  "text_encoder_2": [
+    "transformers",
+    "CLIPTextModelWithProjection"
+  ],
+  "tokenizer": [
+    "transformers",
+    "CLIPTokenizer"
+  ],
+  "tokenizer_2": [
+    "transformers",
+    "CLIPTokenizer"
+  ],
+  "unet": [
+    "diffusers",
+    "UNet2DConditionModel"
+  ],
+  "vae": [
+    "diffusers",
+    "AutoencoderKL"
+  ]
+}

scheduler/scheduler_config.json ADDED Viewed

	@@ -0,0 +1,22 @@

+{
+  "_class_name": "EulerDiscreteScheduler",
+  "_diffusers_version": "0.25.1",
+  "beta_end": 0.012,
+  "beta_schedule": "scaled_linear",
+  "beta_start": 0.00085,
+  "clip_sample": false,
+  "interpolation_type": "linear",
+  "num_train_timesteps": 1000,
+  "prediction_type": "epsilon",
+  "rescale_betas_zero_snr": false,
+  "sample_max_value": 1.0,
+  "set_alpha_to_one": false,
+  "sigma_max": null,
+  "sigma_min": null,
+  "skip_prk_steps": true,
+  "steps_offset": 1,
+  "timestep_spacing": "leading",
+  "timestep_type": "discrete",
+  "trained_betas": null,
+  "use_karras_sigmas": false
+}

text_encoder/config.json ADDED Viewed

	@@ -0,0 +1,25 @@

+{
+  "_name_or_path": null,
+  "architectures": [
+    "CLIPTextModel"
+  ],
+  "attention_dropout": 0.0,
+  "bos_token_id": 0,
+  "dropout": 0.0,
+  "eos_token_id": 2,
+  "hidden_act": "quick_gelu",
+  "hidden_size": 768,
+  "initializer_factor": 1.0,
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "layer_norm_eps": 1e-05,
+  "max_position_embeddings": 77,
+  "model_type": "clip_text_model",
+  "num_attention_heads": 12,
+  "num_hidden_layers": 12,
+  "pad_token_id": 1,
+  "projection_dim": 768,
+  "torch_dtype": "float16",
+  "transformers_version": "4.38.1",
+  "vocab_size": 49408
+}

text_encoder/pytorch_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b80d9e5b69ae659d84a6b37b01266f54766555b84904a5dfe6ae542a50341e27
+size 246185562

text_encoder_2/config.json ADDED Viewed

	@@ -0,0 +1,25 @@

+{
+  "_name_or_path": null,
+  "architectures": [
+    "CLIPTextModelWithProjection"
+  ],
+  "attention_dropout": 0.0,
+  "bos_token_id": 0,
+  "dropout": 0.0,
+  "eos_token_id": 2,
+  "hidden_act": "gelu",
+  "hidden_size": 1280,
+  "initializer_factor": 1.0,
+  "initializer_range": 0.02,
+  "intermediate_size": 5120,
+  "layer_norm_eps": 1e-05,
+  "max_position_embeddings": 77,
+  "model_type": "clip_text_model",
+  "num_attention_heads": 20,
+  "num_hidden_layers": 32,
+  "pad_token_id": 1,
+  "projection_dim": 1280,
+  "torch_dtype": "float16",
+  "transformers_version": "4.38.1",
+  "vocab_size": 49408
+}

text_encoder_2/pytorch_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:14c976ae0e2d1c4e2095a9ca3c5afcb1a8d448d60ae667737dd53ce105d7fcb8
+size 1389490462

tokenizer/merges.txt ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer/special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,24 @@

+{
+  "bos_token": {
+    "content": "<|startoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "eos_token": {
+    "content": "<|endoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "pad_token": "<|endoftext|>",
+  "unk_token": {
+    "content": "<|endoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  }
+}

tokenizer/tokenizer_config.json ADDED Viewed

	@@ -0,0 +1,30 @@

+{
+  "add_prefix_space": false,
+  "added_tokens_decoder": {
+    "49406": {
+      "content": "<|startoftext|>",
+      "lstrip": false,
+      "normalized": true,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "49407": {
+      "content": "<|endoftext|>",
+      "lstrip": false,
+      "normalized": true,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "bos_token": "<|startoftext|>",
+  "clean_up_tokenization_spaces": true,
+  "do_lower_case": true,
+  "eos_token": "<|endoftext|>",
+  "errors": "replace",
+  "model_max_length": 77,
+  "pad_token": "<|endoftext|>",
+  "tokenizer_class": "CLIPTokenizer",
+  "unk_token": "<|endoftext|>"
+}

tokenizer/vocab.json ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer_2/merges.txt ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer_2/special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,24 @@

+{
+  "bos_token": {
+    "content": "<|startoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "eos_token": {
+    "content": "<|endoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "pad_token": "!",
+  "unk_token": {
+    "content": "<|endoftext|>",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  }
+}

tokenizer_2/tokenizer_config.json ADDED Viewed

	@@ -0,0 +1,38 @@

+{
+  "add_prefix_space": false,
+  "added_tokens_decoder": {
+    "0": {
+      "content": "!",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "49406": {
+      "content": "<|startoftext|>",
+      "lstrip": false,
+      "normalized": true,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "49407": {
+      "content": "<|endoftext|>",
+      "lstrip": false,
+      "normalized": true,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "bos_token": "<|startoftext|>",
+  "clean_up_tokenization_spaces": true,
+  "do_lower_case": true,
+  "eos_token": "<|endoftext|>",
+  "errors": "replace",
+  "model_max_length": 77,
+  "pad_token": "!",
+  "tokenizer_class": "CLIPTokenizer",
+  "unk_token": "<|endoftext|>"
+}

tokenizer_2/vocab.json ADDED Viewed

The diff for this file is too large to render. See raw diff

unet/config.json ADDED Viewed

	@@ -0,0 +1,72 @@

+{
+  "_class_name": "UNet2DConditionModel",
+  "_diffusers_version": "0.25.1",
+  "act_fn": "silu",
+  "addition_embed_type": "text_time",
+  "addition_embed_type_num_heads": 64,
+  "addition_time_embed_dim": 256,
+  "attention_head_dim": [
+    5,
+    10,
+    20
+  ],
+  "attention_type": "default",
+  "block_out_channels": [
+    320,
+    640,
+    1280
+  ],
+  "center_input_sample": false,
+  "class_embed_type": null,
+  "class_embeddings_concat": false,
+  "conv_in_kernel": 3,
+  "conv_out_kernel": 3,
+  "cross_attention_dim": 2048,
+  "cross_attention_norm": null,
+  "down_block_types": [
+    "DownBlock2D",
+    "CrossAttnDownBlock2D",
+    "CrossAttnDownBlock2D"
+  ],
+  "downsample_padding": 1,
+  "dropout": 0.0,
+  "dual_cross_attention": false,
+  "encoder_hid_dim": null,
+  "encoder_hid_dim_type": null,
+  "flip_sin_to_cos": true,
+  "freq_shift": 0,
+  "in_channels": 4,
+  "layers_per_block": 2,
+  "mid_block_only_cross_attention": null,
+  "mid_block_scale_factor": 1,
+  "mid_block_type": "UNetMidBlock2DCrossAttn",
+  "norm_eps": 1e-05,
+  "norm_num_groups": 32,
+  "num_attention_heads": null,
+  "num_class_embeds": null,
+  "only_cross_attention": false,
+  "out_channels": 4,
+  "projection_class_embeddings_input_dim": 2816,
+  "resnet_out_scale_factor": 1.0,
+  "resnet_skip_time_act": false,
+  "resnet_time_scale_shift": "default",
+  "reverse_transformer_layers_per_block": null,
+  "sample_size": 128,
+  "time_cond_proj_dim": null,
+  "time_embedding_act_fn": null,
+  "time_embedding_dim": null,
+  "time_embedding_type": "positional",
+  "timestep_post_act": null,
+  "transformer_layers_per_block": [
+    1,
+    2,
+    10
+  ],
+  "up_block_types": [
+    "CrossAttnUpBlock2D",
+    "CrossAttnUpBlock2D",
+    "UpBlock2D"
+  ],
+  "upcast_attention": false,
+  "use_linear_projection": true
+}

unet/diffusion_pytorch_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a3c6a9b00ad2785fe5da1168dc4b507c9df2c420325173742e9ff6591e6f6bdc
+size 5135669022

vae/config.json ADDED Viewed

	@@ -0,0 +1,31 @@

+{
+  "_class_name": "AutoencoderKL",
+  "_diffusers_version": "0.25.1",
+  "act_fn": "silu",
+  "block_out_channels": [
+    128,
+    256,
+    512,
+    512
+  ],
+  "down_block_types": [
+    "DownEncoderBlock2D",
+    "DownEncoderBlock2D",
+    "DownEncoderBlock2D",
+    "DownEncoderBlock2D"
+  ],
+  "force_upcast": true,
+  "in_channels": 3,
+  "latent_channels": 4,
+  "layers_per_block": 2,
+  "norm_num_groups": 32,
+  "out_channels": 3,
+  "sample_size": 256,
+  "scaling_factor": 0.18215,
+  "up_block_types": [
+    "UpDecoderBlock2D",
+    "UpDecoderBlock2D",
+    "UpDecoderBlock2D",
+    "UpDecoderBlock2D"
+  ]
+}

vae/diffusion_pytorch_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b921a15c9833d884465b5faca30a1a2fdd57cf7fc60c33a47d87be4dc3806afc
+size 167404866