Commit 86d0880 by KBlueLeaf
1 parent: a3a596a

Upload main model

LICENSE ADDED
@@ -0,0 +1,152 @@
+ # Fair AI Public License 1.0-SD
+
+ Published by the [Freedom of Development Project](https://freedevproject.org).
+
+ *This "SD variant" license contains a [Prohibited Uses](#prohibited-uses)
+ section designed to be compatible with Stable Diffusion's license. Because of
+ that section, this is not a free software license. Unless you are releasing a
+ derivative of a Stable Diffusion model, it is not recommended that you use this
+ license.*
+
+ *This license comes with special requirements if you intend to allow users
+ to access this software over a network. See [Notices](#notices) for more
+ information.*
+
+ ## Purpose
+ This license gives everyone as much permission to work with this software as
+ possible, while protecting contributors from liability, protecting the
+ freedom of end users, and reducing harm.
+
+ ## Definitions
+ In this license, "model" refers to machine learning model weights, biases,
+ parameters, optimizer states, and any byproducts of a training or pretraining
+ process, whether in the form of checkpoints or any other form.
+
+ The term "derived model" refers to any model based on this model.
+
+ The term "software" also refers to any model along with documentation or
+ other resources provided with the software.
+
+ The term "source code" refers to the preferred form of making modifications
+ to software. It also includes any models, if applicable, but it does not
+ include any datasets used to train a model.
+
+ To "modify" also means to perform any training on a model or to combine a
+ model with another model.
+
+ ## Acceptance
+ In order to receive this license, you must agree to its rules. The rules of
+ this license are both obligations under that agreement and conditions to your
+ license. You must not do anything with this software that triggers a rule that
+ you cannot or will not follow. If you do not agree, then you cannot use this
+ software in any way.
+
+ ## Copyright
+ Each contributor licenses you to do everything with this software that would
+ otherwise infringe that contributor's copyright in it.
+
+ ## Freedom
+ Neither this software nor any work that is combined with this software will be
+ considered a technological protection measure under the WIPO Copyright Treaty
+ or any similar law. Reverse engineering of this software and of any work that
+ is combined with this software is always allowed.
+
+ ## Notices
+ You must ensure that everyone who gets a copy of any part of this software from
+ you, with or without changes, also gets the text of this license along with
+ the corresponding source code.
+
+ If you modify this software and allow users to interact with it through a
+ computer network, you must ensure they have a reasonable way to receive the
+ corresponding source code from you, whether that is via a download link or a
+ prominent written offer. As a special case, if you are only allowing users to
+ interact with a derived model, then you may choose to provide a download link
+ or written offer only for the derived model.
+
+ This software, all source code, and all modifications must be provided under
+ this license or another license that allows everything this license allows.
+ Note that this does not give you permission to change the license for this
+ software.
+
+ ## Excuse
+ If anyone notifies you in writing that you have not complied with
+ [Notices](#notices), you can keep your license by taking all practical steps
+ to comply within 30 days after the notice. If you do not do so, your license
+ ends immediately.
+
+ ## Output
+ The output of this software is not covered by this license, and no contributor
+ claims any rights to it.
+
+ ## Patent
+ Each contributor licenses you to do everything with this software that would
+ otherwise infringe any patent claims they can license or become able to license.
+
+ ## Reliability
+ No contributor can revoke this license.
+
+ ## Alternatives
+ You can also use any non-model parts of this software under the terms of the
+ GNU AGPL 3.0, or any later version of that license. If you do,
+ [No Harm](#no-harm) and [No Liability](#no-liability) still apply.
+
+ ## Revisions
+ The Freedom of Development Project may publish revised or new versions of the
+ Fair AI Public License. Those new versions will be similar in spirit to this
+ license.
+
+ Unless a contributor specifies otherwise, you have the option of following the
+ terms of any later version of this license. Your choice to follow a later
+ version of the license will not impose additional obligations on any
+ contributor. Even if you do choose to follow a later version, the restrictions
+ of [Prohibited Uses](#prohibited-uses) will still apply.
+
+ ## Survival
+ The provisions of [No Harm](#no-harm) and [No Liability](#no-liability) survive
+ the end of your license.
+
+ ## No Harm
+ You agree that no contributor's conduct in the creation of this software has
+ caused you any harm. As far as the law allows, you give up your right to pursue
+ any kind of legal claim against any contributor for actions related the
+ creation of this software, even if those actions broke a previous agreement.
+
+ Additionally, you agree not to use this model for harmful purposes, as listed
+ in [Prohibited Uses](#prohibited-uses). These restrictions do not apply to
+ non-model parts of this software.
+
+ ## No Liability
+ ***As far as the law allows, this software comes as is, without any warranty or
+ condition, and no contributor will be liable to anyone for any damages related
+ to this software or this license, under any kind of legal claim.***
+
+ ## Prohibited Uses
+ You may not use this model or any derived model for the following:
+
+ - In any way that violates any applicable national, federal, state, local or
+ international law or regulation;
+ - For the purpose of exploiting, harming or attempting to exploit or harm
+ minors in any way;
+ - To generate or disseminate verifiably false information and/or content with
+ the purpose of harming others;
+ - To generate or disseminate personal identifiable information that can be used
+ to harm an individual;
+ - To defame, disparage or otherwise harass others;
+ - For fully automated decision making that adversely impacts an individual’s
+ legal rights or otherwise creates or modifies a binding, enforceable obligation;
+ - For any use intended to or which has the effect of discriminating against or
+ harming individuals or groups based on online or offline social behavior or
+ known or predicted personal or personality characteristics;
+ - To exploit any of the vulnerabilities of a specific group of persons based on
+ their age, social, physical or mental characteristics, in order to materially
+ distort the behavior of a person pertaining to that group in a manner that
+ causes or is likely to cause that person or another person physical or
+ psychological harm;
+ - For any use intended to or which has the effect of discriminating against
+ individuals or groups based on legally protected characteristics or categories;
+ - To provide medical advice and medical results interpretation;
+ - To generate or disseminate information for the purpose to be used for
+ administration of justice, law enforcement, immigration or asylum processes,
+ such as predicting an individual will commit fraud/crime commitment (e.g. by
+ text profiling, drawing causal relationships between assertions made in
+ documents, indiscriminate and arbitrarily-targeted use).
kohaku-xl-delta-rev1.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:45c05e1d940f3e4cda231cafc953dfd8644b8e024db5c316bf17429acb834bf5
+ size 6938040286
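The three added lines above are a Git LFS pointer, not the weights themselves: they record the spec version, a SHA-256 object ID, and the byte size of the real file. The following is a minimal sketch of how such a pointer could be parsed and used to check a downloaded file. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any Git LFS tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Compare a local file's size and SHA-256 digest with the pointer."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False  # cheap check first, avoids hashing a truncated download
    hasher = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer content copied from the diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:45c05e1d940f3e4cda231cafc953dfd8644b8e024db5c316bf17429acb834bf5
size 6938040286
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algorithm"], pointer["size"])
```

Calling `matches_pointer("kohaku-xl-delta-rev1.safetensors", pointer)` on a local copy would then confirm that the download is complete and unmodified.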
model_index.json ADDED
@@ -0,0 +1,41 @@
+ {
+   "_class_name": "StableDiffusionXLPipeline",
+   "_diffusers_version": "0.25.1",
+   "feature_extractor": [
+     null,
+     null
+   ],
+   "force_zeros_for_empty_prompt": true,
+   "image_encoder": [
+     null,
+     null
+   ],
+   "scheduler": [
+     "diffusers",
+     "EulerDiscreteScheduler"
+   ],
+   "text_encoder": [
+     "transformers",
+     "CLIPTextModel"
+   ],
+   "text_encoder_2": [
+     "transformers",
+     "CLIPTextModelWithProjection"
+   ],
+   "tokenizer": [
+     "transformers",
+     "CLIPTokenizer"
+   ],
+   "tokenizer_2": [
+     "transformers",
+     "CLIPTokenizer"
+   ],
+   "unet": [
+     "diffusers",
+     "UNet2DConditionModel"
+   ],
+   "vae": [
+     "diffusers",
+     "AutoencoderKL"
+   ]
+ }
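Each list-valued entry in `model_index.json` pairs a library name with a class name, and the entry's key matches a subfolder added in this commit. As a rough illustration of that layout (not the loader that diffusers itself uses), the sketch below reads the JSON and lists the populated components. The function name `list_components` is hypothetical.

```python
import json

# model_index.json content copied from the diff above.
MODEL_INDEX = json.loads("""
{
  "_class_name": "StableDiffusionXLPipeline",
  "_diffusers_version": "0.25.1",
  "feature_extractor": [null, null],
  "force_zeros_for_empty_prompt": true,
  "image_encoder": [null, null],
  "scheduler": ["diffusers", "EulerDiscreteScheduler"],
  "text_encoder": ["transformers", "CLIPTextModel"],
  "text_encoder_2": ["transformers", "CLIPTextModelWithProjection"],
  "tokenizer": ["transformers", "CLIPTokenizer"],
  "tokenizer_2": ["transformers", "CLIPTokenizer"],
  "unet": ["diffusers", "UNet2DConditionModel"],
  "vae": ["diffusers", "AutoencoderKL"]
}
""")


def list_components(index: dict) -> dict:
    """Map component name -> (library, class), skipping metadata and unset slots."""
    components = {}
    for name, value in index.items():
        if name.startswith("_") or not isinstance(value, list):
            continue  # metadata keys and scalar options
        library, class_name = value
        if library is None or class_name is None:
            continue  # optional component left as [null, null]
        components[name] = (library, class_name)
    return components


for name, (library, class_name) in list_components(MODEL_INDEX).items():
    print(f"{name}/ -> {library}.{class_name}")
```

Here `feature_extractor` and `image_encoder` are skipped because both are `[null, null]` in this commit.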
scheduler/scheduler_config.json ADDED
@@ -0,0 +1,22 @@
+ {
+   "_class_name": "EulerDiscreteScheduler",
+   "_diffusers_version": "0.25.1",
+   "beta_end": 0.012,
+   "beta_schedule": "scaled_linear",
+   "beta_start": 0.00085,
+   "clip_sample": false,
+   "interpolation_type": "linear",
+   "num_train_timesteps": 1000,
+   "prediction_type": "epsilon",
+   "rescale_betas_zero_snr": false,
+   "sample_max_value": 1.0,
+   "set_alpha_to_one": false,
+   "sigma_max": null,
+   "sigma_min": null,
+   "skip_prk_steps": true,
+   "steps_offset": 1,
+   "timestep_spacing": "leading",
+   "timestep_type": "discrete",
+   "trained_betas": null,
+   "use_karras_sigmas": false
+ }
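To make the scheduler values above more concrete, here is a small pure-Python sketch of what they imply numerically. It assumes the usual meaning of these options in diffusers: `scaled_linear` interpolates linearly in sqrt(beta) space and then squares, sigmas are derived from the cumulative product of `1 - beta`, and `leading` spacing takes evenly strided timesteps shifted by `steps_offset`. The function names are illustrative, and the library itself works on float32 arrays rather than Python lists.

```python
import math

# Values copied from scheduler/scheduler_config.json above.
BETA_START = 0.00085
BETA_END = 0.012
NUM_TRAIN_TIMESTEPS = 1000
STEPS_OFFSET = 1


def scaled_linear_betas(beta_start: float, beta_end: float, n: int) -> list:
    """Interpolate linearly between sqrt(beta_start) and sqrt(beta_end), then square."""
    lo, hi = math.sqrt(beta_start), math.sqrt(beta_end)
    return [(lo + (hi - lo) * i / (n - 1)) ** 2 for i in range(n)]


def sigmas_from_betas(betas: list) -> list:
    """sigma_t = sqrt((1 - abar_t) / abar_t), with abar_t the running product of (1 - beta)."""
    sigmas, abar = [], 1.0
    for beta in betas:
        abar *= 1.0 - beta
        sigmas.append(math.sqrt((1.0 - abar) / abar))
    return sigmas


def leading_timesteps(num_inference_steps: int,
                      num_train_timesteps: int = NUM_TRAIN_TIMESTEPS,
                      steps_offset: int = STEPS_OFFSET) -> list:
    """Evenly strided timesteps in descending order, shifted by steps_offset."""
    step_ratio = num_train_timesteps // num_inference_steps
    return [i * step_ratio + steps_offset
            for i in reversed(range(num_inference_steps))]


betas = scaled_linear_betas(BETA_START, BETA_END, NUM_TRAIN_TIMESTEPS)
sigmas = sigmas_from_betas(betas)
timesteps = leading_timesteps(20)
print(f"sigma range: {sigmas[0]:.4f} .. {sigmas[-1]:.4f}")
print("first/last timestep:", timesteps[0], timesteps[-1])
```

With `sigma_min` and `sigma_max` left as `null` in the config, the noise range comes entirely from the beta schedule, which is what the sketch computes.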
text_encoder/config.json ADDED
@@ -0,0 +1,25 @@
+ {
+   "_name_or_path": null,
+   "architectures": [
+     "CLIPTextModel"
+   ],
+   "attention_dropout": 0.0,
+   "bos_token_id": 0,
+   "dropout": 0.0,
+   "eos_token_id": 2,
+   "hidden_act": "quick_gelu",
+   "hidden_size": 768,
+   "initializer_factor": 1.0,
+   "initializer_range": 0.02,
+   "intermediate_size": 3072,
+   "layer_norm_eps": 1e-05,
+   "max_position_embeddings": 77,
+   "model_type": "clip_text_model",
+   "num_attention_heads": 12,
+   "num_hidden_layers": 12,
+   "pad_token_id": 1,
+   "projection_dim": 768,
+   "torch_dtype": "float16",
+   "transformers_version": "4.38.1",
+   "vocab_size": 49408
+ }
text_encoder/pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b80d9e5b69ae659d84a6b37b01266f54766555b84904a5dfe6ae542a50341e27
+ size 246185562
text_encoder_2/config.json ADDED
@@ -0,0 +1,25 @@
+ {
+   "_name_or_path": null,
+   "architectures": [
+     "CLIPTextModelWithProjection"
+   ],
+   "attention_dropout": 0.0,
+   "bos_token_id": 0,
+   "dropout": 0.0,
+   "eos_token_id": 2,
+   "hidden_act": "gelu",
+   "hidden_size": 1280,
+   "initializer_factor": 1.0,
+   "initializer_range": 0.02,
+   "intermediate_size": 5120,
+   "layer_norm_eps": 1e-05,
+   "max_position_embeddings": 77,
+   "model_type": "clip_text_model",
+   "num_attention_heads": 20,
+   "num_hidden_layers": 32,
+   "pad_token_id": 1,
+   "projection_dim": 1280,
+   "torch_dtype": "float16",
+   "transformers_version": "4.38.1",
+   "vocab_size": 49408
+ }
text_encoder_2/pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:14c976ae0e2d1c4e2095a9ca3c5afcb1a8d448d60ae667737dd53ce105d7fcb8
+ size 1389490462
tokenizer/merges.txt ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer/special_tokens_map.json ADDED
@@ -0,0 +1,24 @@
+ {
+   "bos_token": {
+     "content": "<|startoftext|>",
+     "lstrip": false,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   },
+   "eos_token": {
+     "content": "<|endoftext|>",
+     "lstrip": false,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   },
+   "pad_token": "<|endoftext|>",
+   "unk_token": {
+     "content": "<|endoftext|>",
+     "lstrip": false,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   }
+ }
tokenizer/tokenizer_config.json ADDED
@@ -0,0 +1,30 @@
+ {
+   "add_prefix_space": false,
+   "added_tokens_decoder": {
+     "49406": {
+       "content": "<|startoftext|>",
+       "lstrip": false,
+       "normalized": true,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "49407": {
+       "content": "<|endoftext|>",
+       "lstrip": false,
+       "normalized": true,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     }
+   },
+   "bos_token": "<|startoftext|>",
+   "clean_up_tokenization_spaces": true,
+   "do_lower_case": true,
+   "eos_token": "<|endoftext|>",
+   "errors": "replace",
+   "model_max_length": 77,
+   "pad_token": "<|endoftext|>",
+   "tokenizer_class": "CLIPTokenizer",
+   "unk_token": "<|endoftext|>"
+ }
tokenizer/vocab.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_2/merges.txt ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_2/special_tokens_map.json ADDED
@@ -0,0 +1,24 @@
+ {
+   "bos_token": {
+     "content": "<|startoftext|>",
+     "lstrip": false,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   },
+   "eos_token": {
+     "content": "<|endoftext|>",
+     "lstrip": false,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   },
+   "pad_token": "!",
+   "unk_token": {
+     "content": "<|endoftext|>",
+     "lstrip": false,
+     "normalized": true,
+     "rstrip": false,
+     "single_word": false
+   }
+ }
tokenizer_2/tokenizer_config.json ADDED
@@ -0,0 +1,38 @@
+ {
+   "add_prefix_space": false,
+   "added_tokens_decoder": {
+     "0": {
+       "content": "!",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "49406": {
+       "content": "<|startoftext|>",
+       "lstrip": false,
+       "normalized": true,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "49407": {
+       "content": "<|endoftext|>",
+       "lstrip": false,
+       "normalized": true,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     }
+   },
+   "bos_token": "<|startoftext|>",
+   "clean_up_tokenization_spaces": true,
+   "do_lower_case": true,
+   "eos_token": "<|endoftext|>",
+   "errors": "replace",
+   "model_max_length": 77,
+   "pad_token": "!",
+   "tokenizer_class": "CLIPTokenizer",
+   "unk_token": "<|endoftext|>"
+ }
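The two tokenizer configs differ mainly in their padding token: `tokenizer` pads with `<|endoftext|>` (id 49407), while `tokenizer_2` pads with `!` (id 0). The sketch below shows how the same prompt would end up as two different fixed-length ID sequences. It is a simplified stand-in for padding to `model_max_length`, not the `CLIPTokenizer` implementation, and the prompt IDs in `EXAMPLE_IDS` are hypothetical placeholders rather than real vocabulary lookups.

```python
# Values copied from the tokenizer configs above.
MODEL_MAX_LENGTH = 77
BOS_ID = 49406             # <|startoftext|>
EOS_ID = 49407             # <|endoftext|>
PAD_ID_TOKENIZER = 49407   # tokenizer pads with <|endoftext|>
PAD_ID_TOKENIZER_2 = 0     # tokenizer_2 pads with "!"

# Hypothetical token IDs standing in for an already-tokenized prompt.
EXAMPLE_IDS = [320, 2368]


def pad_ids(token_ids: list, pad_id: int,
            max_length: int = MODEL_MAX_LENGTH) -> list:
    """Wrap in BOS/EOS, truncate to max_length, then right-pad with pad_id."""
    body = token_ids[: max_length - 2]  # leave room for BOS and EOS
    ids = [BOS_ID] + body + [EOS_ID]
    return ids + [pad_id] * (max_length - len(ids))


ids_1 = pad_ids(EXAMPLE_IDS, PAD_ID_TOKENIZER)
ids_2 = pad_ids(EXAMPLE_IDS, PAD_ID_TOKENIZER_2)
print(ids_1[:6], "...")
print(ids_2[:6], "...")
```

The real tokens are identical in both sequences. Only the padding tail differs, so each text encoder sees its own padding convention.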
tokenizer_2/vocab.json ADDED
The diff for this file is too large to render. See raw diff
 
unet/config.json ADDED
@@ -0,0 +1,72 @@
+ {
+   "_class_name": "UNet2DConditionModel",
+   "_diffusers_version": "0.25.1",
+   "act_fn": "silu",
+   "addition_embed_type": "text_time",
+   "addition_embed_type_num_heads": 64,
+   "addition_time_embed_dim": 256,
+   "attention_head_dim": [
+     5,
+     10,
+     20
+   ],
+   "attention_type": "default",
+   "block_out_channels": [
+     320,
+     640,
+     1280
+   ],
+   "center_input_sample": false,
+   "class_embed_type": null,
+   "class_embeddings_concat": false,
+   "conv_in_kernel": 3,
+   "conv_out_kernel": 3,
+   "cross_attention_dim": 2048,
+   "cross_attention_norm": null,
+   "down_block_types": [
+     "DownBlock2D",
+     "CrossAttnDownBlock2D",
+     "CrossAttnDownBlock2D"
+   ],
+   "downsample_padding": 1,
+   "dropout": 0.0,
+   "dual_cross_attention": false,
+   "encoder_hid_dim": null,
+   "encoder_hid_dim_type": null,
+   "flip_sin_to_cos": true,
+   "freq_shift": 0,
+   "in_channels": 4,
+   "layers_per_block": 2,
+   "mid_block_only_cross_attention": null,
+   "mid_block_scale_factor": 1,
+   "mid_block_type": "UNetMidBlock2DCrossAttn",
+   "norm_eps": 1e-05,
+   "norm_num_groups": 32,
+   "num_attention_heads": null,
+   "num_class_embeds": null,
+   "only_cross_attention": false,
+   "out_channels": 4,
+   "projection_class_embeddings_input_dim": 2816,
+   "resnet_out_scale_factor": 1.0,
+   "resnet_skip_time_act": false,
+   "resnet_time_scale_shift": "default",
+   "reverse_transformer_layers_per_block": null,
+   "sample_size": 128,
+   "time_cond_proj_dim": null,
+   "time_embedding_act_fn": null,
+   "time_embedding_dim": null,
+   "time_embedding_type": "positional",
+   "timestep_post_act": null,
+   "transformer_layers_per_block": [
+     1,
+     2,
+     10
+   ],
+   "up_block_types": [
+     "CrossAttnUpBlock2D",
+     "CrossAttnUpBlock2D",
+     "UpBlock2D"
+   ],
+   "upcast_attention": false,
+   "use_linear_projection": true
+ }
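Several numbers in the UNet config can be cross-checked against the two text encoder configs earlier in this commit. The sketch below does that arithmetic. It assumes the usual SDXL conditioning layout: the two encoders' hidden states are concatenated for cross-attention, and the added embedding combines the pooled `text_encoder_2` projection with six embedded size and crop values. The count of six (`NUM_TIME_IDS`) is that assumption and does not appear in the config itself.

```python
# Values copied from the configs in this commit.
TEXT_ENCODER_HIDDEN = 768          # text_encoder/config.json: hidden_size
TEXT_ENCODER_2_HIDDEN = 1280       # text_encoder_2/config.json: hidden_size
TEXT_ENCODER_2_PROJECTION = 1280   # text_encoder_2/config.json: projection_dim
ADDITION_TIME_EMBED_DIM = 256      # unet/config.json
BLOCK_OUT_CHANNELS = [320, 640, 1280]
ATTENTION_HEAD_DIM = [5, 10, 20]

# Assumption: original size (h, w), crop offset (top, left), target size (h, w).
NUM_TIME_IDS = 6

# Concatenating both encoders' per-token hidden states gives the cross-attention width.
cross_attention_dim = TEXT_ENCODER_HIDDEN + TEXT_ENCODER_2_HIDDEN

# Pooled text embedding plus one time embedding per size/crop value.
projection_input_dim = (TEXT_ENCODER_2_PROJECTION
                        + NUM_TIME_IDS * ADDITION_TIME_EMBED_DIM)

# Ratio of block width to the attention_head_dim entry at each resolution level.
channels_per_head = [c // h for c, h in zip(BLOCK_OUT_CHANNELS, ATTENTION_HEAD_DIM)]

print("cross_attention_dim:", cross_attention_dim)
print("projection_class_embeddings_input_dim:", projection_input_dim)
print("channels per attention_head_dim entry:", channels_per_head)
```

Both derived values agree with `cross_attention_dim` and `projection_class_embeddings_input_dim` in the config above, which is a quick consistency check on the uploaded components.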
unet/diffusion_pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a3c6a9b00ad2785fe5da1168dc4b507c9df2c420325173742e9ff6591e6f6bdc
+ size 5135669022
vae/config.json ADDED
@@ -0,0 +1,31 @@
+ {
+   "_class_name": "AutoencoderKL",
+   "_diffusers_version": "0.25.1",
+   "act_fn": "silu",
+   "block_out_channels": [
+     128,
+     256,
+     512,
+     512
+   ],
+   "down_block_types": [
+     "DownEncoderBlock2D",
+     "DownEncoderBlock2D",
+     "DownEncoderBlock2D",
+     "DownEncoderBlock2D"
+   ],
+   "force_upcast": true,
+   "in_channels": 3,
+   "latent_channels": 4,
+   "layers_per_block": 2,
+   "norm_num_groups": 32,
+   "out_channels": 3,
+   "sample_size": 256,
+   "scaling_factor": 0.18215,
+   "up_block_types": [
+     "UpDecoderBlock2D",
+     "UpDecoderBlock2D",
+     "UpDecoderBlock2D",
+     "UpDecoderBlock2D"
+   ]
+ }
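The VAE config determines how pixel dimensions map to the latent grid that the UNet operates on. As a rough sketch, and assuming the common convention that the spatial downscale factor is `2 ** (len(block_out_channels) - 1)`, the helpers below compute the latent shape for a given image size. The function names are illustrative.

```python
# Values copied from vae/config.json and unet/config.json in this commit.
VAE_BLOCK_OUT_CHANNELS = [128, 256, 512, 512]
LATENT_CHANNELS = 4
UNET_SAMPLE_SIZE = 128


def vae_scale_factor(block_out_channels: list) -> int:
    """Each block after the first halves the spatial resolution."""
    return 2 ** (len(block_out_channels) - 1)


def latent_shape(height: int, width: int) -> tuple:
    """Return (channels, latent_height, latent_width) for an image size in pixels."""
    factor = vae_scale_factor(VAE_BLOCK_OUT_CHANNELS)
    if height % factor or width % factor:
        raise ValueError(f"height and width must be multiples of {factor}")
    return (LATENT_CHANNELS, height // factor, width // factor)


# The UNet's sample_size is expressed in latent units, so the matching
# pixel resolution is sample_size times the VAE scale factor.
default_resolution = UNET_SAMPLE_SIZE * vae_scale_factor(VAE_BLOCK_OUT_CHANNELS)

print("scale factor:", vae_scale_factor(VAE_BLOCK_OUT_CHANNELS))
print("default resolution:", default_resolution)
print("latent shape for 1024x1024:", latent_shape(1024, 1024))
```

The four latent channels match `in_channels` and `out_channels` in the UNet config, so the two components are dimensionally compatible.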
vae/diffusion_pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b921a15c9833d884465b5faca30a1a2fdd57cf7fc60c33a47d87be4dc3806afc
+ size 167404866