Yasunori Ozaki committed on
Commit
40087b8
1 Parent(s): c0667c6
.gitattributes CHANGED
@@ -32,3 +32,42 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ control_picasso11_openpose.ckpt filter=lfs diff=lfs merge=lfs -text
+ pose_1.png filter=lfs diff=lfs merge=lfs -text
+ girl_1.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_17.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_30.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_0.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_21.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_2.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_34.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_6.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_12.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_14.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_19.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_23.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_33.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_5.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_13.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_16.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_18.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_24.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_27.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_29.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_31.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_10.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_15.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_1.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_20.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_32.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_3.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_7.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_11.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_25.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_26.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_28.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_4.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_9.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_22.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_35.png filter=lfs diff=lfs merge=lfs -text
+ sample_poses/bone_8.png filter=lfs diff=lfs merge=lfs -text
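Every added `.gitattributes` line in this hunk has the same shape: a path followed by the four Git LFS attributes. The sketch below rebuilds the 36 `sample_poses` entries to make that pattern explicit. The helper name `lfs_attribute_line` is hypothetical, and the entries come out in numeric order rather than the order recorded in the commit.

```python
# Attributes used by every LFS-tracked entry in this .gitattributes file.
LFS_ATTRS = "filter=lfs diff=lfs merge=lfs -text"


def lfs_attribute_line(path: str) -> str:
    """Return one .gitattributes line that routes `path` through Git LFS."""
    return f"{path} {LFS_ATTRS}"


# bone_0.png through bone_35.png, as listed in the hunk.
sample_pose_lines = [
    lfs_attribute_line(f"sample_poses/bone_{i}.png") for i in range(36)
]

if __name__ == "__main__":
    print("\n".join(sample_pose_lines))
```

Because `.gitattributes` accepts glob patterns (the existing `*.zip` entry is one), a single pattern such as `sample_poses/*.png` would likely cover all 36 files. Whether per-file entries were a deliberate choice here is not stated in the commit.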
README.md CHANGED
@@ -1,3 +1,29 @@
  ---
  license: openrail++
  ---
+
+ # ControlNet for Stable Diffusion 2.1
+
+ ## Picasso Diffusion 1.1 (OpenPose)
+
+ **Note: This model is a proof of concept. You can test it with the sample poses only. I need more training data.**
+
+ 1. Install [Web UI](https://github.com/AUTOMATIC1111/stable-diffusion-webui).
+ 1. Install the [ControlNet extension](https://github.com/Mikubill/sd-webui-controlnet).
+ 1. Download [Picasso Diffusion 1.1](https://huggingface.co/alfredplpl/picasso-diffusion-1-1/blob/main/v1-1.safetensors).
+ 1. Move it into the folder: models -> Stable-diffusion.
+ 1. Download the [model](control_picasso11_openpose.ckpt) and the [config](control_picasso11_openpose.yaml).
+ 1. Move them into the folder: extensions -> sd-webui-controlnet -> models.
+ 1. Use it in the Web UI with one of the sample poses.
+
+ ## Examples
+
+ ![girl_1](girl_1.png)
+ ```
+ anime, a girl
+ Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 3038848424, Size: 768x768, Model hash: 5eb33121a0, Model: picasso_diffusion, ControlNet Enabled: True, ControlNet Module: none, ControlNet Model: control_picasso11_openpose [80042ed5], ControlNet Weight: 1, ControlNet Guidance Strength: 1
+ ```
+ ![pose_1](pose_1.png)
+
+
+
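The file-placement steps in the README can be sketched in Python. This is a minimal sketch, not part of the commit: the function name `install_files` and the `webui_root` argument are hypothetical, the files are assumed to be already downloaded, and the folder layout is taken from the README steps, reading the ControlNet folder as `extensions`.

```python
from pathlib import Path
import shutil


def install_files(webui_root, base_model, controlnet_files):
    """Copy downloaded files into the Web UI folders named in the README.

    webui_root:       path of the Web UI checkout (assumed by this sketch)
    base_model:       path of the downloaded v1-1.safetensors
    controlnet_files: paths of the downloaded .ckpt and .yaml
    """
    root = Path(webui_root)
    sd_dir = root / "models" / "Stable-diffusion"
    cn_dir = root / "extensions" / "sd-webui-controlnet" / "models"
    sd_dir.mkdir(parents=True, exist_ok=True)
    cn_dir.mkdir(parents=True, exist_ok=True)

    # shutil.copy2 returns the destination path of the copied file.
    placed = [Path(shutil.copy2(base_model, sd_dir))]
    for src in controlnet_files:
        placed.append(Path(shutil.copy2(src, cn_dir)))
    return placed
```

A call might look like `install_files("stable-diffusion-webui", "v1-1.safetensors", ["control_picasso11_openpose.ckpt", "control_picasso11_openpose.yaml"])`. Copying rather than moving keeps the downloads intact, at the cost of disk space for the roughly 3.3 GB checkpoint.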
control_picasso11_openpose.ckpt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4845d4d1faea6d754beb110514f8e615d5ca0e6b4812eeb3b3bd475139d6ce75
+ size 3336262761
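The three added lines are a Git LFS pointer: the repository stores this small text file, while the checkpoint itself lives in LFS storage. As a minimal sketch, the pointer can be read as space-separated key/value lines. The helper name `parse_lfs_pointer` is hypothetical and handles only the three keys seen here.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:4845d4d1faea6d754beb110514f8e615d5ca0e6b4812eeb3b3bd475139d6ce75
size 3336262761
"""

info = parse_lfs_pointer(POINTER)
print(info["hash_algo"], info["size"])   # sha256 3336262761
print(f"{info['size'] / 1e9:.2f} GB")    # 3.34 GB
```

After downloading the checkpoint, comparing its SHA-256 digest and byte size against these two values is a simple way to confirm the file arrived intact.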
control_picasso11_openpose.yaml ADDED
@@ -0,0 +1,85 @@
+ model:
+   target: cldm.cldm.ControlLDM
+   params:
+     linear_start: 0.00085
+     linear_end: 0.0120
+     num_timesteps_cond: 1
+     log_every_t: 200
+     timesteps: 1000
+     first_stage_key: "jpg"
+     cond_stage_key: "txt"
+     control_key: "hint"
+     image_size: 64
+     channels: 4
+     cond_stage_trainable: false
+     conditioning_key: crossattn
+     monitor: val/loss_simple_ema
+     scale_factor: 0.18215
+     use_ema: False
+     only_mid_control: False
+
+     control_stage_config:
+       target: cldm.cldm.ControlNet
+       params:
+         use_checkpoint: True
+         image_size: 32 # unused
+         in_channels: 4
+         hint_channels: 3
+         model_channels: 320
+         attention_resolutions: [ 4, 2, 1 ]
+         num_res_blocks: 2
+         channel_mult: [ 1, 2, 4, 4 ]
+         num_head_channels: 64 # need to fix for flash-attn
+         use_spatial_transformer: True
+         use_linear_in_transformer: True
+         transformer_depth: 1
+         context_dim: 1024
+         legacy: False
+
+     unet_config:
+       target: cldm.cldm.ControlledUnetModel
+       params:
+         use_checkpoint: True
+         image_size: 32 # unused
+         in_channels: 4
+         out_channels: 4
+         model_channels: 320
+         attention_resolutions: [ 4, 2, 1 ]
+         num_res_blocks: 2
+         channel_mult: [ 1, 2, 4, 4 ]
+         num_head_channels: 64 # need to fix for flash-attn
+         use_spatial_transformer: True
+         use_linear_in_transformer: True
+         transformer_depth: 1
+         context_dim: 1024
+         legacy: False
+
+     first_stage_config:
+       target: ldm.models.autoencoder.AutoencoderKL
+       params:
+         embed_dim: 4
+         monitor: val/rec_loss
+         ddconfig:
+           #attn_type: "vanilla-xformers"
+           double_z: true
+           z_channels: 4
+           resolution: 256
+           in_channels: 3
+           out_ch: 3
+           ch: 128
+           ch_mult:
+           - 1
+           - 2
+           - 4
+           - 4
+           num_res_blocks: 2
+           attn_resolutions: []
+           dropout: 0.0
+         lossconfig:
+           target: torch.nn.Identity
+
+     cond_stage_config:
+       target: ldm.modules.encoders.modules.FrozenOpenCLIPEmbedder
+       params:
+         freeze: True
+         layer: "penultimate"
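A few quantities implied by this config can be worked out with simple arithmetic. The sketch below rests on two assumptions about the code base the config targets, neither of which is stated in the commit: the autoencoder halves the resolution once for each `ch_mult` entry after the first, and each UNet level uses its channel width divided by `num_head_channels` attention heads. The function names are hypothetical.

```python
# Values copied from the config above.
MODEL_CHANNELS = 320
CHANNEL_MULT = [1, 2, 4, 4]
NUM_HEAD_CHANNELS = 64
VAE_CH_MULT = [1, 2, 4, 4]


def vae_downsample_factor(ch_mult):
    """Assumed: one 2x downsampling between consecutive ch_mult levels."""
    return 2 ** (len(ch_mult) - 1)


def heads_per_level(model_channels, channel_mult, num_head_channels):
    """Assumed: heads = level channel width // channels per head."""
    return [model_channels * m // num_head_channels for m in channel_mult]


def latent_size(pixels, factor):
    """Side length of the latent for an image side of `pixels`."""
    return pixels // factor


factor = vae_downsample_factor(VAE_CH_MULT)
print(factor)                                                        # 8
print(heads_per_level(MODEL_CHANNELS, CHANNEL_MULT, NUM_HEAD_CHANNELS))  # [5, 10, 20, 20]
print(latent_size(768, factor))                                      # 96
```

Under these assumptions, the 768x768 example in the README corresponds to a 96x96 latent with 4 channels, and `image_size: 64` corresponds to 512-pixel images.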
girl_1.png ADDED

Git LFS Details

  • SHA256: e5344c8dc5b0276869a1e26960113baf75189a5982d279eaeff1f856e6d43671
  • Pointer size: 131 Bytes
  • Size of remote file: 341 kB
pose_1.png ADDED

Git LFS Details

  • SHA256: 0b80c8ea23b298fa91fb343539ba146017b18b5d1cc0b3fb03b32f92502b2507
  • Pointer size: 131 Bytes
  • Size of remote file: 234 kB
sample_poses/bone_0.png ADDED

Git LFS Details

  • SHA256: 0b80c8ea23b298fa91fb343539ba146017b18b5d1cc0b3fb03b32f92502b2507
  • Pointer size: 131 Bytes
  • Size of remote file: 234 kB
sample_poses/bone_1.png ADDED

Git LFS Details

  • SHA256: 43365c8bd95d016dd121f595058998a031a1908fbae8561c7d64d3a933f88109
  • Pointer size: 131 Bytes
  • Size of remote file: 232 kB
sample_poses/bone_10.png ADDED

Git LFS Details

  • SHA256: d37bada96c67d24ce3a90e4acd53698e039b4e238bee9bf90d080f6f8800b24c
  • Pointer size: 131 Bytes
  • Size of remote file: 196 kB
sample_poses/bone_11.png ADDED

Git LFS Details

  • SHA256: 6db06aea7445dac0a2ae7e6d16495a6015826ce032ec63e89e10747d90e80ea5
  • Pointer size: 131 Bytes
  • Size of remote file: 226 kB
sample_poses/bone_12.png ADDED

Git LFS Details

  • SHA256: e00d0791324dcfd199d254e81f95c0d79cd50119d89481f420fd8584cfe0d292
  • Pointer size: 131 Bytes
  • Size of remote file: 201 kB
sample_poses/bone_13.png ADDED

Git LFS Details

  • SHA256: 5cabbaf46adaabc17f611fb75cf0a9addecddb9f35a64588b4727912d25f1542
  • Pointer size: 131 Bytes
  • Size of remote file: 228 kB
sample_poses/bone_14.png ADDED

Git LFS Details

  • SHA256: 3578323189cc1bdeb3e3dd00706ff42762acfe3b784c8f90490c10c22380d0da
  • Pointer size: 131 Bytes
  • Size of remote file: 236 kB
sample_poses/bone_15.png ADDED

Git LFS Details

  • SHA256: b04a7f970a4f23554d63129ffef1c19a9fb02b14ea48013d500a9110c92dae43
  • Pointer size: 131 Bytes
  • Size of remote file: 195 kB
sample_poses/bone_16.png ADDED

Git LFS Details

  • SHA256: 4262c1266d6552e410895a89668ffdff65331fa0a035aa6964d967bb84d981e0
  • Pointer size: 131 Bytes
  • Size of remote file: 212 kB
sample_poses/bone_17.png ADDED

Git LFS Details

  • SHA256: 2991a9c13aea861e506c244d6ce7291e29c5018f794dad8adcccfe4733bba1cf
  • Pointer size: 131 Bytes
  • Size of remote file: 201 kB
sample_poses/bone_18.png ADDED

Git LFS Details

  • SHA256: 00c7dd7252d958bc3f2522541d039aa3a16655537778a2a3c2b94c592e3c79c7
  • Pointer size: 131 Bytes
  • Size of remote file: 215 kB
sample_poses/bone_19.png ADDED

Git LFS Details

  • SHA256: c4455c29066286471da4014598663b2c047e0360bb89dd1b95bb0086d8e3dd41
  • Pointer size: 131 Bytes
  • Size of remote file: 202 kB
sample_poses/bone_2.png ADDED

Git LFS Details

  • SHA256: dfc136002a18ebb3d6760e59a9ad9c3e25784d9d56224644ac47aa09213dd05a
  • Pointer size: 131 Bytes
  • Size of remote file: 224 kB
sample_poses/bone_20.png ADDED

Git LFS Details

  • SHA256: 1f6a6f0fbce991efe573b9819616e7913418e46748d057b36892e9a595f615e0
  • Pointer size: 131 Bytes
  • Size of remote file: 187 kB
sample_poses/bone_21.png ADDED

Git LFS Details

  • SHA256: f3235f52aff50e7fe41f7d8aa86035994e3fffeb44a37cac73db86953abb61fa
  • Pointer size: 131 Bytes
  • Size of remote file: 198 kB
sample_poses/bone_22.png ADDED

Git LFS Details

  • SHA256: d2e1c35c84913588bc213eaeb9fae80ee203a9e0acc107fcf78415074ffd38f7
  • Pointer size: 131 Bytes
  • Size of remote file: 195 kB
sample_poses/bone_23.png ADDED

Git LFS Details

  • SHA256: e537c5cfecdf3424a2ff702f8ede2a043feca7001de6fc298b9c23f7f0527b71
  • Pointer size: 131 Bytes
  • Size of remote file: 201 kB
sample_poses/bone_24.png ADDED

Git LFS Details

  • SHA256: 7754f6bf12fe1bc5d12e733541db0412b05d9a5d495dcaf3c142fea9fc688b7d
  • Pointer size: 131 Bytes
  • Size of remote file: 198 kB
sample_poses/bone_25.png ADDED

Git LFS Details

  • SHA256: e40dbd2b9fd8483e7aea5efaf9e4b168bb8fdfa1f8fa923b386e8ec54449fdf3
  • Pointer size: 131 Bytes
  • Size of remote file: 196 kB
sample_poses/bone_26.png ADDED

Git LFS Details

  • SHA256: 7269b0f188eba8c87930f5e541dd1daaffc3f33fc1eef7f64bc3a2cf92d61294
  • Pointer size: 131 Bytes
  • Size of remote file: 204 kB
sample_poses/bone_27.png ADDED

Git LFS Details

  • SHA256: fd68a784537b5cd8182aa0592dda3dbb220fe9c126b832ac7fc0e55ae403a17b
  • Pointer size: 131 Bytes
  • Size of remote file: 204 kB
sample_poses/bone_28.png ADDED

Git LFS Details

  • SHA256: e0820c0030312ebab7ad37297fcdb7b727bddaeb407b92a73eae7b4e61431d40
  • Pointer size: 131 Bytes
  • Size of remote file: 186 kB
sample_poses/bone_29.png ADDED

Git LFS Details

  • SHA256: 6768783ecc6384f15ee36f252771d84cacd3c1821d84a1ad8ada9aafdb153fb8
  • Pointer size: 131 Bytes
  • Size of remote file: 189 kB
sample_poses/bone_3.png ADDED

Git LFS Details

  • SHA256: acbbccb2b053716006737346e53772a07230babab9025fdfe308f061a300e024
  • Pointer size: 131 Bytes
  • Size of remote file: 214 kB
sample_poses/bone_30.png ADDED

Git LFS Details

  • SHA256: adabfdfb674cb276e417d244178ad369098de7a2d68f60275d089bf9f6a180ac
  • Pointer size: 131 Bytes
  • Size of remote file: 190 kB
sample_poses/bone_31.png ADDED

Git LFS Details

  • SHA256: a40033f8c51481f0b855f74c4e92b785799d4a0794fb4a85d9d3cae551091093
  • Pointer size: 131 Bytes
  • Size of remote file: 199 kB
sample_poses/bone_32.png ADDED

Git LFS Details

  • SHA256: 91a78c4870fb89537cea30ab596e41ba68bc7bd6a9f78cbb53a85cc651cfd414
  • Pointer size: 131 Bytes
  • Size of remote file: 183 kB
sample_poses/bone_33.png ADDED

Git LFS Details

  • SHA256: c7d9dcc3f1890774e26369927306a2b7a4a8a07b8152bd97ec3b8579afe8bb16
  • Pointer size: 131 Bytes
  • Size of remote file: 192 kB
sample_poses/bone_34.png ADDED

Git LFS Details

  • SHA256: 4bd1a520ede61048138c6fa0a429c92ebf2744128e3832cd0561bb94fa06c020
  • Pointer size: 131 Bytes
  • Size of remote file: 179 kB
sample_poses/bone_35.png ADDED

Git LFS Details

  • SHA256: 818a2f964d4c1cbae127200acf0ac05bf2ee1468a755664e0ef9ce5e8835e980
  • Pointer size: 131 Bytes
  • Size of remote file: 203 kB
sample_poses/bone_4.png ADDED

Git LFS Details

  • SHA256: 8a356848c683a13b0d472710e31947dae65ecf8bbbdb76a538bc375b81136b1b
  • Pointer size: 131 Bytes
  • Size of remote file: 205 kB
sample_poses/bone_5.png ADDED

Git LFS Details

  • SHA256: eba4dd94657eeae549ae85a3be2ece393e5c904c860fa6ad52c084d7000b3353
  • Pointer size: 131 Bytes
  • Size of remote file: 215 kB
sample_poses/bone_6.png ADDED

Git LFS Details

  • SHA256: b4cb3c41986110a2e56215f1608ec37b2ff08ad91756a81532a96eab92975fd6
  • Pointer size: 131 Bytes
  • Size of remote file: 227 kB
sample_poses/bone_7.png ADDED

Git LFS Details

  • SHA256: 9841a99605a37b34c7408a182085be7cd3bf0c9d8f7a13eaf5338dc37cd81754
  • Pointer size: 131 Bytes
  • Size of remote file: 228 kB
sample_poses/bone_8.png ADDED

Git LFS Details

  • SHA256: 4e14137566af88197b77feab604aaaec4be275fb99919fe2f1681552c2cdb125
  • Pointer size: 131 Bytes
  • Size of remote file: 211 kB
sample_poses/bone_9.png ADDED

Git LFS Details

  • SHA256: 299fc2ee1f67cc565fccefea0a128345290281ea899b3c60e4a08a02dd578550
  • Pointer size: 131 Bytes
  • Size of remote file: 210 kB