Ai-tensa committed on
Commit
789b60c
1 Parent(s): 5682327

model upload

Files changed (3)
  1. .gitattributes +1 -0
  2. README.md +19 -0
  3. mistVAE.safetensors +3 -0
.gitattributes CHANGED
@@ -19,6 +19,7 @@
  *.pb filter=lfs diff=lfs merge=lfs -text
  *.pickle filter=lfs diff=lfs merge=lfs -text
  *.pkl filter=lfs diff=lfs merge=lfs -text
+ *.png filter=lfs diff=lfs merge=lfs -text
  *.pt filter=lfs diff=lfs merge=lfs -text
  *.pth filter=lfs diff=lfs merge=lfs -text
  *.rar filter=lfs diff=lfs merge=lfs -text
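As a rough illustration of what the added `*.png` rule does, the sketch below checks file names against the patterns visible in this hunk. It uses Python's `fnmatch` module, which only approximates Git's own attribute matching; that approximation is an assumption that should hold for simple `*.ext` patterns. `LFS_PATTERNS` and `is_lfs_tracked` are illustrative names, not part of Git or of this repository.

```python
from fnmatch import fnmatchcase

# Patterns copied from the .gitattributes hunk shown here (state after this commit).
# The real file contains more rules outside the hunk; they are not reproduced.
LFS_PATTERNS = ["*.pb", "*.pickle", "*.pkl", "*.png", "*.pt", "*.pth", "*.rar"]


def is_lfs_tracked(filename: str, patterns=LFS_PATTERNS) -> bool:
    """Return True if the bare file name matches any of the given patterns."""
    return any(fnmatchcase(filename, pattern) for pattern in patterns)


# The sample image referenced by the README now falls under the new rule.
print(is_lfs_tracked("xyz_grid-0011-1798392412.png"))
print(is_lfs_tracked("README.md"))
```

On a real checkout, `git check-attr filter -- <path>` gives the authoritative answer.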
README.md CHANGED
@@ -1,3 +1,22 @@
  ---
  license: mit
  ---
+
+ # VAE for high-resolution image generation with Stable Diffusion
+
+ This VAE is trained by adding only one step of noise to the latent and denoising it with the U-Net, so that the VAE does not become oversensitive to the latent.
+ When generating at high resolution, this reduces the tendency to render some objects, such as plants and eyes, in far more detail than their surroundings.
+ The dataset consists of 19k images tagged nijijourneyv5 and published on the web, and was denoised using [models trained on the same dataset](https://huggingface.co/Ai-tensa/FlexWaifu).
+
+ ## sample
+
+ ![](xyz_grid-0011-1798392412.png)
+
+ ## training details
+
+ - 19k images
+ - 2 epochs
+ - Aspect Ratio Bucketing based on 768p resolution
+ - multires noise
+ - lr: 1e-5
+ - precision: fp32
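The "one step of noise" idea described in the README can be sketched with the standard DDPM forward step and its inverse. This is a minimal illustration under stated assumptions, not the author's training code: the value of `alpha_bar`, the function names, the toy latent, and the use of the true noise as a stand-in for the U-Net's prediction are all hypothetical.

```python
import math
import random


def add_noise(latent, noise, alpha_bar):
    """DDPM forward step: z_t = sqrt(alpha_bar) * z_0 + sqrt(1 - alpha_bar) * eps."""
    a, b = math.sqrt(alpha_bar), math.sqrt(1.0 - alpha_bar)
    return [a * z + b * e for z, e in zip(latent, noise)]


def denoise(noisy, predicted_noise, alpha_bar):
    """Invert the forward step given a noise prediction (for example from a U-Net)."""
    a, b = math.sqrt(alpha_bar), math.sqrt(1.0 - alpha_bar)
    return [(zt - b * e) / a for zt, e in zip(noisy, predicted_noise)]


random.seed(0)
alpha_bar = 0.999  # assumed: a low noise level, i.e. an early timestep
latent = [random.gauss(0.0, 1.0) for _ in range(8)]  # stand-in for an encoded image
noise = [random.gauss(0.0, 1.0) for _ in range(8)]

noisy = add_noise(latent, noise, alpha_bar)

# With a perfect noise prediction the original latent is recovered.
# A real U-Net predicts the noise only approximately, so the denoised latent
# differs slightly from the original; presumably that perturbed latent is
# what the VAE sees during training.
restored = denoise(noisy, noise, alpha_bar)
```

Passing the true noise as the prediction only demonstrates that the two steps are inverses; in an actual pipeline the second argument of `denoise` would come from the diffusion model.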
mistVAE.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f6dbafc61e1e6751dbe2f91a1729d562f4e55cd90b00d7a2bd9aa1aa8f024ded
+ size 334640988
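The three added lines are a Git LFS pointer: they record the SHA-256 digest and byte size of the real weights file rather than its contents. A downloaded copy can be checked against the pointer with a short script. The sketch below is one way to do that; `parse_pointer` and `matches_pointer` are hypothetical helper names, and the local file path in the usage note is an assumption.

```python
import hashlib

# Pointer contents copied from the diff.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:f6dbafc61e1e6751dbe2f91a1729d562f4e55cd90b00d7a2bd9aa1aa8f024ded
size 334640988
"""


def parse_pointer(text):
    """Split each 'key value' line of an LFS pointer into a small dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Hash the file in chunks and compare its digest and size with the pointer."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]
```

After downloading the file, `matches_pointer("mistVAE.safetensors", parse_pointer(POINTER_TEXT))` should return `True` for an intact copy, assuming the file sits in the current directory.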