Upload folder using huggingface_hub (#2)
Browse files- 0777c343c8f766d0106dc1a78c525ca31adf806cb77434246319e9653c319750 (fede7a8222a6b5ea41715a1e719f9cc9aa084c05)
- a830529ec745def5596eee3148666b9a548dfbf9ae63e988fc9f6d32e06d6fc5 (823aefe0a3116f5992221c5f5f4b3bf2e340c36b)
- README.md +5 -2
- config.json +1 -1
- model/optimized_model.pkl +2 -2
- model/smash_config.json +1 -1
- plots.png +0 -0
README.md
CHANGED
@@ -35,7 +35,10 @@ metrics:
|
|
35 |
|
36 |
![image info](./plots.png)
|
37 |
|
38 |
-
|
|
|
|
|
|
|
39 |
|
40 |
## Setup
|
41 |
|
@@ -75,7 +78,7 @@ The configuration info are in `config.json`.
|
|
75 |
|
76 |
## License
|
77 |
|
78 |
-
We follow the same license as the original model. Please check the license of the original model segmind
|
79 |
|
80 |
## Want to compress other models?
|
81 |
|
|
|
35 |
|
36 |
![image info](./plots.png)
|
37 |
|
38 |
+
**Important remarks:**
|
39 |
+
- The quality of the model output might slightly vary compared to the base model. There might be minimal quality loss.
|
40 |
+
- These results were obtained on NVIDIA A100-PCIE-40GB with configuration described in config.json and are obtained after a hardware warmup. Efficiency results may vary in other settings (e.g. other hardware, image size, batch size, ...).
|
41 |
+
- You can request premium access to more compression methods and tech support for your specific use-cases [here](https://z0halsaff74.typeform.com/pruna-access?typeform-source=www.pruna.ai).
|
42 |
|
43 |
## Setup
|
44 |
|
|
|
78 |
|
79 |
## License
|
80 |
|
81 |
+
We follow the same license as the original model. Please check the license of the original model segmind/Segmind-Vega before using this model.
|
82 |
|
83 |
## Want to compress other models?
|
84 |
|
config.json
CHANGED
@@ -1 +1 @@
|
|
1 |
-
{"pruners": "None", "pruning_ratio": 0.0, "factorizers": "None", "quantizers": "None", "n_quantization_bits": 32, "output_deviation": 0.005, "compilers": "['diffusers2']", "static_batch": true, "static_shape": true, "controlnet": "None", "unet_dim": 4, "device": "cuda", "save_dir": "/ceph/hdd/staff/charpent/models/.models/optimized_model", "batch_size": 1, "max_batch_size": 1, "image_height": 1024, "image_width": 1024, "version": "xl-1.0", "scheduler": "DDIM", "task": "txt2imgxl", "model_name": "segmind/Segmind-Vega", "weight_name": "None", "save_load_fn": "stable_fast"}
|
|
|
1 |
+
{"pruners": "None", "pruning_ratio": 0.0, "factorizers": "None", "quantizers": "None", "n_quantization_bits": 32, "output_deviation": 0.005, "compilers": "['diffusers2', 'tiling', 'step_caching']", "static_batch": true, "static_shape": true, "controlnet": "None", "unet_dim": 4, "device": "cuda", "save_dir": "/ceph/hdd/staff/charpent/models/.models/optimized_model", "batch_size": 1, "max_batch_size": 1, "image_height": 1024, "image_width": 1024, "version": "xl-1.0", "scheduler": "DDIM", "task": "txt2imgxl", "model_name": "segmind/Segmind-Vega", "weight_name": "None", "save_load_fn": "stable_fast"}
|
model/optimized_model.pkl
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5b2b07806c98261ee281b888cedcc3e2e0aae3495540657bc653b5b4f71ad37c
|
3 |
+
size 3298149510
|
model/smash_config.json
CHANGED
@@ -1 +1 @@
|
|
1 |
-
{"api_key": "pruna_c4c77860c62a2965f6bc281841ee1d7bd3", "verify_url": "http://johnrachwan.pythonanywhere.com", "smash_config": {"pruners": "None", "pruning_ratio": 0.0, "factorizers": "None", "quantizers": "None", "n_quantization_bits": 32, "output_deviation": 0.005, "compilers": "['diffusers2']", "static_batch": true, "static_shape": true, "controlnet": "None", "unet_dim": 4, "device": "cuda", "cache_dir": "/ceph/hdd/staff/charpent/.cache/models", "save_dir": "/ceph/hdd/staff/charpent/models/.models/optimized_model", "batch_size": 1, "max_batch_size": 1, "image_height": 1024, "image_width": 1024, "version": "xl-1.0", "scheduler": "DDIM", "task": "txt2imgxl", "model_name": "segmind/Segmind-Vega", "weight_name": "None", "save_load_fn": "stable_fast"}}
|
|
|
1 |
+
{"api_key": "pruna_c4c77860c62a2965f6bc281841ee1d7bd3", "verify_url": "http://johnrachwan.pythonanywhere.com", "smash_config": {"pruners": "None", "pruning_ratio": 0.0, "factorizers": "None", "quantizers": "None", "n_quantization_bits": 32, "output_deviation": 0.005, "compilers": "['diffusers2', 'tiling', 'step_caching']", "static_batch": true, "static_shape": true, "controlnet": "None", "unet_dim": 4, "device": "cuda", "cache_dir": "/ceph/hdd/staff/charpent/.cache/models", "save_dir": "/ceph/hdd/staff/charpent/models/.models/optimized_model", "batch_size": 1, "max_batch_size": 1, "image_height": 1024, "image_width": 1024, "version": "xl-1.0", "scheduler": "DDIM", "task": "txt2imgxl", "model_name": "segmind/Segmind-Vega", "weight_name": "None", "save_load_fn": "stable_fast"}}
|
plots.png
CHANGED