Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,49 @@
|
|
1 |
-
---
|
2 |
-
license: mit
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: mit
|
3 |
+
datasets:
|
4 |
+
- ILSVRC/imagenet-1k
|
5 |
+
language:
|
6 |
+
- en
|
7 |
+
base_model:
|
8 |
+
- haoosz/BiGR
|
9 |
+
---
|
10 |
+
|
11 |
+
This is the official model release for the paper:
|
12 |
+
|
13 |
+
[**BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities**]()
|
14 |
+
|
15 |
+
Please download the pretrained weights for tokenizers and BiGR models to run our [codes](https://github.com/haoosz/BiGR).
|
16 |
+
|
17 |
+
### Binary Autoencoder
|
18 |
+
We train Binary Autoencoder (B-AE) by adapting the [official code](https://github.com/ZeWang95/BinaryLatentDiffusion) of [Binary Latent Diffusion](https://arxiv.org/abs/2304.04820). We provide pretrained weights for different configurations.
|
19 |
+
|
20 |
+
**256x256 resolution**
|
21 |
+
|
22 |
+
| B-AE | Size | Checkpoint |
|
23 |
+
| :---- | :---: | :----------: |
|
24 |
+
| d24 | 332M | [download](https://huggingface.co/haoosz/BiGR/resolve/main/bae/bae_d24/binaryae_ema_1000000.th?download=true) |
|
25 |
+
| d32 | 332M | [download](https://huggingface.co/haoosz/BiGR/resolve/main/bae/bae_d32/binaryae_ema_950000.th?download=true) |
|
26 |
+
|
27 |
+
**512x512 resolution**
|
28 |
+
|
29 |
+
| B-AE | Size | Checkpoint |
|
30 |
+
| :------- | :---: | :----------: |
|
31 |
+
| d32-512 | 315M | [download](https://huggingface.co/haoosz/BiGR/resolve/main/bae/bae_d32_512/binaryae_ema_720000.th?download=true) |
|
32 |
+
|
33 |
+
### BiGR models ✨
|
34 |
+
We provide pretrained weights for BiGR models in various sizes.
|
35 |
+
|
36 |
+
**256x256 resolution**
|
37 |
+
|
38 |
+
| Model | B-AE | Size | Checkpoint |
|
39 |
+
| :----------------- | :---: | :---: | :---------: |
|
40 |
+
| BiGR-L-d24 | d24 | 1.35G | [download](https://huggingface.co/haoosz/BiGR/resolve/main/gpt/bigr_L_d24.pt?download=true) |
|
41 |
+
| BiGR-XL-d24 | d24 | 3.20G | [download](https://huggingface.co/haoosz/BiGR/resolve/main/gpt/bigr_XL_d24.pt?download=true) |
|
42 |
+
| BiGR-XXL-d24 | d24 | 5.92G | [download](https://huggingface.co/haoosz/BiGR/resolve/main/gpt/bigr_XXL_d24.pt?download=true) |
|
43 |
+
| BiGR-XXL-d32 | d32 | 5.92G | [download](https://huggingface.co/haoosz/BiGR/resolve/main/gpt/bigr_XXL_d32.pt?download=true) |
|
44 |
+
|
45 |
+
**512x512 resolution**
|
46 |
+
|
47 |
+
| Model | B-AE | Size | Checkpoint |
|
48 |
+
| :----------------- | :---------: | :---: | :--------: |
|
49 |
+
| BiGR-L-d32-res512 | d32-res512 | 1.49G | [download](https://huggingface.co/haoosz/BiGR/resolve/main/gpt/bigr_L_d32_512.pt?download=true) |
|