---
tags:
- stable-diffusion
- stable-diffusion-xl
---

# Nekoray-XL-V0.7


## Model Card


NekoRay v0.7 is an SDXL checkpoint finetuned from SDXL 1.0 on 1.5M quality-tagged images from a selection of image sites. It aims to be a successor to Waifu Diffusion, giving the SD community more freedom in generation.

The project is a WIP, and further checkpoints with enhanced augmentations and more images are currently being developed.


Two models trained on the same dataset/hardware/hyperparameters are currently available:

- **[fp16mixed](https://huggingface.co/trojblue/nekoray-xl-fulldan-bench-1.5m/blob/main/nekoray-xl-1.5m-fp16mixed_e02.safetensors)**: 2 epochs, half precision

- **[32full](https://huggingface.co/trojblue/nekoray-xl-fulldan-bench-1.5m/blob/main/nekoray-xl-1.5m-pdg32_e02.safetensors)**: 1.7 epochs (still training), full precision
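
Either file can be loaded as a single-file SDXL checkpoint. The following is a minimal sketch, not an official loader: it assumes `torch`, `diffusers` and `huggingface_hub` are installed and that a CUDA GPU is available. The helper name `load_nekoray` and the choice of `float16` inference are illustrative assumptions; the repository and file names are taken from the links above.

```python
REPO_ID = "trojblue/nekoray-xl-fulldan-bench-1.5m"
CHECKPOINTS = {
    "fp16mixed": "nekoray-xl-1.5m-fp16mixed_e02.safetensors",
    "32full": "nekoray-xl-1.5m-pdg32_e02.safetensors",
}


def load_nekoray(variant="fp16mixed", device="cuda"):
    """Download one checkpoint and return an SDXL pipeline (hypothetical helper)."""
    # Heavy dependencies are imported lazily so that merely defining
    # this helper does not require them.
    import torch
    from diffusers import StableDiffusionXLPipeline
    from huggingface_hub import hf_hub_download

    # Fetch the .safetensors file into the local Hugging Face cache.
    local_path = hf_hub_download(repo_id=REPO_ID, filename=CHECKPOINTS[variant])

    # Assumption: run inference in float16, which also casts the
    # full-precision checkpoint down to half precision.
    pipe = StableDiffusionXLPipeline.from_single_file(
        local_path, torch_dtype=torch.float16
    )
    return pipe.to(device)
```

A call might look like `image = load_nekoray().images` after `pipe = load_nekoray()` and `pipe(prompt="1girl, solo", height=832, width=1216)`, where the prompt is only a placeholder and the resolution is one of the supported pairs from the Usage section.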


## Usage

It's recommended to use the *exact* resolutions listed below, since the original SDXL performs poorly outside of them. For prompts, danbooru-style tag captions are preferred.



We use the same aspect-ratio buckets as the original SDXL (aspect ratio = height / width):

| Height | Width | Aspect Ratio |
| ------ | ----- | ------------ |
| 512    | 2048  | 0.25         |
| 512    | 1984  | 0.26         |
| 512    | 1920  | 0.27         |
| 512    | 1856  | 0.28         |
| 576    | 1792  | 0.32         |
| 576    | 1728  | 0.33         |
| 576    | 1664  | 0.35         |
| 640    | 1600  | 0.4          |
| 640    | 1536  | 0.42         |
| 704    | 1472  | 0.48         |
| 704    | 1408  | 0.5          |
| 704    | 1344  | 0.52         |
| 768    | 1344  | 0.57         |
| 768    | 1280  | 0.6          |
| 832    | 1216  | 0.68         |
| 832    | 1152  | 0.72         |
| 896    | 1152  | 0.78         |
| 896    | 1088  | 0.82         |
| 960    | 1088  | 0.88         |
| 960    | 1024  | 0.94         |
| 1024   | 1024  | 1.0          |
| 1024   | 960   | ...          |
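
To pick a supported resolution for an arbitrary target size, one option is to choose the listed row whose aspect ratio is closest. The sketch below is only an illustration: the function name `nearest_bucket` and the log-ratio distance are assumptions, and the list contains only the rows shown in the table above (the table is truncated, so taller portrait sizes would need to be added by hand).

```python
import math

# (height, width) pairs copied from the rows listed in the table above.
TABLE_BUCKETS = [
    (512, 2048), (512, 1984), (512, 1920), (512, 1856),
    (576, 1792), (576, 1728), (576, 1664),
    (640, 1600), (640, 1536),
    (704, 1472), (704, 1408), (704, 1344),
    (768, 1344), (768, 1280),
    (832, 1216), (832, 1152),
    (896, 1152), (896, 1088),
    (960, 1088), (960, 1024),
    (1024, 1024), (1024, 960),
]


def nearest_bucket(height, width, buckets=TABLE_BUCKETS):
    """Return the (height, width) bucket closest in aspect ratio to the target."""
    target = math.log(height / width)
    # Compare in log space so that "twice as wide" and "twice as tall"
    # count as equally distant from a square image.
    return min(buckets, key=lambda hw: abs(math.log(hw[0] / hw[1]) - target))


if __name__ == "__main__":
    # A 16:9 landscape request (height=1080, width=1920).
    print(nearest_bucket(1080, 1920))
```

The returned pair can be passed directly as the `height` and `width` arguments of an SDXL pipeline call.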


The following rating keywords are appended to prompts to better separate genres:

```
'sensitive-rated', 'questionable-rated', 'explicit-rated'
```
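
As a small illustration of how such a prompt might be assembled, the sketch below joins danbooru-style tags and appends one rating keyword at the end. The helper name `build_prompt`, the comma separator, and the sample tags are assumptions, not part of the released tooling; only the three rating keywords come from the list above.

```python
RATING_TAGS = ("sensitive-rated", "questionable-rated", "explicit-rated")


def build_prompt(tags, rating=None):
    """Join danbooru-style tags and optionally append a rating keyword."""
    # Drop empty entries and stray whitespace around each tag.
    parts = [t.strip() for t in tags if t.strip()]
    if rating is not None:
        if rating not in RATING_TAGS:
            raise ValueError(f"unknown rating keyword: {rating!r}")
        # Assumption: the rating keyword goes last, after the content tags.
        parts.append(rating)
    return ", ".join(parts)


if __name__ == "__main__":
    print(build_prompt(["1girl", "solo", "outdoors"], rating="sensitive-rated"))
```

Passing no `rating` leaves the prompt unchanged apart from whitespace cleanup.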

The models are intended to be used as pretrained checkpoints, and further finetuning is **strongly recommended** for downstream use. For more information, see the 'Finetuning' section below.

## Finetuning

Finetuning on SDXL inherently provides better clarity and less 'noisiness' at higher resolutions compared to SD 1.4. We've had good results with further finetuning on various anime-related subject matter, including but not limited to:

- style finetunes (tested on nijijourney images & pvc/figure datasets)

- character finetunes (tested on 8 Blue Archive characters)

- concept finetunes (tested on nsfw gestures)

Actual samples of downstream finetunes will be posted once we get the finetuners' consent.

## License

The model (which is still very much a WIP) is intended to be used as a foundation for various downstream finetunes. The license is still under discussion, but it will generally follow the OpenRAIL-M agreement.