File size: 2,110 Bytes
297973e 15f565a 297973e |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 |
# FaceMaker-V0
## News and Update π₯π₯π₯
- Dec.28, 2024. **[FaceMaker-V0](https://github.com/ddw2AIGROUP2CQUPT/Face-MakeUp), is released!πππ**
## Demo
<video controls autoplay src="https://cdn-uploads.huggingface.co/production/uploads/663f06e01cd68975883a353e/FERPHxBFOZXZtXtM_VhPw.mp4"></video>
## FaceCaptionHQ-4M
We constructed a large-scale facial image-text dataset for facial image generation task.
[![facecaption](assets/facecaption.png)](https://huggingface.co/datasets/OpenFace-CQUPT/FaceCaption-15M)
## Model
![Model](assets/Model.png)
## Results
**Unsplash-Face**
| Method | CLIP-T β | CLIP-I β | DINO β | FaceSim β | FID β | Attr_c β | VLM-score β |
| ----------------- | -------- | -------- | -------- | --------- | --------- | -------- | ----------- |
| Ip-Adapter.(2023) | 27.7 | 64.9 | 37.6 | 53.2 | 226.9 | 3.0 | 65.3 |
| PhotoMaker.(2023) | 28.2 | 56.5 | 26.2 | 20.7 | 224.4 | 2.2 | 60.1 |
| InstantID.(2024) | 24.8 | 78.0 | 49.4 | **71.2** | 178.7 | 3.8 | 54.8 |
| Pulid.(2024) | **29.3** | 46.3 | 21.0 | 24.3 | 284.5 | 2.4 | 36.5 |
| Ours | 22.3 | **82.1** | **73.2** | 69.2 | **130.1** | **4.0** | **79.6** |
**FaceCaption**
| Method | CLIP-T β | CLIP-I β | DINO β | FaceSim β | FID β | Attr_c β | VLM-score β |
| ----------------- | --------- | -------- | -------- | --------- | -------- | -------- | ----------- |
| Ip-Adapter.(2023) | 26.78 | 69.7 | 48.0 | 59.2 | 195.4 | 3.2 | 63.2 |
| PhotoMaker.(2023) | 28.12 | 50.5 | 25.9 | 22.1 | 237.6 | 2.2 | 54.5 |
| InstantID.(2024) | 24.29 | 67.2 | 50.1 | 75.5 | 166.5 | 5.3 | 53.7 |
| Pulid.(2024) | **29.21** | 36.2 | 13.2 | 22.8 | 298.5 | 2.1 | 43.5 |
| Ours | 21.96 | **87.4** | **79.4** | **77.8** | **95.4** | **6.3** | **73.1** |
|