TencentARC
/

DI-PCG

Model card Files Files and versions Community

thuzhaowang commited on 9 days ago

Commit

600036d

•

1 Parent(s): 34205ea

Upload 7 files

Browse files

upload checkpoints

Files changed (7) hide show

README.md +98 -3
basket.pt +3 -0
chair.pt +3 -0
dandelion.pt +3 -0
flower.pt +3 -0
table.pt +3 -0
vase.pt +3 -0

README.md CHANGED Viewed

@@ -1,3 +1,98 @@
----
-license: apache-2.0
----

+<div align="center">
+# DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation
+<a href="https://thuzhaowang.github.io/projects/DI-PCG"><img src="https://img.shields.io/static/v1?label=Project%20Page&message=Github&color=blue&logo=github-pages"></a>&ensp;<a href=""><img src="https://img.shields.io/badge/ArXiv-2404.07191-brightgreen"></a>&ensp;<a href="https://huggingface.co/TencentARC/DI-PCG"><img src="https://img.shields.io/badge/%F0%9F%A4%97%20Model_Card-Huggingface-orange"></a>&ensp;<a href="https://huggingface.co/spaces/TencentARC/DI-PCG"><img src="https://img.shields.io/badge/%F0%9F%A4%97%20Gradio%20Demo-Huggingface-orange"></a><br>
+**[Wang Zhao<sup>1</sup>](https://thuzhaowang.github.io), [Yan-Pei Cao<sup>2</sup>](https://yanpei.me/), [Jiale Xu<sup>1</sup>](https://bluestyle97.github.io/),  [Yuejiang Dong<sup>1,3</sup>](https://scholar.google.com.hk/citations?user=0i7bPj8AAAAJ&hl=zh-CN), [Ying Shan<sup>1</sup>](https://scholar.google.com/citations?user=4oXBp9UAAAAJ&hl=en)**
+<sup>1</sup>ARC Lab, Tencent PCG &ensp;&ensp;<sup>2</sup>VAST &ensp;&ensp;<sup>3</sup>Tsinghua University
+</div>
+---
+## 🚩 Overview
+This repository contains code release for our technical report "DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation".
+<p align="center">
+  <img src="misc/teaser.png" >
+</p>
+## ⚙️ Installation
+First clone this repository with [Infinigen](https://github.com/princeton-vl/infinigen) as the submodule:
+```
+git clone -r https://github.com/TencentARC/DI-PCG.git
+cd DI-PCG
+git submodule update --init --recursive
+```
+We recommend using anaconda to install the dependencies:
+```
+conda create -n di-pcg python=3.10.14
+conda activate di-pcg
+conda install pytorch==2.4.0 torchvision==0.19.0 torchaudio==2.4.0  pytorch-cuda=11.8 -c pytorch -c nvidia
+pip install -r requirements.txt
+```
+## 🚀 Usage
+For a quick start, try the huggingface gradio demo [here](https://huggingface.co/spaces/TencentARC/DI-PCG).
+### Download models
+We provide the pretrained diffusion models for chair, vase, table, basket, flower and dandelion. You can download them from [model card]() and put them in `./pretrained_models/`.
+Alternatively, the inference script will automatically download the pretrained models for you.
+### Local gradio demo
+To run the gradio demo locally, run:
+```
+python app.py
+```
+### Inference
+To run the inference demo, simply use:
+```
+python ./scripts/sample_diffusion.py --config ./configs/demo/chair_demo.yaml
+```
+This script processes all the chair images in the `./examples/chair` folder and saves the generated 3D models and their rendered images in `./logs`.
+To generate other categories, use the corresponding YAML config file such as `vase_demo.yaml`. Currently we supprt `chair`, `table`, `vase`, `basket`, `flower` and `dandelion` generators developped by [Infinigen](https://github.com/princeton-vl/infinigen).
+```
+python ./scripts/sample_diffusion.py --config ./configs/demo/vase_demo.yaml
+```
+### Training
+We train a diffusion model for each procedural generator. The training data is generated by randomly sampling the PCG and render multi-view images. To prepare the training data, run:
+```
+python ./scripts/prepare_data.py --generator ChairFactory --save_root /path/to/save/training/data
+```
+Replace `ChairFactory` with other category options as detailed in the `./scripts/prepare_data.py` file. This script also conducts offline augmentation and saves the extracted DINOv2 features for each image, which may consume a lot of disk storage. You can adjust the number of the generated data and the render configurations accordingly.
+After generating the training data, start the training by:
+```
+python ./scripts/train_diffusion.py --config ./configs/train/chair_train.yaml
+```
+### Use your own PCG
+DI-PCG is general for any procedural generator. To train a diffusion model for your PCG, you need to implement the `get_params_dict`, `update_params`, `spawn_assets`, `finalize_assets` functions and place your PCG in `./core/assets/`. Also change the `num_params` in your training YAML config file.
+If you have any question, feel free to open an issue or contact us.
+## :books: Citation
+If you find our work useful for your research or applications, please cite using this BibTeX:
+```BibTeX
+@article{zhao2024dipcg,
+  title={DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation},
+  author={Zhao, Wang and Cao, Yanpei and Xu, Jiale and Dong, Yuejiang and Shan, Ying},
+  journal={arXiv preprint },
+  year={2024}
+}
+```
+## 🤗 Acknowledgements
+DI-PCG is built on top of some awesome open-source projects: [Infinigen](https://github.com/princeton-vl/infinigen), [Fast-DiT](https://github.com/chuanyangjin/fast-DiT). We sincerely thank them all.

basket.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c6b583ff2eac05c38a879182fd50b08cdcf40cf60f2e16f4f967f76550c49900
+size 122190864

chair.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:cb0459af69b78ec7262510a204eb80e848e06ba259e0669548f059c43c101c8e
+size 122243152

dandelion.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7f9f7050580c8265671afef0ede6c14fbc514abf7f9566c4047f9ba6bb2d4785
+size 122192464

flower.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6f7d27d0844ea52f668ea86fd2bd958764a51b934383bcdc85bcae19eed556d6
+size 122183248

table.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:938c9e1f2643584fe61d42aca77c86d10e95ee09d2f54ed798c2920349ce528e
+size 122198608

vase.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:02ef09f8c3113ea0371e47d5ea87c5c66442aeb2208447e70a567bc93b2510cc
+size 122187856