Zhoues
/

MineDreamer-InstructPix2Pix-Unet

Model card Files Files and versions Community

Zhoues commited on Apr 6

Commit

22f02bf

•

1 Parent(s): 5257882

Update README.md

Files changed (1) hide show

README.md +33 -0

README.md CHANGED Viewed

@@ -1,3 +1,36 @@
 ---
 license: apache-2.0
 ---

 ---
 license: apache-2.0
+datasets:
+- Zhoues/Goal-Drift-Dataset
+language:
+- en
+- zh
 ---
+# Model Card for VAR (Visual AutoRegressive) Transformers 🔥
+<!-- Provide a quick summary of what the model is/does. -->
+[![arXiv](https://img.shields.io/badge/arXiv%20papr-2403.12037-b31b1b.svg)](https://arxiv.org/abs/2403.12037)
+[![project page](https://img.shields.io/badge/Play%20with%20MineDreamer%21-MineDreamer%20project%20page-lightblue)](https://sites.google.com/view/minedreamer/main)
+MineDreamer is an instructable embodied agent for simulated control and it is developed on top of recent advances in Multimodal Large Language Models (MLLMs) and diffusion models!
+<p align="center">
+<img src="https://cdn-uploads.huggingface.co/production/uploads/63f08dc79cf89c9ed1bb89cd/S62I1Tn5qz5qJ3IkgMHH8.png" width=93%>
+<p>
+MineDreamer can follow instructions steadily by employing a Chain-of-Imagination (CoI) mechanism to envision the step-by-step process of executing instructions and translating imaginations into more precise visual prompts tailored to the current state; subsequently, it generates keyboard-and-mouse actions to efficiently achieve these imaginations,
+<p align="center">
+<img src="https://cdn-uploads.huggingface.co/production/uploads/63f08dc79cf89c9ed1bb89cd/LJxBMChCFng_RkXwUotfk.png" width=93%>
+<p>
+**This repo is used for hosting MineDreamer's InstructPix2Pix checkpoints, which is not only the baseline checkpoints but the training stage 2 checkpoints for Imaginator as well.**
+For more details or tutorials see https://github.com/Zhoues/MineDreamer.