English
Chinese
Zhoues commited on
Commit
22f02bf
1 Parent(s): 5257882

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +33 -0
README.md CHANGED
@@ -1,3 +1,36 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ datasets:
4
+ - Zhoues/Goal-Drift-Dataset
5
+ language:
6
+ - en
7
+ - zh
8
  ---
9
+
10
+ # Model Card for VAR (Visual AutoRegressive) Transformers 🔥
11
+
12
+ <!-- Provide a quick summary of what the model is/does. -->
13
+
14
+ [![arXiv](https://img.shields.io/badge/arXiv%20papr-2403.12037-b31b1b.svg)](https://arxiv.org/abs/2403.12037)
15
+
16
+ [![project page](https://img.shields.io/badge/Play%20with%20MineDreamer%21-MineDreamer%20project%20page-lightblue)](https://sites.google.com/view/minedreamer/main)
17
+
18
+ MineDreamer is an instructable embodied agent for simulated control and it is developed on top of recent advances in Multimodal Large Language Models (MLLMs) and diffusion models!
19
+
20
+
21
+
22
+ <p align="center">
23
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/63f08dc79cf89c9ed1bb89cd/S62I1Tn5qz5qJ3IkgMHH8.png" width=93%>
24
+ <p>
25
+
26
+ MineDreamer can follow instructions steadily by employing a Chain-of-Imagination (CoI) mechanism to envision the step-by-step process of executing instructions and translating imaginations into more precise visual prompts tailored to the current state; subsequently, it generates keyboard-and-mouse actions to efficiently achieve these imaginations,
27
+
28
+
29
+ <p align="center">
30
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/63f08dc79cf89c9ed1bb89cd/LJxBMChCFng_RkXwUotfk.png" width=93%>
31
+ <p>
32
+
33
+
34
+ **This repo is used for hosting MineDreamer's InstructPix2Pix checkpoints, which is not only the baseline checkpoints but the training stage 2 checkpoints for Imaginator as well.**
35
+
36
+ For more details or tutorials see https://github.com/Zhoues/MineDreamer.