HuanjinYao commited on
Commit
bdb645c
·
verified ·
1 Parent(s): 4ec53c9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +24 -3
README.md CHANGED
@@ -1,3 +1,24 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+
5
+
6
+ # Mulberry
7
+
8
+ Mulberry-llava-8b is a step-by-step reasoning model trained on the Mulberry-260K SFT dataset, which was generated through collective knowledge search using CoMCTS.
9
+
10
+ For reasoning inference, please refer to our GitHub.
11
+
12
+ **Paper**: https://arxiv.org/abs/2412.18319
13
+
14
+ **Code**: https://github.com/HJYao00/Mulberry
15
+
16
+
17
+ ## More Details
18
+ **Base Model**: https://huggingface.co/llava-hf/llama3-llava-next-8b-hf
19
+
20
+ **Training Framework**: LLaMA-Factory
21
+
22
+ **Hardware**: 8x NVIDIA H100
23
+
24
+