HuanjinYao
/

Mulberry_llava_8b

Model card Files Files and versions Community

HuanjinYao commited on 1 day ago

Commit

bdb645c

·

verified ·

1 Parent(s): 4ec53c9

Update README.md

Files changed (1) hide show

README.md +24 -3

README.md CHANGED Viewed

@@ -1,3 +1,24 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+---
+# Mulberry
+Mulberry-llava-8b is a step-by-step reasoning model trained on the Mulberry-260K SFT dataset, which was generated through collective knowledge search using CoMCTS.
+For reasoning inference, please refer to our GitHub.
+**Paper**: https://arxiv.org/abs/2412.18319
+**Code**: https://github.com/HJYao00/Mulberry
+## More Details
+**Base Model**: https://huggingface.co/llava-hf/llama3-llava-next-8b-hf
+**Training Framework**: LLaMA-Factory
+**Hardware**: 8x NVIDIA H100