--- title: README emoji: ๐Ÿš€ colorFrom: blue colorTo: gray sdk: static pinned: true --- # ๐Ÿ™ Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations Project Page: [https://mathoctopus.github.io/](https://mathoctopus.github.io/) Paper: [https://arxiv.org/abs/2310.20246.pdf](https://arxiv.org/abs/2310.20246.pdf) Code: [https://github.com/microsoft/MathOctopus](https://github.com/microsoft/MathOctopus) ## About Mathoctopus Mathoctopus is a series of multilingual math reasoning large language models based on LLaMA. This work pioneers exploring and building powerful Multilingual Math Reasoning (xMR) LLMs. To accomplish this, we make the following works: - **MGSM8KInstruct**, the multilingual math reasoning instruction dataset, encompassing ten distinct languages, thus addressing the issue of training data scarcity in xMR tasks. - **MSVAMP**, an out-of-domain xMR test dataset, to conduct a more exhaustive and comprehensive evaluation of the modelโ€™s multilingual mathematical capabilities. - **MathOctopus**, our effective Multilingual Math Reasoning LLMs, training with different strategies, which notably outperform conventional open-source LLMs and exhibit superiority over ChatGPT in few-shot scenarios.