Abstract
Breeze-7B is an open-source language model based on Mistral-7B, designed to address the need for improved language comprehension and chatbot-oriented capabilities in Traditional Chinese. This technical report provides an overview of the additional pretraining, finetuning, and evaluation stages for the Breeze-7B model. The Breeze-7B family of base and chat models exhibits good performance on language comprehension and chatbot-oriented tasks, reaching the top in several benchmarks among models comparable in its complexity class.
Models citing this paper 19
Browse 19 models citing this paperDatasets citing this paper 0
No dataset linking this paper
Cite arxiv.org/abs/2403.02712 in a dataset README.md to link it from this page.
Spaces citing this paper 12
Collections including this paper 0
No Collection including this paper
Add this paper to a
collection
to link it from this page.