MediaTek-Research/Breeze-7B-Instruct-v0_1


YC-Chen committed on Jan 10

Commit 519cc77 • 1 Parent(s): 363db8f

Update README.md

Files changed (1)

README.md +11 -0


@@ -43,6 +43,17 @@ and is comparable with Mistral-7B-Instruct-v0.1 on MMLU and MT-Bench in English.
 \* Few-shot learning cannot effectively guide the model to generate the proper answer.
+| Models on 5 shot TMMLU+                             | STEM         | Social Science | Humanities | Other      |
+|-----------------------------------------------------|--------------|----------------|------------|------------|
+| MediaTek-Research/Breeze-7B-Base-v0.1               | 35.74        | 46.08          | 40.29      | 40.35      |
+| mistralai/Mistral-7B-v0.1                           |              |                |            |            |
+| yentinglin/Taiwan-LLM-7B-v2.1-base                  |              |                |            |            |
+| yentinglin/Taiwan-LLM-13B-v2.0-base                 |              |                |            |            |
+| 01-ai/Yi-6B                                         |              |                |            |            |
+| 01-ai/Yi-34B                                        |              |                |            |            |
+| Qwen/Qwen-7B                                        |              |                |            |            |
+| Qwen/Qwen-14B                                       |              |                |            |            |
 ## Inference Performance
 | Models                                                             | Speed (char/sec)  |Max Input Length (TC Char)|