Update README.md
Browse files
README.md
CHANGED
@@ -28,16 +28,16 @@ and is comparable with Mistral-7B-Instruct-v0.1 on MMLU and MT-Bench in English.
|
|
28 |
|
29 |
## Base Model Performance
|
30 |
|
31 |
-
| Models | | TMMLU+ (ACC) | DRCD (EM) | MMLU (ACC) |
|
32 |
-
|
33 |
-
| | |TC, Knowledge |TC, Reasoning|EN, Knowledge|
|
34 |
-
| | | 5 shot | 3 shot | 5 shot |
|
35 |
-
| [Yi-34B](https://huggingface.co/01-ai/Yi-34B)| 34B | 63.10 | 84.57 | 77.42 |
|
36 |
-
| [Qwen-14B](https://huggingface.co/01-ai/Qwen/Qwen-14B)| 14B | 51.30 | 16.95 * | 68.83 |
|
37 |
-
| [Yi-6B](https://huggingface.co/01-ai/Yi-6B) | 6B | 49.63 | 76.61 | 65.35 |
|
38 |
-
| [Qwen-7B](https://huggingface.co/01-ai/Qwen/Qwen-7B)| 7B | 42.84 | 0.0 * | 61.00 |
|
39 |
-
| [**Breeze-7B-Base-v0.1**](https://huggingface.co/MediaTek-Research/Breeze-7B-Base-v0.1) | 7B | 40.35 | 81.13
|
40 |
-
| [Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1)| 7B | 36.93 | 79.27
|
41 |
|
42 |
|
43 |
\* Few-shot learning cannot effectively guide the model to generate the proper answer.
|
|
|
28 |
|
29 |
## Base Model Performance
|
30 |
|
31 |
+
| Models | | TMMLU+ (ACC) | DRCD (EM) | Table (ACC) | MMLU (ACC) |
|
32 |
+
|----------------------------------------------|--------|--------------|-------------|-------------|------------|
|
33 |
+
| | |TC, Knowledge |TC, Reasoning|TC, Reasoning|EN, Knowledge|
|
34 |
+
| | | 5 shot | 3 shot | 5 shot | 5 shot |
|
35 |
+
| [Yi-34B](https://huggingface.co/01-ai/Yi-34B)| 34B | 63.10 | 84.57 | 49.31 | 77.42 |
|
36 |
+
| [Qwen-14B](https://huggingface.co/01-ai/Qwen/Qwen-14B)| 14B | 51.30 | 16.95 * | 50.69 | 68.83 |
|
37 |
+
| [Yi-6B](https://huggingface.co/01-ai/Yi-6B) | 6B | 49.63 | 76.61 | 34.72 | 65.35 |
|
38 |
+
| [Qwen-7B](https://huggingface.co/01-ai/Qwen/Qwen-7B)| 7B | 42.84 | 0.0 * | 39.58 | 61.00 |
|
39 |
+
| [**Breeze-7B-Base-v0.1**](https://huggingface.co/MediaTek-Research/Breeze-7B-Base-v0.1) | 7B | 40.35 | 81.13 | 28.47 | 61.63 |
|
40 |
+
| [Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1)| 7B | 36.93 | 79.27 | 27.78 | 64.89 |
|
41 |
|
42 |
|
43 |
\* Few-shot learning cannot effectively guide the model to generate the proper answer.
|