YC-Chen commited on
Commit
132e81b
1 Parent(s): 643ea52

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -58,10 +58,11 @@ and is comparable with Mistral-7B-Instruct-v0.1 on MMLU and MT-Bench in English.
58
  |--------------------------------------------|--------|--------------|--------------|-----------|-------------|--------|------------|------------|------------------|
59
  | | |TC, Knowledge |TC, Knowledge |TC, Reasoning|TC, Reasoning|TC, Chat |EN, Knowledge|EN, Knowledge|EN, Chat |
60
  | | | 0 shot | 5 shot | 3 shot | 0 shot | 0 shot | 0 shot | 5 shot | 0 shot |
61
- | gpt-3.5-turbo | | 41.76 | | | | 7.1 | 70.0 | | 7.9 |
62
  | [Yi-34B-Chat](https://huggingface.co/01-ai/Yi-34B-Chat) | 34B | 54.87 | | | 36.81 | 6.9 | 71.04 | | 7.6 |
63
  | [Qwen-14B-Chat](https://huggingface.co/Qwen/Qwen-14B-Chat) | 14B | 48.41 | | | 41.67 | 6.4 | 64.91 | | 7.2 |
64
  | [Yi-6B-Chat](https://huggingface.co/01-ai/Yi-6B-Chat) | 6B | 44.79 | | | 25.69 | 5.0 | 59.45 | | 6.0 |
 
65
  | [**Breeze-7B-Instruct-v0.1**](https://huggingface.co/MediaTek-Research/Breeze-7B-Instruct-v0.1) | 7B | 41.61 | | | 45.83 | 5.7 | 63.26 | | 7.1 |
66
  | [**Breeze-7B-Instruct-64k-v0.1**](https://huggingface.co/MediaTek-Research/Breeze-7B-Instruct-64k-v0.1) | 7B | 40.99 | | | 36.11 | 5.5 | 63.68 | | 7.1 |
67
  | [Qwen-7B-Chat](https://huggingface.co/Qwen/Qwen-7B-Chat) | 7B | 40.02 | | | 33.33 | 5.4 | 55.94 | | 6.2 |
@@ -73,10 +74,10 @@ and is comparable with Mistral-7B-Instruct-v0.1 on MMLU and MT-Bench in English.
73
 
74
  | Category ACC of TMMLU+ (0 shot) | STEM | Social Science | Humanities | Other |
75
  |-----------------------------------------------------|--------------|----------------|------------|------------|
76
- | gpt-3.5-turbo | 41.56 | 46.72 | 36.73 | 42.03 |
77
  | Yi-34B-Chat | 47.65 | 64.25 | 52.73 | 54.91 |
78
  | Qwen-14B-Chat | 43.83 | 55.00 | 48.55 | 46.22 |
79
  | Yi-6B-Chat | 37.80 | 51.74 | 45.36 | 44.25 |
 
80
  | **Breeze-7B-Instruct-v0.1** | 37.41 | 46.81 | 42.06 | 40.16 |
81
  | **Breeze-7B-Instruct-64k-v0.1** | 37.88 | 46.35 | 40.31 | 39.40 |
82
  | Qwen-7B-Chat | 35.44 | 46.22 | 38.35 | 40.06 |
 
58
  |--------------------------------------------|--------|--------------|--------------|-----------|-------------|--------|------------|------------|------------------|
59
  | | |TC, Knowledge |TC, Knowledge |TC, Reasoning|TC, Reasoning|TC, Chat |EN, Knowledge|EN, Knowledge|EN, Chat |
60
  | | | 0 shot | 5 shot | 3 shot | 0 shot | 0 shot | 0 shot | 5 shot | 0 shot |
61
+
62
  | [Yi-34B-Chat](https://huggingface.co/01-ai/Yi-34B-Chat) | 34B | 54.87 | | | 36.81 | 6.9 | 71.04 | | 7.6 |
63
  | [Qwen-14B-Chat](https://huggingface.co/Qwen/Qwen-14B-Chat) | 14B | 48.41 | | | 41.67 | 6.4 | 64.91 | | 7.2 |
64
  | [Yi-6B-Chat](https://huggingface.co/01-ai/Yi-6B-Chat) | 6B | 44.79 | | | 25.69 | 5.0 | 59.45 | | 6.0 |
65
+ | gpt-3.5-turbo | | 41.76 | | | | 7.1 | 70.00 | | 7.9 |
66
  | [**Breeze-7B-Instruct-v0.1**](https://huggingface.co/MediaTek-Research/Breeze-7B-Instruct-v0.1) | 7B | 41.61 | | | 45.83 | 5.7 | 63.26 | | 7.1 |
67
  | [**Breeze-7B-Instruct-64k-v0.1**](https://huggingface.co/MediaTek-Research/Breeze-7B-Instruct-64k-v0.1) | 7B | 40.99 | | | 36.11 | 5.5 | 63.68 | | 7.1 |
68
  | [Qwen-7B-Chat](https://huggingface.co/Qwen/Qwen-7B-Chat) | 7B | 40.02 | | | 33.33 | 5.4 | 55.94 | | 6.2 |
 
74
 
75
  | Category ACC of TMMLU+ (0 shot) | STEM | Social Science | Humanities | Other |
76
  |-----------------------------------------------------|--------------|----------------|------------|------------|
 
77
  | Yi-34B-Chat | 47.65 | 64.25 | 52.73 | 54.91 |
78
  | Qwen-14B-Chat | 43.83 | 55.00 | 48.55 | 46.22 |
79
  | Yi-6B-Chat | 37.80 | 51.74 | 45.36 | 44.25 |
80
+ | gpt-3.5-turbo | 41.56 | 46.72 | 36.73 | 42.03 |
81
  | **Breeze-7B-Instruct-v0.1** | 37.41 | 46.81 | 42.06 | 40.16 |
82
  | **Breeze-7B-Instruct-64k-v0.1** | 37.88 | 46.35 | 40.31 | 39.40 |
83
  | Qwen-7B-Chat | 35.44 | 46.22 | 38.35 | 40.06 |