Update README.md
README.md
CHANGED
@@ -26,7 +26,7 @@ and is comparable with Mistral-7B-Instruct-v0.1 on MMLU and MT-Bench in English.
 - **Model type:** Causal decoder-only transformer language model
 - **Language:** English and Traditional Chinese (zh-tw)
 
-##
+## Base Model Performance
 
 | Models | | TMMLU+ (ACC) | DRCD (EM) | MMLU (ACC) |
 |---------------------------------------------|--------|--------------|-----------|------------|
@@ -68,7 +68,7 @@ and is comparable with Mistral-7B-Instruct-v0.1 on MMLU and MT-Bench in English.
 | Qwen/Qwen-7B | | |
 | Qwen/Qwen-14B | | |
 
-##
+## Chat Model Performance
 
 | Models | | TMMLU+ (ACC) | DRCD (EM) | MT-Bench-tw (Score) | MMLU (ACC) | MT-Bench (Score) |
 |--------------------------------------------|--------|--------------|-----------|---------------------|------------|------------------|
@@ -96,6 +96,7 @@ and is comparable with Mistral-7B-Instruct-v0.1 on MMLU and MT-Bench in English.
 | 01-ai/Yi-34B-Chat | 46.36 | 65.02 | 52.84 | 52.21 |
 | gpt-3.5-turbo-1106 | | | | |
 
+## Examples
 
 
 