cllatMTK committed on
Commit
6af287e
1 Parent(s): 78f5d60

Update README.md

Files changed (1)
  1. README.md +10 -10
README.md CHANGED
@@ -88,17 +88,17 @@ and is comparable with Mistral-7B-Instruct-v0.1 on MMLU and MT-Bench in English.
  In this test, we use the first 700 characters a [web article](https://health.udn.com/health/story/5976/7699252?from=udn_ch1005_main_index) as input and ask the model to rewrite the article.
  All models were inferenced with `vllm` on 2 A6000 (TP=2 ).
 
- | Models | Speed in char/sec (output token #) |Estimated Max Input Length (TC Char)|
+ | Models | Inference Time (sec)|Estimated Max Input Length (TC Char)|
  |--------------------------------------------------------------------|-------------------|--------------------------|
- | Yi-6B | 65.91 (631 token) | 4.4k |
- | **Breeze-7B-Instruct-v0.1** | 65.17 (518 token) | 10.1k |
- | **Breeze-7B-Instruct-64k-v0.1** | 65.17 (518 token) | 80.8k |
- | Qwen-7B | 64.45 (582 token) | 9.7k |
- | Qwen-14B | 37.05 (582 token) | 9.7k |
- | Mistral-7B-v0.1 | 34.17 (1110 token) | 6.3k |
- | Taiwan-LLM-7B-v2.1-base | 26.65 (1302 token) | 2.6k |
- | Taiwan-LLM-13B-v2.0-base | 19.02 (1302 token) | 2.6k |
- | Yi-34B | 16.01 (631 token) | 4.4k |
+ | Yi-6B | 10.62 | 5.2k |
+ | **Breeze-7B-Instruct-v0.1** | 10.74 | 11.1k |
+ | **Breeze-7B-Instruct-64k-v0.1** | 10.74 | 88.8k |
+ | Qwen-7B | 10.86 | 9.8k |
+ | Qwen-14B | 18.89 | 9.8k |
+ | Mistral-7B-v0.1 | 20.48 | 5.1k |
+ | Taiwan-LLM-7B-v2.1-base | 26.26 | 2.2k |
+ | Taiwan-LLM-13B-v2.0-base | 36.8 | 2.2k |
+ | Yi-34B | 43.71 | 4.5k |
 
  ## Examples
 
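Review note: the commit does not say how the removed "Speed in char/sec" column relates to the added "Inference Time (sec)" column. A quick consistency check suggests that each old speed multiplied by the corresponding new time comes out close to 700, the character count mentioned in the test description. The sketch below only checks that arithmetic. The name `ASSUMED_CHARS` and the reading that the old column was 700 characters divided by the inference time are assumptions, not something the commit states.

```python
# Values transcribed from the removed (-) and added (+) table rows in the diff.
# Each entry maps model -> (old speed in char/sec, new inference time in sec).
ROWS = {
    "Yi-6B": (65.91, 10.62),
    "Breeze-7B-Instruct-v0.1": (65.17, 10.74),
    "Breeze-7B-Instruct-64k-v0.1": (65.17, 10.74),
    "Qwen-7B": (64.45, 10.86),
    "Qwen-14B": (37.05, 18.89),
    "Mistral-7B-v0.1": (34.17, 20.48),
    "Taiwan-LLM-7B-v2.1-base": (26.65, 26.26),
    "Taiwan-LLM-13B-v2.0-base": (19.02, 36.8),
    "Yi-34B": (16.01, 43.71),
}

# Assumption: 700 is the character count from the test description. Treating it
# as the numerator behind the old speed column is an inference, not a stated fact.
ASSUMED_CHARS = 700


def implied_chars(speed_char_per_sec: float, time_sec: float) -> float:
    """Return the character count implied by a speed and a duration."""
    return speed_char_per_sec * time_sec


# Product of old speed and new time for every model in the table.
products = {
    model: implied_chars(speed, seconds)
    for model, (speed, seconds) in ROWS.items()
}

for model, chars in products.items():
    deviation = chars - ASSUMED_CHARS
    print(f"{model:32s} implied chars = {chars:8.2f} (deviation {deviation:+.2f})")
```

If the products stay near 700 for every row, the two columns likely carry the same measurement in different units, so the ranking of models is unchanged by this commit. The "Estimated Max Input Length" values do change and are not explained by this check.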