Update README.md
Browse files
README.md
CHANGED
@@ -88,7 +88,7 @@ and is comparable with Mistral-7B-Instruct-v0.1 on MMLU and MT-Bench in English.
|
|
88 |
In this test, we use the first 700 characters of the [web article](https://health.udn.com/health/story/5976/7699252?from=udn_ch1005_main_index) as the input and ask the model to write the same article again.
|
89 |
All models were inferenced with `vllm` on 2 NVIDIA RTX A6000 (TP=2).
|
90 |
|
91 |
-
| Models | Inference Time (sec)|Estimated Max Input Length (
|
92 |
|--------------------------------------------------------------------|-------------------|--------------------------|
|
93 |
| Yi-6B | 10.62 | 5.2k |
|
94 |
| **Breeze-7B-Instruct-v0.1** | 10.74 | 11.1k |
|
|
|
88 |
In this test, we use the first 700 characters of the [web article](https://health.udn.com/health/story/5976/7699252?from=udn_ch1005_main_index) as the input and ask the model to write the same article again.
|
89 |
All models were inferenced with `vllm` on 2 NVIDIA RTX A6000 (TP=2).
|
90 |
|
91 |
+
| Models | Inference Time (sec)|Estimated Max Input Length (Char)|
|
92 |
|--------------------------------------------------------------------|-------------------|--------------------------|
|
93 |
| Yi-6B | 10.62 | 5.2k |
|
94 |
| **Breeze-7B-Instruct-v0.1** | 10.74 | 11.1k |
|