Commit 8f254d7 by omitakahiro
Parent: 50d6f09
Update README.md
README.md CHANGED
@@ -75,12 +75,12 @@ GitHub: https://github.com/ku-nlp/ja-vicuna-qa-benchmark
 
 | model | time [s] for genrating 100 characters in Japanese |
 |:---:|:---:|
-|stockmark-100b-instruct
+|stockmark-100b-instruct| 1.86 |
 | gpt-3.5-turbo | 2.15 |
 | gpt-4-turbo | 5.48 |
-|tokyotech-llm/Swallow-70b-instruct-hf
+|tokyotech-llm/Swallow-70b-instruct-hf| 2.22 |
 
-
+For local LLMs, we measured the inference time using AWS Inferentia2.
 
 ## License
 [MIT](https://opensource.org/licenses/MIT)
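
Note on the table above: the README reports seconds per 100 generated Japanese characters but does not say how the figure was normalized. As a minimal sketch only, assuming the measurement is the wall-clock time of a single generation call scaled by the output length, the calculation could look like the following. The names `seconds_per_100_chars` and `fake_generate` are hypothetical and do not come from the repository.

```python
import time
from typing import Callable


def seconds_per_100_chars(
    generate: Callable[[str], str],
    prompt: str,
    clock: Callable[[], float] = time.perf_counter,
) -> float:
    """Time one generate(prompt) call and scale the result to 100 output characters.

    Assumption: latency is normalized linearly by output length.
    """
    start = clock()
    text = generate(prompt)
    elapsed = clock() - start
    if not text:
        raise ValueError("model returned no text; cannot normalize")
    return elapsed * 100 / len(text)


def fake_generate(prompt: str) -> str:
    # Hypothetical stand-in for a real model call; always returns 50 characters.
    return "あ" * 50


if __name__ == "__main__":
    value = seconds_per_100_chars(fake_generate, "こんにちは")
    print(f"{value:.6f} s per 100 characters")
```

Replacing `fake_generate` with a real model call (an API client or a locally hosted model) would give a comparable number; averaging over several prompts would reduce noise.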