stockmark/stockmark-100b-instruct-v0.1

omitakahiro committed on May 15

Commit 8f254d7 (1 parent: 50d6f09)

Update README.md

Files changed (1)

README.md +3 -3

@@ -75,12 +75,12 @@ GitHub: https://github.com/ku-nlp/ja-vicuna-qa-benchmark
 | model | time [s] for genrating 100 characters in Japanese |
 |:---:|:---:|
-|stockmark-100b-instruct[^2]| 1.86 |
+|stockmark-100b-instruct| 1.86 |
 | gpt-3.5-turbo | 2.15 |
 | gpt-4-turbo | 5.48 |
-|tokyotech-llm/Swallow-70b-instruct-hf[^2]| 2.22 |
-[^2]: We measured the time using AWS Inferentia2.
+|tokyotech-llm/Swallow-70b-instruct-hf| 2.22 |
+For local LLMs, we measured the inference time using AWS Inferentia2.
 ## License
 [MIT](https://opensource.org/licenses/MIT)