Taka008 committed on
Commit cd3823f
1 parent: ebcfd7b

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -146,6 +146,8 @@ We evaluated the models using 100 examples from the dev split.
 
  ### Japanese MT Bench
 
+ We evaluated the models using `gpt-4-0613`. Please see the [codes](https://github.com/llm-jp/llm-leaderboard/tree/main) for details.
+
  | Model name | average | coding | extraction | humanities | math | reasoning | roleplay | stem | writing |
  | :--- | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: |
  | [llm-jp-3-1.8b-instruct](https://huggingface.co/llm-jp/llm-jp-3-1.8b-instruct) | 4.93 | 1.50 | 4.70 | 7.80 | 1.55 | 2.60 | 7.80 | 6.10 | 7.40 |
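
As a quick sanity check on the table in this hunk, the `average` column appears to be the unweighted mean of the eight category scores, rounded to two decimals. That is an inference from the numbers rather than something the README states, and the helper name `mt_bench_average` is hypothetical. A minimal sketch under those assumptions:

```python
from statistics import mean

# Category scores copied from the llm-jp-3-1.8b-instruct row of the table.
category_scores = {
    "coding": 1.50,
    "extraction": 4.70,
    "humanities": 7.80,
    "math": 1.55,
    "reasoning": 2.60,
    "roleplay": 7.80,
    "stem": 6.10,
    "writing": 7.40,
}


def mt_bench_average(scores):
    """Unweighted mean of per-category scores, rounded to two decimals.

    Assumption: this rounding and weighting convention is inferred from the
    table values, not taken from the evaluation code.
    """
    return round(mean(scores.values()), 2)


average = mt_bench_average(category_scores)
print(average)  # compare with the `average` column (4.93)
```

If a recomputed value disagrees with the reported one, the evaluation code linked in the added README line is the place to check how the average is actually computed.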