HachiML/Llama-2-13b-hf-qlora-dolly-ja-2ep

HachiML committed on Aug 8, 2023

Commit bf12e52 • 1 Parent(s): 3b5e67e

Update README.md

Files changed (1)

README.md +8 -8

@@ -8,17 +8,17 @@ language:
 ---
 ## JGLUE Score
 I evaluated this model using the following JGLUE tasks. Here are the scores:
-| Task                | Llama-2-7b-hf     | Llama-2-13b-hf    | This Model |
-|---------------------|:-----------------:|:-----------------:|:----------:|
-| JCOMMONSENSEQA(acc) | 51.56             | 75.06             | 75.78      |
-| JNLI(acc)           | 29.74             | 22.18             | 50.69      |
-| MARC_JA(acc)        | 85.72             | -             | 79.64      |
-| JSQUAD(exact_match) | 64.16             | 76.13             | 62.83      |
-| **Average**         | **57.79**         | **-**         | **67.23**  |
 - Note: Use v0.3 prompt template
 - The JGLUE scores were measured using the following script:
 [Stability-AI/lm-evaluation-harness](https://github.com/Stability-AI/lm-evaluation-harness/tree/jp-stable)
-- (*) Refer to the following article: [Google Colab での JP Language Model Evaluation Harness による日本語LLMの評価手順](https://note.com/npaka/n/nedf4dacd4037)
 ## How to use

 ---
 ## JGLUE Score
 I evaluated this model using the following JGLUE tasks. Here are the scores:
+| Task                | Llama-2-13b-hf(*) | This Model |
+|---------------------|:-----------------:|:----------:|
+| JCOMMONSENSEQA(acc) | 75.06             | 75.78      |
+| JNLI(acc)           | 22.18             | 50.69      |
+| MARC_JA(acc)        | 38.83             | 79.64      |
+| JSQUAD(exact_match) | 76.13             | 62.83      |
+| **Average**         | **53.05**         | **67.23**  |
 - Note: Use v0.3 prompt template
 - The JGLUE scores were measured using the following script:
 [Stability-AI/lm-evaluation-harness](https://github.com/Stability-AI/lm-evaluation-harness/tree/jp-stable)
+- (*) A similar method was used to measure these scores.
 ## How to use
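
Review note on the hunk above: the **Average** row in the updated table can be sanity-checked against the four task scores. The sketch below assumes the average is a plain unweighted mean of the four tasks; the README does not state how it was computed, so treat that as an assumption. The names `scores` and `mean_score` are illustrative only, and the numbers are copied from the added (`+`) table rows.

```python
# Scores copied from the updated table in README.md (the "+" rows of the diff).
scores = {
    "Llama-2-13b-hf": {
        "JCOMMONSENSEQA": 75.06,
        "JNLI": 22.18,
        "MARC_JA": 38.83,
        "JSQUAD": 76.13,
    },
    "This Model": {
        "JCOMMONSENSEQA": 75.78,
        "JNLI": 50.69,
        "MARC_JA": 79.64,
        "JSQUAD": 62.83,
    },
}


def mean_score(task_scores):
    """Unweighted mean over tasks (assumed; the README does not specify the method)."""
    return sum(task_scores.values()) / len(task_scores)


averages = {model: mean_score(per_task) for model, per_task in scores.items()}

for model, avg in averages.items():
    print(f"{model}: {avg:.3f}")
```

Under that assumption the baseline mean matches the reported 53.05, and this model's mean comes out at about 67.235, which agrees with the reported 67.23 up to rounding in the last digit.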