Update README.md
Browse files
README.md
CHANGED
@@ -28,7 +28,7 @@ This repository provides large language models developed by the [Research and De
|
|
28 |
|
29 |
The development was partially supported by [GENIAC](https://www.meti.go.jp/policy/mono_info_service/geniac/index.html).
|
30 |
|
31 |
-
| Model
|
32 |
| :--- |
|
33 |
| [llm-jp-3-1.8b](https://huggingface.co/llm-jp/llm-jp-3-1.8b) |
|
34 |
| [llm-jp-3-1.8b-instruct](https://huggingface.co/llm-jp/llm-jp-3-1.8b-instruct) |
|
@@ -82,7 +82,6 @@ print(tokenizer.decode(output))
|
|
82 |
|3.7b|28|3072|24|4096|611,844,096|3,171,068,928|
|
83 |
|13b|40|5120|40|4096|1,019,740,160|12,688,184,320|
|
84 |
|
85 |
-
|
86 |
## Tokenizer
|
87 |
|
88 |
The tokenizer of this model is based on [huggingface/tokenizers](https://github.com/huggingface/tokenizers) Unigram byte-fallback model.
|
@@ -133,7 +132,7 @@ The models have been fine-tuned on the following datasets.
|
|
133 |
|
134 |
### llm-jp-eval (v1.3.1)
|
135 |
|
136 |
-
We evaluated using 100 examples from the dev split.
|
137 |
|
138 |
| Model name | average | EL | FA | HE | MC | MR | MT | NLI | QA | RC |
|
139 |
| :--- | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: |
|
@@ -172,4 +171,4 @@ llm-jp(at)nii.ac.jp
|
|
172 |
|
173 |
## Model Card Authors
|
174 |
|
175 |
-
Takashi Kodama.
|
|
|
28 |
|
29 |
The development was partially supported by [GENIAC](https://www.meti.go.jp/policy/mono_info_service/geniac/index.html).
|
30 |
|
31 |
+
| Model Variants |
|
32 |
| :--- |
|
33 |
| [llm-jp-3-1.8b](https://huggingface.co/llm-jp/llm-jp-3-1.8b) |
|
34 |
| [llm-jp-3-1.8b-instruct](https://huggingface.co/llm-jp/llm-jp-3-1.8b-instruct) |
|
|
|
82 |
|3.7b|28|3072|24|4096|611,844,096|3,171,068,928|
|
83 |
|13b|40|5120|40|4096|1,019,740,160|12,688,184,320|
|
84 |
|
|
|
85 |
## Tokenizer
|
86 |
|
87 |
The tokenizer of this model is based on [huggingface/tokenizers](https://github.com/huggingface/tokenizers) Unigram byte-fallback model.
|
|
|
132 |
|
133 |
### llm-jp-eval (v1.3.1)
|
134 |
|
135 |
+
We evaluated the models using 100 examples from the dev split.
|
136 |
|
137 |
| Model name | average | EL | FA | HE | MC | MR | MT | NLI | QA | RC |
|
138 |
| :--- | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: |
|
|
|
171 |
|
172 |
## Model Card Authors
|
173 |
|
174 |
+
Takashi Kodama.
|