Update README.md
Browse files
README.md
CHANGED
@@ -12,6 +12,6 @@ language:
|
|
12 |
|
13 |
Finetuned from [apple/OpenELM-270M](https://huggingface.co/apple/OpenELM-270M):
|
14 |
|
15 |
-
* Extended
|
16 |
* Continual pre-trained with a mix of [bigscience-data/roots_zh-tw_wikipedia](https://huggingface.co/datasets/bigscience-data/roots_zh-tw_wikipedia) and [bigscience-data/roots_en_wikipedia](https://huggingface.co/datasets/bigscience-data/roots_en_wikipedia).
|
17 |
* Evaluation ppl = 1.6644828403646825 (split 3% training data as evaluation set)
|
|
|
12 |
|
13 |
Finetuned from [apple/OpenELM-270M](https://huggingface.co/apple/OpenELM-270M):
|
14 |
|
15 |
+
* Extended vocabulary from 32000 to 75873 with sentencepiece bpe trained on [bigscience-data/roots_zh-tw_wikipedia](https://huggingface.co/datasets/bigscience-data/roots_zh-tw_wikipedia) and used average embedding to initialize the new embeddings.
|
16 |
* Continual pre-trained with a mix of [bigscience-data/roots_zh-tw_wikipedia](https://huggingface.co/datasets/bigscience-data/roots_zh-tw_wikipedia) and [bigscience-data/roots_en_wikipedia](https://huggingface.co/datasets/bigscience-data/roots_en_wikipedia).
|
17 |
* Evaluation ppl = 1.6644828403646825 (split 3% training data as evaluation set)
|