liswei commited on
Commit
cb51eb9
1 Parent(s): d290234

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -12,6 +12,6 @@ language:
12
 
13
  Finetuned from [apple/OpenELM-270M](https://huggingface.co/apple/OpenELM-270M):
14
 
15
- * Extended tokenizer with ~30K Chinese vocabs trained on [bigscience-data/roots_zh-tw_wikipedia](https://huggingface.co/datasets/bigscience-data/roots_zh-tw_wikipedia).
16
  * Continual pre-trained with a mix of [bigscience-data/roots_zh-tw_wikipedia](https://huggingface.co/datasets/bigscience-data/roots_zh-tw_wikipedia) and [bigscience-data/roots_en_wikipedia](https://huggingface.co/datasets/bigscience-data/roots_en_wikipedia).
17
  * Evaluation ppl = 1.6644828403646825 (split 3% training data as evaluation set)
 
12
 
13
  Finetuned from [apple/OpenELM-270M](https://huggingface.co/apple/OpenELM-270M):
14
 
15
+ * Extended vocabulary from 32000 to 75873 with sentencepiece bpe trained on [bigscience-data/roots_zh-tw_wikipedia](https://huggingface.co/datasets/bigscience-data/roots_zh-tw_wikipedia) and used average embedding to initialize the new embeddings.
16
  * Continual pre-trained with a mix of [bigscience-data/roots_zh-tw_wikipedia](https://huggingface.co/datasets/bigscience-data/roots_zh-tw_wikipedia) and [bigscience-data/roots_en_wikipedia](https://huggingface.co/datasets/bigscience-data/roots_en_wikipedia).
17
  * Evaluation ppl = 1.6644828403646825 (split 3% training data as evaluation set)