MiniLLM
/

Pretrain-Qwen-500M

t1101675 committed 11 days ago

Commit e1c8e76 • 1 Parent(s): 701e8a5

Update README.md

Files changed (1)

README.md +1 -1

README.md CHANGED

@@ -15,7 +15,7 @@ pipeline_tag: text-generation
 [paper](https://arxiv.org/abs/2410.17215) | [code](https://github.com/thu-coai/MiniPLM)
-**Pretrain-Qwen-500M** is a 500M model with QWen achitecture conventionally pre-trained from scratch on [the Pile](https://huggingface.co/datasets/monology/pile-uncopyrighted) for 50B tokens.
+**Pretrain-Qwen-500M** is a 500M model with Qwen achitecture conventionally pre-trained from scratch on [the Pile](https://huggingface.co/datasets/monology/pile-uncopyrighted) for 50B tokens.
 We also open-source the tokenized [pre-training corpus](https://huggingface.co/datasets/MiniLLM/pile-tokenized) for reproducibility.