Update README.md
README.md
CHANGED
@@ -12,7 +12,7 @@ widget:
 
 ## Model description
 
-This is the set of Chinese T5 Version 1.1 models pre-trained by [UER-py](https://github.com/dbiir/UER-py/), which is introduced in [this paper](https://arxiv.org/abs/1909.05658).
+This is the set of Chinese T5 Version 1.1 models pre-trained by [UER-py](https://github.com/dbiir/UER-py/), which is introduced in [this paper](https://arxiv.org/abs/1909.05658). The models can also be pre-trained with [TencentPretrain](https://github.com/Tencent/TencentPretrain), introduced in [this paper](https://arxiv.org/abs/2212.06385), which inherits UER-py to support models with over one billion parameters and extends it to a multimodal pre-training framework.
 
 **Version 1.1**
 
@@ -125,6 +125,14 @@ python3 scripts/convert_t5_from_uer_to_huggingface.py --input_model_path cluecor
   pages={241},
   year={2019}
 }
+
+@article{zhao2023tencentpretrain,
+  title={TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities},
+  author={Zhao, Zhe and Li, Yudong and Hou, Cheng and Zhao, Jing and others},
+  journal={ACL 2023},
+  pages={217},
+  year={2023}
+}
 ```
 
 [small]:https://huggingface.co/uer/t5-v1_1-small-chinese-cluecorpussmall
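For context, a minimal usage sketch (not part of the change above) showing how the small checkpoint linked in the model card could be loaded with Hugging Face transformers. It assumes the UER Chinese T5 checkpoints pair a BERT-style tokenizer with the standard T5 classes and that `extra0` serves as the masking sentinel; treat the prompt and generation settings as illustrative only.

```python
# Minimal sketch: load the Chinese T5 v1.1 small checkpoint referenced above
# and run a fill-in-the-blank generation. Assumes a BERT-style vocabulary
# (hence BertTokenizer) and that "extra0" is the sentinel token; both are
# assumptions about the UER checkpoints, not guarantees.
from transformers import BertTokenizer, T5ForConditionalGeneration, Text2TextGenerationPipeline

model_name = "uer/t5-v1_1-small-chinese-cluecorpussmall"
tokenizer = BertTokenizer.from_pretrained(model_name)
model = T5ForConditionalGeneration.from_pretrained(model_name)

# The pipeline handles tokenization, encoder-decoder generation, and decoding.
text2text = Text2TextGenerationPipeline(model=model, tokenizer=tokenizer)
print(text2text("中国的首都是extra0京", max_length=50, do_sample=False))
```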