indiejoseph
commited on
Commit
•
7147483
1
Parent(s):
8d5138e
Update README.md
Browse files
README.md
CHANGED
@@ -9,7 +9,7 @@ pipeline_tag: text-generation
|
|
9 |
|
10 |
# CantoneseLLM
|
11 |
|
12 |
-
This model is further pre-trained model based on [01-ai/Yi-6B](https://huggingface.co/01-ai/Yi-6B) with 800M tokens of Cantonese text compiled from various sources, including translated zh-yue Wikipedia, translated RTHK news[datasets/jed351/rthk_news](https://huggingface.co/datasets/jed351/rthk_news), Cantonese filtered CC100 and Cantonese textbooks generated by Gemini Pro.
|
13 |
|
14 |
This is a preview version, for experimental use only, we will use it to fine-tune on downstream tasks and evaluate the performance.
|
15 |
|
|
|
9 |
|
10 |
# CantoneseLLM
|
11 |
|
12 |
+
This model is further pre-trained model based on [01-ai/Yi-6B](https://huggingface.co/01-ai/Yi-6B) with 800M tokens of Cantonese text compiled from various sources, including translated zh-yue Wikipedia, translated RTHK news [datasets/jed351/rthk_news](https://huggingface.co/datasets/jed351/rthk_news), Cantonese filtered CC100 and Cantonese textbooks generated by Gemini Pro.
|
13 |
|
14 |
This is a preview version, for experimental use only, we will use it to fine-tune on downstream tasks and evaluate the performance.
|
15 |
|