Autoupdate README.md
README.md
CHANGED
@@ -119,10 +119,7 @@ The `ul2-small-dutch-english` T5 model was pre-trained simultaneously on a combi
 including the `full_en_nl` config of the "mc4_nl_cleaned" dataset, which is a cleaned version of Common Crawl's web
 crawl corpus, Dutch books, the Dutch subset of Wikipedia (2022-03-20), the English subset of Wikipedia (2022-03-01),
 and a subset of "mc4_nl_cleaned"
-containing only texts from Dutch
-towards descriptions of events in the Netherlands and Belgium.
-
-
+containing only texts from Dutch newspapers.
 
 ## Training procedure
 