erikhenriksson
commited on
Commit
•
f0e6f1e
1
Parent(s):
066426a
Update README.md
Browse files
README.md
CHANGED
@@ -12,8 +12,8 @@ metrics:
|
|
12 |
# Web register classification (multilingual model)
|
13 |
|
14 |
A multilingual web register classifier, fine-tuned from XLM-RoBERTa-large.
|
15 |
-
The model is trained with the multilingual CORE corpora across five languages (English, Finnish, French, Swedish, Turkish) to classify documents based on the CORE taxonomy
|
16 |
-
It can predict labels for the 100 languages XLM-RoBERTa-large
|
17 |
It is designed to support the development of open language models and for linguists analyzing register variation.
|
18 |
|
19 |
## Model Details
|
|
|
12 |
# Web register classification (multilingual model)
|
13 |
|
14 |
A multilingual web register classifier, fine-tuned from XLM-RoBERTa-large.
|
15 |
+
The model is trained with the multilingual CORE corpora across five languages (English, Finnish, French, Swedish, Turkish) to classify documents based on the [CORE taxonomy](https://turkunlp.org/register-annotation-docs/).
|
16 |
+
It can predict labels for the 100 languages covered by XLM-RoBERTa-large. The model achieves state-of-the-art performance in classifying web registers for the trained languages and has strong transfer performance (see Evaluation below).
|
17 |
It is designed to support the development of open language models and for linguists analyzing register variation.
|
18 |
|
19 |
## Model Details
|