EIStakovskii
commited on
Commit
•
a9cc25a
1
Parent(s):
5e139d7
Update README.md
Browse files
README.md
CHANGED
@@ -27,3 +27,11 @@ widget:
|
|
27 |
|
28 |
license: other
|
29 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
27 |
|
28 |
license: other
|
29 |
---
|
30 |
+
|
31 |
+
This model is trained for multilingual toxicity labeling. Label_1 means TOXIC, Label_0 means NOT_TOXIC.
|
32 |
+
|
33 |
+
The model was fine-tuned based off the xlm_roberta_base model for 4 languages: EN, RU, FR, DE
|
34 |
+
|
35 |
+
The validation accuracy is 92%.
|
36 |
+
|
37 |
+
The model was finetuned on the total sum of 100933k sentences. The train data for English and Russian came from https://github.com/s-nlp/multilingual_detox, French data comprised the translated to French data from https://github.com/s-nlp/multilingual_detox as well as all the French data from the Jigsaw dataset, the German data was similarly composed using translations and semi-manual data collection techniquies.
|