EIStakovskii commited on
Commit
a9cc25a
1 Parent(s): 5e139d7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -0
README.md CHANGED
@@ -27,3 +27,11 @@ widget:
27
 
28
  license: other
29
  ---
 
 
 
 
 
 
 
 
 
27
 
28
  license: other
29
  ---
30
+
31
+ This model is trained for multilingual toxicity labeling. Label_1 means TOXIC, Label_0 means NOT_TOXIC.
32
+
33
+ The model was fine-tuned based off the xlm_roberta_base model for 4 languages: EN, RU, FR, DE
34
+
35
+ The validation accuracy is 92%.
36
+
37
+ The model was finetuned on the total sum of 100933k sentences. The train data for English and Russian came from https://github.com/s-nlp/multilingual_detox, French data comprised the translated to French data from https://github.com/s-nlp/multilingual_detox as well as all the French data from the Jigsaw dataset, the German data was similarly composed using translations and semi-manual data collection techniquies.