Commit fd5e387 (parent: 9e6d94e)
cointegrated committed: add the training notebook
README.md CHANGED
@@ -60,7 +60,8 @@ print(text2toxicity(['я люблю нигеров', 'я люблю африка
 
 ## Training
 
-The model has been trained on the joint dataset of [OK ML Cup](https://cups.mail.ru/ru/tasks/1048) and [Babakov et.al.](https://arxiv.org/abs/2103.05345) with `Adam` optimizer, the learning rate of `1e-5`, and batch size of `64` for `15` epochs
+The model has been trained on the joint dataset of [OK ML Cup](https://cups.mail.ru/ru/tasks/1048) and [Babakov et al.](https://arxiv.org/abs/2103.05345) with the `Adam` optimizer, a learning rate of `1e-5`, and a batch size of `64` for `15` epochs in [this Colab notebook](https://colab.research.google.com/drive/1o7azO7cHttwofkp8eTZo9LIybYaNWei_?usp=sharing).
+A text was considered inappropriate if its inappropriateness score was higher than 0.8, and appropriate if it was lower than 0.2. The per-label ROC AUC on the dev set is:
 ```
 non-toxic : 0.9937
 insult : 0.9912
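
For readers who don't want to open the Colab notebook, here is a minimal sketch of the recipe the new README text describes: binarizing the continuous inappropriateness scores at the 0.8/0.2 thresholds, fine-tuning with `Adam` at a learning rate of `1e-5` and a batch size of `64` for `15` epochs, and reporting per-label ROC AUC on the dev set. The base checkpoint `cointegrated/rubert-tiny`, the two-entry label set, and the toy data below are assumptions for illustration only; the linked notebook is the authoritative version.

```python
# Hedged sketch only: base checkpoint, label set, and toy data are assumptions;
# the actual training code lives in the Colab notebook linked in the README.
import torch
from torch.utils.data import DataLoader, TensorDataset
from sklearn.metrics import roc_auc_score
from transformers import AutoTokenizer, AutoModelForSequenceClassification

LABELS = ["non-toxic", "insult"]   # assumed subset of the model's label set
BASE = "cointegrated/rubert-tiny"  # assumed base checkpoint

tokenizer = AutoTokenizer.from_pretrained(BASE)
model = AutoModelForSequenceClassification.from_pretrained(
    BASE,
    num_labels=len(LABELS),
    problem_type="multi_label_classification",  # per-label sigmoid + BCE loss
)

def binarize(pairs, hi=0.8, lo=0.2):
    """The README's thresholding rule for continuous inappropriateness scores:
    > 0.8 -> inappropriate (1.0), < 0.2 -> appropriate (0.0), otherwise dropped."""
    return ([(t, 1.0) for t, s in pairs if s > hi]
            + [(t, 0.0) for t, s in pairs if s < lo])

# Toy stand-ins for the joint OK ML Cup + Babakov et al. data (not included here).
train_texts = ["toxic example", "neutral example"]
train_labels = [[0.0, 1.0], [1.0, 0.0]]  # one column per entry in LABELS
dev_texts = ["another toxic example", "another neutral example"]
dev_labels = [[0.0, 1.0], [1.0, 0.0]]

enc = tokenizer(train_texts, padding=True, truncation=True,
                max_length=128, return_tensors="pt")
loader = DataLoader(
    TensorDataset(enc["input_ids"], enc["attention_mask"], torch.tensor(train_labels)),
    batch_size=64,  # batch size from the README
    shuffle=True,
)

optimizer = torch.optim.Adam(model.parameters(), lr=1e-5)  # optimizer and LR from the README
model.train()
for epoch in range(15):  # 15 epochs, as stated
    for input_ids, attention_mask, labels in loader:
        optimizer.zero_grad()
        loss = model(input_ids=input_ids, attention_mask=attention_mask,
                     labels=labels).loss
        loss.backward()
        optimizer.step()

# Per-label ROC AUC on the dev set, matching the README's table format.
model.eval()
with torch.no_grad():
    dev_enc = tokenizer(dev_texts, padding=True, truncation=True,
                        max_length=128, return_tensors="pt")
    probs = torch.sigmoid(model(**dev_enc).logits).numpy()
for i, name in enumerate(LABELS):
    true = [row[i] for row in dev_labels]
    print(f"{name:10}: {roc_auc_score(true, probs[:, i]):.4f}")
```

Setting `problem_type="multi_label_classification"` makes `transformers` train with a per-label sigmoid and BCE loss rather than a softmax, which is consistent with reporting a separate ROC AUC for each label as the README does.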