ptaszynski committed
Commit ac1e775, 1 parent: 8aba71d

Update README.md

README.md CHANGED
@@ -1,3 +1,34 @@
 ---
+
+language: ja
+
 license: cc-by-sa-4.0
+
 ---
+
+# electra-base-cyberbullying
+
+This is an [ELECTRA](https://github.com/google-research/electra) Base model for the Japanese language, fine-tuned for automatic cyberbullying detection.
+
+The model is based on [Megagon Labs ELECTRA Base](https://huggingface.co/megagonlabs/transformers-ud-japanese-electra-base-discriminator) and was then fine-tuned on a balanced dataset created by unifying two datasets: the "Harmful BBS Japanese comments dataset" and the "Twitter Japanese cyberbullying dataset".
+
+
+## Licenses
+
+The fine-tuned model and all attached files are licensed under [CC BY-SA 4.0](http://creativecommons.org/licenses/by-sa/4.0/) (Creative Commons Attribution-ShareAlike 4.0 International License).
+
+<a rel="license" href="http://creativecommons.org/licenses/by-sa/4.0/"><img alt="Creative Commons License" style="border-width:0" src="https://i.creativecommons.org/l/by-sa/4.0/88x31.png" /></a>
+
+## Citations
+
+Please cite this model using the following BibTeX entry.
+
+```
+@inproceedings{tanabe2022electra-base-cyberbullying,
+title={北見工業大学 テキスト情報処理研究室 ELECTRA Base ネットいじめ検出モデル},
+author={田邊 威裕 and プタシンスキ ミハウ and エロネン ユーソ and 桝井 文人},
+publisher={HuggingFace},
+year={2022},
+url = "https://huggingface.co/kit-nlp/transformers-ud-japanese-electra-base-discriminator-cyberbullying"
+}
+```