Update README.md
Didn't realize the <mask> tag was messing up the preview
README.md
CHANGED
@@ -14,21 +14,21 @@ RoBERTA pretrained tokenizer vocab and merges included.
 - **Preprocessing**:
 Tokenized with a pretrained ByteLevelBPETokenizer trained on the same dataset. Uncased, with <s>, <pad>, </s>, <unk>, and <mask> special tokens.
 - **Evaluation results**:
-- Mnoho ľudí tu
-
-
-
-
-- Ako sa
-
-
-
-
-- Plážová sezóna pod Zoborom patrí medzi
-
-
-
-
+- Mnoho ľudí tu MASK
+- žije.
+- žijú.
+- je.
+- trpí.
+- Ako sa MASK
+- máte
+- máš
+- má
+- hovorí
+- Plážová sezóna pod Zoborom patrí medzi MASK obdobia.
+- ročné
+- najkrajšie
+- najobľúbenejšie
+- najnáročnejšie
 
 - **Limitations**:
 The current model is fairly small, although it works very well. This model is meant to be finetuned on downstream tasks e.g. Part-of-Speech tagging, Question Answering, anything in GLUE or SUPERGLUE.
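The commit message attributes the broken preview to the literal `<mask>` tag in the README. A minimal sketch of why that can happen, assuming the preview drops anything that looks like an HTML tag (this is an assumption; the actual renderer is not shown in this commit). `naive_preview` and `TAG_PATTERN` are hypothetical names used only for illustration.

```python
import html
import re

# Hypothetical stand-in for a preview renderer that silently drops
# unrecognized HTML tags. An assumption for illustration only, not the
# actual renderer behind the README preview.
TAG_PATTERN = re.compile(r"</?[A-Za-z][^>]*>")


def naive_preview(markdown_line: str) -> str:
    """Return the line with anything that looks like an HTML tag removed."""
    return TAG_PATTERN.sub("", markdown_line)


raw = "- Ako sa <mask>"

# The literal token is treated as a tag and disappears from the output.
print(repr(naive_preview(raw)))

# Workaround used in this commit: a plain-text placeholder survives.
print(repr(naive_preview(raw.replace("<mask>", "MASK"))))

# Alternative: escape the angle brackets so the token is kept as text.
print(repr(naive_preview(html.escape(raw))))
```

The commit takes the placeholder route (`MASK`). Escaping the brackets, or wrapping the token in backticks in the Markdown source, would be another way to keep the literal token visible, provided the renderer honors it.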