martynawck
commited on
Commit
•
7e267e7
1
Parent(s):
9f5c22f
Update index.html
Browse files- index.html +14 -17
index.html
CHANGED
@@ -11,15 +11,15 @@
|
|
11 |
|
12 |
<div class="container">
|
13 |
<h2 style="text-align: center;">NLPre-PL Dataset</h2>
|
14 |
-
<p>The official NLPre-PL dataset - a uniformly paragraph-level divided version of NKJP1M corpus – the 1-million token balanced subcorpus of the National Corpus of Polish (Narodowy Korpus
|
15 |
</p>
|
16 |
-
<p
|
17 |
The NLPre dataset aims at fairly dividing the paragraphs length-wise and topic-wise into train, development, and test sets. Thus, we ensure a similar number of segments distribution per paragraph and avoid the situation when paragraphs with a small (or large) number of segments are available only e.g. during test time.
|
18 |
</p>
|
19 |
<p>
|
20 |
<a style="text-align: center;" href="https://huggingface.co/datasets/ipipan/nlprepl" target="_blank" class="btn btn-primary btn-lg active" role="button" aria-pressed="true">🤗 NLPre-PL Dataset</a>
|
21 |
|
22 |
-
<a style="text-align: center;"href="
|
23 |
</p>
|
24 |
|
25 |
<div><p></p></div>
|
@@ -34,26 +34,23 @@
|
|
34 |
|
35 |
<p><b>UD TAGSET</b></p>
|
36 |
<p>
|
37 |
-
<ul class="list-group list-group-light list-group-small">
|
38 |
-
|
39 |
-
|
40 |
-
<li class="list-group-item"><a href="https://
|
41 |
-
<li class="list-group-item"><a href="https://huggingface.co/ipipan/
|
42 |
-
<li class="list-group-item"><a href="https://huggingface.co/ipipan/
|
43 |
-
<li class="list-group-item"><a href="https://huggingface.co/ipipan/
|
44 |
-
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_nkjp-by-type" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-type </a></li>
|
45 |
-
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_nkjp-by-name" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-name </a></li>
|
46 |
-
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_nkjp-by-type" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-type </a></li>
|
47 |
</ul>
|
48 |
</p>
|
49 |
|
50 |
<p><b>NKJP TAGSET</b></p>
|
51 |
<p>
|
52 |
<ul class="list-group list-group-light list-group-small">
|
53 |
-
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_herBERT_nkjp-by-name" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-name</a></li>
|
54 |
-
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_herBERT_nkjp-by-type" class="btn btn-seconday btn-lg active" >COMBO + HerBERT + NLPrePL-fair-by-type </a></li>
|
55 |
-
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_fasttext_nkjp-by-name" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-name</a></li>
|
56 |
-
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_fasttext_nkjp-by-type" class="btn btn-seconday btn-lg active" >COMBO + fasttext + NLPrePL-fair-by-type </a></li>
|
57 |
</ul>
|
58 |
|
59 |
|
|
|
11 |
|
12 |
<div class="container">
|
13 |
<h2 style="text-align: center;">NLPre-PL Dataset</h2>
|
14 |
+
<p>The official NLPre-PL dataset - a uniformly paragraph-level divided version of NKJP1M corpus – the 1-million token balanced subcorpus of the National Corpus of Polish (Narodowy Korpus Jezyka Polskiego).
|
15 |
</p>
|
16 |
+
<p>
|
17 |
The NLPre dataset aims at fairly dividing the paragraphs length-wise and topic-wise into train, development, and test sets. Thus, we ensure a similar number of segments distribution per paragraph and avoid the situation when paragraphs with a small (or large) number of segments are available only e.g. during test time.
|
18 |
</p>
|
19 |
<p>
|
20 |
<a style="text-align: center;" href="https://huggingface.co/datasets/ipipan/nlprepl" target="_blank" class="btn btn-primary btn-lg active" role="button" aria-pressed="true">🤗 NLPre-PL Dataset</a>
|
21 |
|
22 |
+
<a style="text-align: center;"href="http://git.nlp.ipipan.waw.pl/alina/PDBUD" target="_blank" class="btn btn-primary btn-lg active" role="button" aria-pressed="true">🤗 PDB-UD Dataset</a>
|
23 |
</p>
|
24 |
|
25 |
<div><p></p></div>
|
|
|
34 |
|
35 |
<p><b>UD TAGSET</b></p>
|
36 |
<p>
|
37 |
+
<ul class="list-group list-group-light list-group-small">
|
38 |
+
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_pdb" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + PDB-UD</a></li>
|
39 |
+
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_pdb" target="_blank" class="btn btn-seconday btn-lg active" >COMBO + fasttext + PDB-UD </a></li>
|
40 |
+
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-name </a></li>
|
41 |
+
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-type </a></li>
|
42 |
+
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-name </a></li>
|
43 |
+
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-type </a></li>
|
|
|
|
|
|
|
44 |
</ul>
|
45 |
</p>
|
46 |
|
47 |
<p><b>NKJP TAGSET</b></p>
|
48 |
<p>
|
49 |
<ul class="list-group list-group-light list-group-small">
|
50 |
+
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_herBERT_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-name</a></li>
|
51 |
+
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_herBERT_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" >COMBO + HerBERT + NLPrePL-fair-by-type </a></li>
|
52 |
+
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-name</a></li>
|
53 |
+
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" >COMBO + fasttext + NLPrePL-fair-by-type </a></li>
|
54 |
</ul>
|
55 |
|
56 |
|