martynawck commited on
Commit
7e267e7
1 Parent(s): 9f5c22f

Update index.html

Browse files
Files changed (1) hide show
  1. index.html +14 -17
index.html CHANGED
@@ -11,15 +11,15 @@
11
 
12
  <div class="container">
13
  <h2 style="text-align: center;">NLPre-PL Dataset</h2>
14
- <p>The official NLPre-PL dataset - a uniformly paragraph-level divided version of NKJP1M corpus – the 1-million token balanced subcorpus of the National Corpus of Polish (Narodowy Korpus Języka Polskiego).
15
  </p>
16
- <p></p>
17
  The NLPre dataset aims at fairly dividing the paragraphs length-wise and topic-wise into train, development, and test sets. Thus, we ensure a similar number of segments distribution per paragraph and avoid the situation when paragraphs with a small (or large) number of segments are available only e.g. during test time.
18
  </p>
19
  <p>
20
  <a style="text-align: center;" href="https://huggingface.co/datasets/ipipan/nlprepl" target="_blank" class="btn btn-primary btn-lg active" role="button" aria-pressed="true">&#129303; NLPre-PL Dataset</a>
21
 
22
- <a style="text-align: center;"href="https://git.nlp.ipipan.waw.pl/alina/PDBUD" target="_blank" class="btn btn-primary btn-lg active" role="button" aria-pressed="true">&#129303; PDB-UD Dataset</a>
23
  </p>
24
 
25
  <div><p></p></div>
@@ -34,26 +34,23 @@
34
 
35
  <p><b>UD TAGSET</b></p>
36
  <p>
37
- <ul class="list-group list-group-light list-group-small">
38
-
39
-
40
- <li class="list-group-item"><a href="https://git.nlp.ipipan.waw.pl/alina/PDBUD" class="btn btn-seconday btn-lg active" target="_blank" > alina</a></li>
41
- <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_pdb" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + PDB-UD</a></li>
42
- <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_pdb" class="btn btn-seconday btn-lg active" >COMBO + fasttext + PDB-UD </a></li>
43
- <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_nkjp-by-name" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-name </a></li>
44
- <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_nkjp-by-type" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-type </a></li>
45
- <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_nkjp-by-name" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-name </a></li>
46
- <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_nkjp-by-type" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-type </a></li>
47
  </ul>
48
  </p>
49
 
50
  <p><b>NKJP TAGSET</b></p>
51
  <p>
52
  <ul class="list-group list-group-light list-group-small">
53
- <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_herBERT_nkjp-by-name" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-name</a></li>
54
- <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_herBERT_nkjp-by-type" class="btn btn-seconday btn-lg active" >COMBO + HerBERT + NLPrePL-fair-by-type </a></li>
55
- <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_fasttext_nkjp-by-name" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-name</a></li>
56
- <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_fasttext_nkjp-by-type" class="btn btn-seconday btn-lg active" >COMBO + fasttext + NLPrePL-fair-by-type </a></li>
57
  </ul>
58
 
59
 
 
11
 
12
  <div class="container">
13
  <h2 style="text-align: center;">NLPre-PL Dataset</h2>
14
+ <p>The official NLPre-PL dataset - a uniformly paragraph-level divided version of NKJP1M corpus – the 1-million token balanced subcorpus of the National Corpus of Polish (Narodowy Korpus Jezyka Polskiego).
15
  </p>
16
+ <p>
17
  The NLPre dataset aims at fairly dividing the paragraphs length-wise and topic-wise into train, development, and test sets. Thus, we ensure a similar number of segments distribution per paragraph and avoid the situation when paragraphs with a small (or large) number of segments are available only e.g. during test time.
18
  </p>
19
  <p>
20
  <a style="text-align: center;" href="https://huggingface.co/datasets/ipipan/nlprepl" target="_blank" class="btn btn-primary btn-lg active" role="button" aria-pressed="true">&#129303; NLPre-PL Dataset</a>
21
 
22
+ <a style="text-align: center;"href="http://git.nlp.ipipan.waw.pl/alina/PDBUD" target="_blank" class="btn btn-primary btn-lg active" role="button" aria-pressed="true">&#129303; PDB-UD Dataset</a>
23
  </p>
24
 
25
  <div><p></p></div>
 
34
 
35
  <p><b>UD TAGSET</b></p>
36
  <p>
37
+ <ul class="list-group list-group-light list-group-small">
38
+ <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_pdb" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + PDB-UD</a></li>
39
+ <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_pdb" target="_blank" class="btn btn-seconday btn-lg active" >COMBO + fasttext + PDB-UD </a></li>
40
+ <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-name </a></li>
41
+ <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-type </a></li>
42
+ <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-name </a></li>
43
+ <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-type </a></li>
 
 
 
44
  </ul>
45
  </p>
46
 
47
  <p><b>NKJP TAGSET</b></p>
48
  <p>
49
  <ul class="list-group list-group-light list-group-small">
50
+ <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_herBERT_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-name</a></li>
51
+ <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_herBERT_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" >COMBO + HerBERT + NLPrePL-fair-by-type </a></li>
52
+ <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-name</a></li>
53
+ <li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" >COMBO + fasttext + NLPrePL-fair-by-type </a></li>
54
  </ul>
55
 
56