|
<!DOCTYPE html> |
|
<html> |
|
<head> |
|
<title>NLPre-PL Dataset and Trained Models</title> |
|
<meta name="viewport" content="width=device-width, initial-scale=1"> |
|
<link rel="stylesheet" href="https://maxcdn.bootstrapcdn.com/bootstrap/4.0.0/css/bootstrap.min.css"> |
|
<script src="https://ajax.googleapis.com/ajax/libs/jquery/3.6.0/jquery.min.js"></script> |
|
<script src="https://maxcdn.bootstrapcdn.com/bootstrap/4.0.0/js/bootstrap.min.js"></script> |
|
</head> |
|
<body> |
|
|
|
<div class="container"> |
|
<hr> |
|
<h2 style="text-align: center;">NLPre-PL Dataset</h2> |
|
<hr> |
|
<p>The official NLPre-PL dataset is a version of the NKJP1M corpus that is divided uniformly at the paragraph level. NKJP1M is the 1-million-token balanced subcorpus of the National Corpus of Polish (Narodowy Korpus Języka Polskiego). |
|
</p> |
|
<p> |
|
The NLPre-PL dataset aims to divide the paragraphs fairly, both length-wise and topic-wise, into train, development, and test sets. This keeps the distribution of the number of segments per paragraph similar across the splits and avoids the situation where paragraphs with a small (or large) number of segments are available in only one split, e.g. only at test time. |
|
</p> |
|
<p> |
|
<a style="text-align: center;" href="https://huggingface.co/datasets/ipipan/nlprepl" target="_blank" class="btn btn-primary btn-lg active" role="button" aria-pressed="true">🤗 NLPre-PL Dataset</a> |
|
|
|
<a style="text-align: center;" href="http://git.nlp.ipipan.waw.pl/alina/PDBUD" target="_blank" class="btn btn-primary btn-lg active" role="button" aria-pressed="true">🤗 PDB-UD Dataset</a> |
|
</p> |
|
|
|
<div><p></p></div> |
|
|
|
<div class="container"> |
|
<hr> |
|
<h2 style="text-align: center;">NLPre-PL Trained models</h2> |
|
<hr> |
|
<p>Listed below are all available models trained for the purpose of creating the NLPre-PL benchmark.</p> |
|
|
|
<div class="accordion" id="accordionExample"> |
|
<div class="card"> |
|
<div class="card-header" id="headingOne"> |
|
<h5 class="mb-0"> |
|
<button class="btn btn-link" type="button" data-toggle="collapse" data-target="#collapseOne" aria-expanded="true" aria-controls="collapseOne"> |
|
🤗 COMBO |
|
</button> |
|
</h5> |
|
</div> |
|
|
|
<div id="collapseOne" class="collapse show" aria-labelledby="headingOne" data-parent="#accordionExample"> |
|
<hr> |
|
<h5 style="text-align: center;">UD TAGSET</h5> |
|
<hr> |
|
|
|
<ul class="list-group list-group-light list-group-small"> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_pdb" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + PDB-UD</a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-name </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_herBERT_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-type </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_pdb" target="_blank" class="btn btn-seconday btn-lg active" >COMBO + fasttext + PDB-UD </a></li> |
|
|
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-name </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_ud_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-type </a></li> |
|
</ul> |
|
|
|
|
|
<hr> |
|
<h5 style="text-align: center;">NKJP TAGSET</h5> |
|
<hr> |
|
|
|
<ul class="list-group list-group-light list-group-small"> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_herBERT_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + HerBERT + NLPrePL-fair-by-name</a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_herBERT_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" >COMBO + HerBERT + NLPrePL-fair-by-type </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > COMBO + fasttext + NLPrePL-fair-by-name</a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_combo_nkjp_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" >COMBO + fasttext + NLPrePL-fair-by-type </a></li> |
|
</ul> |
|
|
|
</div> |
|
</div> |
|
<div class="card"> |
|
<div class="card-header" id="headingTwo"> |
|
<h5 class="mb-0"> |
|
<button class="btn btn-link collapsed" type="button" data-toggle="collapse" data-target="#collapseTwo" aria-expanded="false" aria-controls="collapseTwo"> |
|
🤗 SPACY |
|
</button> |
|
</h5> |
|
</div> |
|
<div id="collapseTwo" class="collapse" aria-labelledby="headingTwo" data-parent="#accordionExample"> |
|
<div class="card-body"> |
|
<hr> |
|
<h5 style="text-align: center;">UD TAGSET</h5> |
|
<hr> |
|
|
|
<ul class="list-group list-group-light list-group-small"> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_fasttext_pdb" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + fasttext + PDB-UD</a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" >spaCy + fasttext + NLPrePL-fair-by-name </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + fasttext + NLPrePL-fair-by-type </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_transformer_pdb" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + transformer + PDB-UD</a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_transformer_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + transformer + NLPrePL-fair-by-name </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_transformer_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + transformer + NLPrePL-fair-by-type </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_pl-core-news-lg_pdb" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + pl-core-news-lg + PDB-UD </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_pl-core-news-lg_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + pl-core-news-lg + NLPrePL-fair-by-name </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_ud_pl-core-news-lg_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + pl-core-news-lg + NLPrePL-fair-by-type </a></li> |
|
</ul> |
|
|
|
<hr> |
|
<h5 style="text-align: center;">NKJP TAGSET</h5> |
|
<hr> |
|
|
|
<ul class="list-group list-group-light list-group-small"> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_nkjp_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + fasttext + NLPrePL-fair-by-name</a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_nkjp_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + fasttext + NLPrePL-fair-by-type </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_nkjp_transformer_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + transformer + NLPrePL-fair-by-name</a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_nkjp_transformer_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + transformer + NLPrePL-fair-by-type </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_nkjp_pl-core-news-lg_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + pl-core-news-lg + NLPrePL-fair-by-name</a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_spacy_nkjp_pl-core-news-lg_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > spaCy + pl-core-news-lg + NLPrePL-fair-by-type </a></li> |
|
</ul> |
|
</div> |
|
</div> |
|
</div> |
|
<div class="card"> |
|
<div class="card-header" id="headingThree"> |
|
<h5 class="mb-0"> |
|
<button class="btn btn-link collapsed" type="button" data-toggle="collapse" data-target="#collapseThree" aria-expanded="false" aria-controls="collapseThree"> |
|
🤗 STANZA |
|
</button> |
|
</h5> |
|
</div> |
|
<div id="collapseThree" class="collapse" aria-labelledby="headingThree" data-parent="#accordionExample"> |
|
<div class="card-body"> |
|
<hr> |
|
<h5 style="text-align: center;">UD TAGSET</h5> |
|
<hr> |
|
|
|
<ul class="list-group list-group-light list-group-small"> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_stanza_ud_fasttext_pdb" target="_blank" class="btn btn-seconday btn-lg active" > Stanza + fasttext + PDB-UD</a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_stanza_ud_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" >Stanza + fasttext + NLPrePL-fair-by-name </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_stanza_ud_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > Stanza + fasttext + NLPrePL-fair-by-type </a></li> |
|
|
|
</ul> |
|
|
|
|
|
<hr> |
|
<h5 style="text-align: center;">NKJP TAGSET</h5> |
|
<hr> |
|
|
|
<ul class="list-group list-group-light list-group-small"> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_stanza_nkjp_fasttext_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > Stanza + fasttext + NLPrePL-fair-by-name</a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_stanza_nkjp_fasttext_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > Stanza + fasttext + NLPrePL-fair-by-type </a></li> |
|
|
|
</ul> |
|
</div> |
|
</div> |
|
</div> |
|
<div class="card"> |
|
<div class="card-header" id="headingFour"> |
|
<h5 class="mb-0"> |
|
<button class="btn btn-link collapsed" type="button" data-toggle="collapse" data-target="#collapseFour" aria-expanded="false" aria-controls="collapseFour"> |
|
🤗 TRANKIT |
|
</button> |
|
</h5> |
|
</div> |
|
<div id="collapseFour" class="collapse" aria-labelledby="headingFour" data-parent="#accordionExample"> |
|
<div class="card-body"> |
|
<hr> |
|
<h5 style="text-align: center;">UD TAGSET</h5> |
|
<hr> |
|
|
|
<ul class="list-group list-group-light list-group-small"> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_ud_xlm-roberta-large_pdb" target="_blank" class="btn btn-seconday btn-lg active" > Trankit + xlm-RoBERTa-Large + PDB-UD</a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_ud_xlm-roberta-large_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" >Trankit + xlm-RoBERTa-Large + NLPrePL-fair-by-name</a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_ud_xlm-roberta-large_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > Trankit + xlm-RoBERTa-Large + NLPrePL-fair-by-type </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_ud_xlm-roberta-base_pdb" target="_blank" class="btn btn-seconday btn-lg active" > Trankit + xlm-RoBERTa-Base + PDB-UD </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_ud_xlm-roberta-base_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > Trankit + xlm-RoBERTa-Base + NLPrePL-fair-by-name </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_ud_xlm-roberta-base_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" > Trankit + xlm-RoBERTa-Base + NLPrePL-fair-by-type </a></li> |
|
</ul> |
|
|
|
<hr> |
|
<h5 style="text-align: center;">NKJP TAGSET</h5> |
|
<hr> |
|
<ul class="list-group list-group-light list-group-small"> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_nkjp_xlm-roberta-large_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > Trankit + xlm-RoBERTa-Large + NLPrePL-fair-by-name</a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_nkjp_xlm-roberta-large_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" >Trankit + xlm-RoBERTa-Large + NLPrePL-fair-by-type </a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_nkjp_xlm-roberta-base_nkjp-by-name" target="_blank" class="btn btn-seconday btn-lg active" > Trankit + xlm-RoBERTa-Base + NLPrePL-fair-by-name</a></li> |
|
<li class="list-group-item"><a href="https://huggingface.co/ipipan/nlpre_trankit_nkjp_xlm-roberta-base_nkjp-by-type" target="_blank" class="btn btn-seconday btn-lg active" >Trankit + xlm-RoBERTa-Base + NLPrePL-fair-by-type </a></li> |
|
</ul> |
|
</div> |
|
</div> |
|
</div> |
|
</div> |
|
|
|
</div> |
|
</div> |
|
|
|
</body> |
|
</html> |