Fsoft-AIC
/

videberta-base

Token Classification

Inference Endpoints

Model card Files Files and versions Community

videberta-base / README.md

Aehus's picture

Upload 6 files

a27c817 over 1 year ago

|

history blame contribute delete

1.88 kB

	---
	language:
	- vi
	metrics:
	- f1
	pipeline_tag: token-classification
	tags:
	- transformer
	- vietnamese
	- nlp
	- bert
	- deberta
	- deberta-v3
	---

	# ViDeBERTa: A powerful pre-trained language model for Vietnamese


	ViDeBERTa, a new pre-trained monolingual language model for Vietnamese,
	with three versions - ViDeBERTa_xsmall, ViDeBERTa_base, and ViDeBERTa_large,
	which are pre-trained on 138GB of Vietnamese text of high-quality and diverse Vietnamese text using DeBERTaV3 architecture.

	Please check the [official repository][github] for more implementation details and updates

	The DeBERTa V3 xsmall model comes with 12 layers and a hidden size
	of 384. It has only 22M backbone parameters with a vocabulary
	containing 128K tokens which introduces 48M parameters in the
	Embedding layer. This model was trained using CC100 dataset, which consists of 138 GB of Vietnamese text.

	## Fine-tuning on NLU tasks
	We present the dev results on VLSP POS, PhoNER, ViQuAD dataset.

	\| Model\|#Params(M)\| POS \| NER \| MRC \|
	\|-----------\|-------\|---------\|-----\|----------\|
	\| XLM-R-base \| 125M \| 96.2 \| - \| 82.0 \|
	\| XLM-R-large \| 355M \| 96.3 \| 93.8 \| 87.0 \|
	\| PhoBERT-base \| 135M \| 96.7 \| 80.1 \|
	\| PhoBERT-large \| 370M \| 96.8 \| 83.5 \|
	\| ViT5-base \| 310M \| - \| 94.5 \| - \|
	\| ViT5-large \| 866M \| - \| 93.8 \| - \|
	\| ViDeBERTa-xsmall \| 22M \| 96.4 \| 93.6 \| 81.3 \|
	\| ViDeBERTa-base \| 86M \| 96.8 \| 94.5 \| 85.7 \|
	\| ViDeBERTa-large \| 304M \| 97.2 \| 95.3 \| 89.9 \|

	## Citation
	If you find ViDeBERTa useful for your work, please cite the following papers:
	```latex
	@article{dao2023videberta,
	title={ViDeBERTa: A powerful pre-trained language model for Vietnamese},
	author={Dao Tran, Cong and Pham, Nhut Huy and Nguyen, Anh and Son Hy, Truong and Vu, Tu},
	journal={arXiv e-prints},
	pages={arXiv--2301},
	year={2023}
	}
	```

	[github]: https://github.com/HySonLab/ViDeBERTa