---
language:
- ms
tags:
- paraphrase
metrics:
- sacrebleu
---
# finetune-paraphrase-t5-small-standard-bahasa-cased
T5 small fine-tuned on Malay (MS) paraphrase tasks.
## Dataset
1. Translated PAWS, https://huggingface.co/datasets/mesolitica/translated-PAWS
2. Translated MRPC, https://huggingface.co/datasets/mesolitica/translated-MRPC
## Finetune details
1. Fine-tuned on a single RTX 3090 Ti.

Scripts are at https://github.com/huseinzol05/malaya/tree/master/session/paraphrase/hf-t5
## Supported prefix
1. `parafrasa: {string}`, for Malay (MS) paraphrasing.
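
A minimal sketch of how the prefix might be applied with the `transformers` library. The repository id in `MODEL_ID`, the helper names `build_prompt` and `paraphrase`, and the `max_length` value are assumptions made for illustration; they are not taken from this card, so adjust them to the actual repository.

```python
PREFIX = "parafrasa: "

# Assumption: hypothetical repository id built from this card's title.
MODEL_ID = "mesolitica/finetune-paraphrase-t5-small-standard-bahasa-cased"


def build_prompt(text: str) -> str:
    """Prepend the supported prefix to the input sentence."""
    return PREFIX + text.strip()


def paraphrase(text: str, model_id: str = MODEL_ID, max_length: int = 256) -> str:
    """Generate one paraphrase; downloads the model on first use."""
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import AutoTokenizer, T5ForConditionalGeneration

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = T5ForConditionalGeneration.from_pretrained(model_id)

    inputs = tokenizer(build_prompt(text), return_tensors="pt")
    # max_length is an illustrative value, not a setting from the training scripts.
    output_ids = model.generate(**inputs, max_length=max_length)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `paraphrase("Saya suka makan nasi.")` would send `parafrasa: Saya suka makan nasi.` to the model and return the decoded output.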
## Evaluation
Evaluated on the MRPC validation set and the PAWS test set, scored with sacreBLEU:
```
{'name': 'BLEU',
'score': 61.559202822392486,
'_mean': -1.0,
'_ci': -1.0,
'_verbose': '86.0/68.4/56.1/46.3 (BP = 0.984 ratio = 0.984 hyp_len = 138806 ref_len = 141004)',
'bp': 0.9842896648933511,
'counts': [119394, 90149, 69905, 54500],
'totals': [138806, 131732, 124658, 117584],
'sys_len': 138806,
'ref_len': 141004,
'precisions': [86.0150137602121,
68.43363799228737,
56.07742784257729,
46.34984351612464],
'prec_str': '86.0/68.4/56.1/46.3',
'ratio': 0.9844117897364614}
```
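
As a sanity check, the reported score can be recomputed from the n-gram counts above. This sketch assumes the standard corpus-level BLEU definition (brevity penalty times the geometric mean of the four n-gram precisions); the variable names are illustrative.

```python
import math

# Values copied from the evaluation output above.
counts = [119394, 90149, 69905, 54500]      # matched n-grams, n = 1..4
totals = [138806, 131732, 124658, 117584]   # hypothesis n-grams, n = 1..4
sys_len, ref_len = 138806, 141004

# n-gram precisions in percent.
precisions = [100.0 * c / t for c, t in zip(counts, totals)]

# Brevity penalty: applies only when the hypothesis is shorter than the reference.
bp = 1.0 if sys_len >= ref_len else math.exp(1.0 - ref_len / sys_len)

# BLEU = BP * geometric mean of the precisions.
score = bp * math.exp(sum(math.log(p) for p in precisions) / len(precisions))

print(f"bp={bp:.4f} score={score:.4f}")
```

The resulting `bp` and `score` should agree with the `'bp'` and `'score'` fields in the output above, up to floating-point rounding.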