File size: 712 Bytes
2ed5a5d
 
ac511fa
 
 
 
 
 
 
 
 
 
 
2ed5a5d
 
ac511fa
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
---
license: mit
language:
- zh
- ami
metrics:
- bleu
base_model:
- facebook/nllb-200-3.3B
pipeline_tag: translation
library_name: transformers
tags:
- linguistics
---

## amis-zh-3.3BM

This model is a fine tuned version of the facebook/nllb-200-3.3B parameter undistilled model for the Amis language. 

The model was fit with a custom tokenizer and language tag for Amis and additional Traditional Chinese tokens. 

The model was trained on roughly ~90k sentence pairs for 55k Epochs until validation loss plateaued

## Evaluation 

The mode was evaluated on 1,500 test sentence pairs bidirectionally 

#### BLEU 
* Amis -> Chinese: **27.65** 33.1/30.0/26.0/22.6
* Chinese -> Amis: **16.83** 38.2/18.6/12.1/9.3