projecte-aina
/

aina-translator-it-ca

Fairseq

Italian

Catalan

Model card Files Files and versions Community

fdelucaf commited on Dec 14, 2023

Commit

2c1826c

•

1 Parent(s): b26c9cd

Update README.md

Browse files

Files changed (1) hide show

README.md +16 -1

README.md CHANGED Viewed

@@ -109,30 +109,45 @@ The following hyperparameters were set on the Fairseq toolkit:
 | Warmup updates                 	| 8000                         	|
 | Dropout                        	| 0.1                          	|
 | Label smoothing                	| 0.1                          	|
 The model was trained for a total of 19.000 updates. Weights were saved every 1000 updates and reported results are the average of the last 4 checkpoints.
 ## Evaluation
 ### Variable and metrics
-We use the BLEU score for evaluation on the Flores test set: [Flores-101](https://github.com/facebookresearch/flores),
 ### Evaluation results
 Below are the evaluation results on the machine translation from Catalan to Italian compared to [Softcatalà](https://www.softcatala.org/) and [Google Translate](https://translate.google.es/?hl=es):
 | Test set         	| SoftCatalà | Google Translate |mt-aina-it-ca|
 |----------------------|------------|------------------|---------------|
 | Flores 101 dev   	| 25,4     	| **30,4**     	| 27,5     	|
 | Flores 101 devtest   |26,6   	| **31,2**     	| 27,7     	|
 | NTREX | 29,3 | **33,5** | 30,7 |
 | Average          	| 27,1  	| **31,7**     	| 28,6      	|
 ## Additional information
 ### Author
 Language Technologies Unit (LangTech) at the Barcelona Supercomputing Center.
 ### Contact information
 For further information, send an email to <langtech@bsc.es>
 ### Copyright
 Copyright Language Technologies Unit at Barcelona Supercomputing Center (2023)
 ### Licensing information
 This work is licensed under a [Apache License, Version 2.0](https://www.apache.org/licenses/LICENSE-2.0)
 ### Funding
 This work was funded by the Departament de la Vicepresidència i de Polítiques Digitals i Territori de la Generalitat de Catalunya within the framework of Projecte AINA.
 ### Disclaimer
 <details>
 <summary>Click to expand</summary>
 The models published in this repository are intended for a generalist purpose and are available to third parties. These models may have bias and/or any other undesirable distortions.

 | Warmup updates                 	| 8000                         	|
 | Dropout                        	| 0.1                          	|
 | Label smoothing                	| 0.1                          	|
 The model was trained for a total of 19.000 updates. Weights were saved every 1000 updates and reported results are the average of the last 4 checkpoints.
 ## Evaluation
 ### Variable and metrics
+We use the BLEU score for evaluation on the [Flores-101](https://github.com/facebookresearch/flores), and [NTREX](https://github.com/MicrosoftTranslator/NTREX) evaluation datasets.
 ### Evaluation results
 Below are the evaluation results on the machine translation from Catalan to Italian compared to [Softcatalà](https://www.softcatala.org/) and [Google Translate](https://translate.google.es/?hl=es):
 | Test set         	| SoftCatalà | Google Translate |mt-aina-it-ca|
 |----------------------|------------|------------------|---------------|
 | Flores 101 dev   	| 25,4     	| **30,4**     	| 27,5     	|
 | Flores 101 devtest   |26,6   	| **31,2**     	| 27,7     	|
 | NTREX | 29,3 | **33,5** | 30,7 |
 | Average          	| 27,1  	| **31,7**     	| 28,6      	|
 ## Additional information
 ### Author
 Language Technologies Unit (LangTech) at the Barcelona Supercomputing Center.
 ### Contact information
 For further information, send an email to <langtech@bsc.es>
 ### Copyright
 Copyright Language Technologies Unit at Barcelona Supercomputing Center (2023)
 ### Licensing information
 This work is licensed under a [Apache License, Version 2.0](https://www.apache.org/licenses/LICENSE-2.0)
 ### Funding
 This work was funded by the Departament de la Vicepresidència i de Polítiques Digitals i Territori de la Generalitat de Catalunya within the framework of Projecte AINA.
 ### Disclaimer
 <details>
 <summary>Click to expand</summary>
 The models published in this repository are intended for a generalist purpose and are available to third parties. These models may have bias and/or any other undesirable distortions.