vgaraujov
/

bart-base-translation-es-en

Text2Text Generation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

BARTO (base-sized model) for es-en translation

This model is a fine-tuned version of BARTO on a small portion of WMT13 es-en dataset. It achieves the following results on the evaluation set:

Loss: 1.4562
Bleu: 30.222
Gen Len: 42.0952

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 0.005
train_batch_size: 96
eval_batch_size: 96
seed: 42
gradient_accumulation_steps: 4
total_train_batch_size: 384
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
lr_scheduler_warmup_steps: 40000
training_steps: 5000

Framework versions

Transformers 4.33.0.dev0
Pytorch 2.0.1+cu117
Datasets 2.14.4
Tokenizers 0.13.3

Downloads last month: 12

Inference Examples

Text2Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for vgaraujov/bart-base-translation-es-en

Base model

vgaraujov/bart-base-spanish

Finetuned

(9)

this model

Dataset used to train vgaraujov/bart-base-translation-es-en

Collection including vgaraujov/bart-base-translation-es-en

Fine-tuned Spanish PLMs

4 items • Updated Mar 18

Evaluation results

Bleu on vgaraujov/wmt13 es-en
validation set self-reported

30.222

View on Papers With Code