wav2vec2-xls-r-300m-br
This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on Mozilla Common Voice 15 Breton dataset and Roadennoù dataset. It achieves the following results on the MCV15-br test set:
- Wer: 41.0
- Cer: 14.7
Model description
This model was trained to assess the performance wav2vec2-xls-r-300m for fine-tuning a Breton ASR model.
Intended uses & limitations
This is a research model. Usage for production is not recommended.
Training and evaluation data
The training dataset consists of MCV15-br train dataset and 90% of the Roadennoù dataset. The validation dataset consists of MCV15-br validation dataset and the remaining 10% of the Roadennoù dataset. The final test dataset consists of MCV15-br test dataset.
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 6e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- gradient_accumulation_steps: 2
- total_train_batch_size: 16
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 500
- num_epochs: 40
- mixed_precision_training: Native AMP
Framework versions
- Transformers 4.39.1
- Pytorch 2.0.1+cu117
- Datasets 2.18.0
- Tokenizers 0.15.2
- Downloads last month
- 54
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.
Model tree for gweltou/wav2vec2-xls-r-300m-br
Base model
facebook/wav2vec2-xls-r-300mEvaluation results
- WER on common_voice_15_0self-reported41.000
- CER on common_voice_15_0self-reported14.700