Speech Recognition
Collection
9 items
•
Updated
This model is a fine-tune of openai/whisper-tiny using custom splits from Common Voice 16.1 Welsh and English datasets as well as normalized verbatim transcriptions from techiaith/banc-trawsgrifiadau-bangor
Due to its small size, this model is intended to be used as the basis for offline speech recognition on devices such as Android phones.
It achieves the following results on the evaluation set:
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss | Wer |
---|---|---|---|---|
0.8115 | 1.41 | 1000 | 0.8426 | 60.0795 |
0.6396 | 2.83 | 2000 | 0.7508 | 54.4259 |
0.5259 | 4.24 | 3000 | 0.7255 | 53.1328 |
0.4854 | 5.66 | 4000 | 0.7176 | 53.1135 |