language: en | |
tags: | |
- transcription | |
- T5 | |
- huggingface | |
license: apache-2.0 | |
datasets: custom | |
model_type: t5 | |
# T5-based Audio Transcription Fusion Model | |
This model combines transcriptions from multiple sources separated by '/' to generate an optimal transcription. It is fine-tuned on a dataset where each sample has three candidate transcriptions and a reference transcription. | |
### Training Details | |
Model trained on 21000 samples for 10 epochs with T5-small as the base model. | |
Training Loss: 0.004994123708456755 | |
### Evaluation Details | |
Test Loss: 0.011637951454891172 | |
Word Error Rate (WER): 0.0726561850095666 | |