techiaith/wav2vec2-xlsr-53-ft-cy-en-withlm

Automatic Speech Recognition

DewiBrynJones committed on Aug 8

Commit 3d462e5 • 1 Parent(s): 8aaa6d9

Update README.md

Files changed (1)

README.md +10 -8

@@ -14,14 +14,16 @@ pipeline_tag: automatic-speech-recognition
 # wav2vec2-xlsr-53-ft-cy-en-withlm
-This model is a version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53)
-that has been fined-tuned with a custom bilingual datasets derived from the Welsh
-and English data releases of Mozilla Foundation's Commonvoice project. See : [techiaith/commonvoice_16_1_en_cy](https://huggingface.co/datasets/techiaith/commonvoice_16_1_en_cy).
-In addition, this model also includes a single KenLM n-gram model trained with balanced
-collections of Welsh and English texts from [OSCAR](https://huggingface.co/datasets/oscar)
-This avoids the need for any language detection for determining whether to use a Welsh or English n-gram models during CTC decoding.
+An acoustic encoder model for Welsh and English speech recognition, accompanied by an n-gram language model.
+The acoustic model is fine-tuned from
+[facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) using transcribed
+spontaneous speech from
+[techiaith/banc-trawsgrifiadau-bangor (v24.01)](https://huggingface.co/datasets/techiaith/banc-trawsgrifiadau-bangor/tree/24.01) and
+Welsh and English speech data derived from version 16.1 of the Common Voice datasets, [techiaith/commonvoice_16_1_en_cy](https://huggingface.co/datasets/techiaith/commonvoice_16_1_en_cy).
+The accompanying language model is a single KenLM n-gram model trained with a balanced
+collection of Welsh and English texts from [OSCAR](https://huggingface.co/datasets/oscar), thus avoiding language-specific models
+and language detection during CTC decoding.
 ## Usage