Automatic Speech Recognition
Transformers
Safetensors
Welsh
English
wav2vec2
Inference Endpoints
DewiBrynJones commited on
Commit
3d462e5
1 Parent(s): 8aaa6d9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -8
README.md CHANGED
@@ -14,14 +14,16 @@ pipeline_tag: automatic-speech-recognition
14
 
15
  # wav2vec2-xlsr-53-ft-cy-en-withlm
16
 
17
- This model is a version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53)
18
- that has been fined-tuned with a custom bilingual datasets derived from the Welsh
19
- and English data releases of Mozilla Foundation's Commonvoice project. See : [techiaith/commonvoice_16_1_en_cy](https://huggingface.co/datasets/techiaith/commonvoice_16_1_en_cy).
20
-
21
- In addition, this model also includes a single KenLM n-gram model trained with balanced
22
- collections of Welsh and English texts from [OSCAR](https://huggingface.co/datasets/oscar)
23
- This avoids the need for any language detection for determining whether to use a Welsh or English n-gram models during CTC decoding.
24
-
 
 
25
 
26
  ## Usage
27
 
 
14
 
15
  # wav2vec2-xlsr-53-ft-cy-en-withlm
16
 
17
+ An acoustic encoder model for Welsh and English speech recognition accompanied with a n-gram language model.
18
+ The acoustic model is fine-tuned from
19
+ [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) using transcribed
20
+ spontaneous speech from
21
+ [techiaith/banc-trawsgrifiadau-bangor (v24.01)](https://huggingface.co/datasets/techiaith/banc-trawsgrifiadau-bangor/tree/24.01) and
22
+ Welsh and English speech data derived from version 16.1 the Common Voice datasets [techiaith/commonvoice_16_1_en_cy](https://huggingface.co/datasets/techiaith/commonvoice_16_1_en_cy)
23
+
24
+ The accompanying language model is a single KenLM n-gram model trained with a balanced
25
+ collection of Welsh and English texts from [OSCAR](https://huggingface.co/datasets/oscar), thus avoiding language specific models
26
+ and language detection during CTC decoding.
27
 
28
  ## Usage
29