DewiBrynJones
commited on
Commit
•
3d462e5
1
Parent(s):
8aaa6d9
Update README.md
Browse files
README.md
CHANGED
@@ -14,14 +14,16 @@ pipeline_tag: automatic-speech-recognition
|
|
14 |
|
15 |
# wav2vec2-xlsr-53-ft-cy-en-withlm
|
16 |
|
17 |
-
|
18 |
-
|
19 |
-
|
20 |
-
|
21 |
-
|
22 |
-
|
23 |
-
|
24 |
-
|
|
|
|
|
25 |
|
26 |
## Usage
|
27 |
|
|
|
14 |
|
15 |
# wav2vec2-xlsr-53-ft-cy-en-withlm
|
16 |
|
17 |
+
An acoustic encoder model for Welsh and English speech recognition accompanied with a n-gram language model.
|
18 |
+
The acoustic model is fine-tuned from
|
19 |
+
[facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) using transcribed
|
20 |
+
spontaneous speech from
|
21 |
+
[techiaith/banc-trawsgrifiadau-bangor (v24.01)](https://huggingface.co/datasets/techiaith/banc-trawsgrifiadau-bangor/tree/24.01) and
|
22 |
+
Welsh and English speech data derived from version 16.1 the Common Voice datasets [techiaith/commonvoice_16_1_en_cy](https://huggingface.co/datasets/techiaith/commonvoice_16_1_en_cy)
|
23 |
+
|
24 |
+
The accompanying language model is a single KenLM n-gram model trained with a balanced
|
25 |
+
collection of Welsh and English texts from [OSCAR](https://huggingface.co/datasets/oscar), thus avoiding language specific models
|
26 |
+
and language detection during CTC decoding.
|
27 |
|
28 |
## Usage
|
29 |
|