ixxan commited on
Commit
f151411
1 Parent(s): 45c6612

readme model

Browse files
Files changed (2) hide show
  1. README.md +52 -0
  2. UModel_last.pth +3 -0
README.md ADDED
@@ -0,0 +1,52 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ language:
4
+ - ug
5
+ license: apache-2.0
6
+ base_model:
7
+ tags:
8
+ - generated_from_trainer
9
+ datasets:
10
+ - mozilla-foundation/common_voice_15_0
11
+ metrics:
12
+ - wer
13
+ model-index:
14
+ - name: Whisper Small Uyghur Common Voice 15
15
+ results:
16
+ - task:
17
+ name: Automatic Speech Recognition
18
+ type: automatic-speech-recognition
19
+ dataset:
20
+ name: Common Voice 15
21
+ type: mozilla-foundation/common_voice_15_0
22
+ metrics:
23
+ - name: Wer
24
+ type: wer
25
+ value: 34.99609273248242
26
+ ---
27
+
28
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
29
+ should probably proofread and complete it, then remove this comment. -->
30
+
31
+ # Uyghur Automatic Speech Recognition
32
+
33
+ Uyghur ASR using CTC loss trained with THUYG20 dataset
34
+ It achieves the following results on the evaluation set:
35
+ - Best CER: 7.21%
36
+
37
+ ## Training procedure
38
+
39
+ ### Training hyperparameters
40
+
41
+ The following hyperparameters were used during training:
42
+ - learning_rate: 0.0001
43
+ - train_batch_size: 20
44
+ - eval_batch_size: 20
45
+ - seed: 42
46
+ - optimizer: Adam with weight_decay=0.000001
47
+
48
+ ### Training results
49
+
50
+ Best CER: 7.21%
51
+ Trained: 473 epochs
52
+ The model has 26,389,282 trainable parameters
UModel_last.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:df2efe5174d097bbcf0e1d1f77c67deb44b017e3bfce8fd12281f4bf141a3031
3
+ size 105687599